Search Autocomplete System

Implemented a scalable search autocomplete system optimized for suggesting top five queries to users in constant time.

System Overview

The search autocomplete system is powered by a Trie (prefix tree) data structure, complemented by an integrated caching mechanism. This caching feature efficiently stores and retrieves the top five most relevant query suggestions in response to user input, thereby substantially improving query suggestion response times. Additionally, user-submitted queries are aggregated and incorporated into the pool of suggestions on a weekly basis, ensuring a dynamic system that continuously provides relevant query suggestions.

System Design

Developed the application with system design principles in mind, ensuring a reliable system that is able to scale as user traffic increases.

Query Gathering Service

User queries undergo logging and preservation within log files. Prior to preservation, each query request is intercepted by a custom filter, ensuring input validation measures are met.

Query Aggregation Service

User submitted queries are aggregated to permanent storage once per week through the use of batch processing. The pool of available suggestions is updated every week, enabling dynamic query suggestions based on user searches. Furthermore, weekly updates stop the constant rebuilding of the Trie structure used to generate query suggestions, resulting in faster query suggestions.

Trie (Prefix Tree) Structure

The underlying data structure behind the search autocomplete system is the Trie (prefix tree) data structure. Its operations allow for fast insertion and retrieval of queries with a common prefix, effectively satisfying the requirements of the search autocomplete system. Query suggestions response time is further improved with the addition of a custom cache mechanism that stores the top five most promising query suggestions.

Trie Structure Without Caching Mechanism

Time complexity: O(p) + O(c) + O(clogc)

Find the query prefix: O(p)
Traverse the subtree from the prefix node to get all valid children: O(c)
Sort the children and get top k queries: O(clogc)

Trie Structure With Caching Mechanism

Time complexity: O(1)

Find query prefix: O(1)
- Users rarely search long queries, therefore limiting its length helps improve time complexity.
The cache mechanism rules out the need to traverse the subtree from the prefix node: O(1)
The caching mechanism already contains the top k queries with the given prefix, therefore no sorting is needed: O(1)

Technologies

Java
Spring Boot
Batch Processing
MongoDB
Redis Cache
React JS
REST API Rate Limiting

Local Project Setup

Mandatory Requirements

Java Version 17 or higher
Docker Engine and Docker Compose
Node JS

Optional Requirements

Gradle version >= 8.5 (the project includes the Gradle Wrapper to run the project)

Running The Project Locally

Mke sure Docker is running.
Start required infrastructure with docker-compose up.
Run the application ./gradlew bootRun -Dspring.profiles.active=dev.
Run to the application web client with npm run dev in the "search-autocomplete-client" directory.
Open your browser and navigate to http://localhost:5173/ to interact with the application.

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
assets		assets
gradle/wrapper		gradle/wrapper
scripts		scripts
search-autocomplete-client		search-autocomplete-client
src		src
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
app-demo.gif		app-demo.gif
build.gradle.kts		build.gradle.kts
docker-compose.yml		docker-compose.yml
gradlew		gradlew
gradlew.bat		gradlew.bat
mongo-queries.js		mongo-queries.js
settings.gradle.kts		settings.gradle.kts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Search Autocomplete System

System Overview

System Design

Query Gathering Service

Query Aggregation Service

Trie (Prefix Tree) Structure

Trie Structure Without Caching Mechanism

Trie Structure With Caching Mechanism

Technologies

Local Project Setup

Mandatory Requirements

Optional Requirements

Running The Project Locally

About

Releases

Packages

Languages

Al3zama1/search-autocomplete

Folders and files

Latest commit

History

Repository files navigation

Search Autocomplete System

System Overview

System Design

Query Gathering Service

Query Aggregation Service

Trie (Prefix Tree) Structure

Trie Structure Without Caching Mechanism

Trie Structure With Caching Mechanism

Technologies

Local Project Setup

Mandatory Requirements

Optional Requirements

Running The Project Locally

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages