Discover engine powered by AI 🚀
-
Indexing:
- Index message text in Telegram history export (JSON).
- Index scraped links from content found in messages.
-
Categorization:
- Organize indexed data into relevant categories for better search results.
-
Fuzzy Search and Typo Tolerance:
- Implement a customizable fuzzy search algorithm for typo-tolerant search capabilities.
-
Bot Interface:
- Develop a Telegram bot as an interface to facilitate user queries.
- Display search results for entered queries, utilizing the indexed data.
-
Retrival Improvement:
- Enhance the recall mechanism to improve the accuracy and relevance of search results.
-
Encourage Reviewing:
- Implement features to encourage users to review and provide feedback on search results.
-
Web Integration:
- Integrate the search engine with a web interface for broader accessibility.
-
Discover New Connections:
- Explore and implement methods to discover new connections within the indexed data.
-
Timeline Overview:
- Provide a timeline overview feature for users to navigate through historical data more efficiently.
-
Learning Tracking:
- Incorporate a learning tracking system to monitor and analyze user interactions for continuous improvement.
Ultimately, the goal is to make sure all the information you consume (your input) can lead to increased productivity and creativity (your output) instead of festering and getting forgotten in your mind backyard
- embedding model - bge-small-en-v1.5
- telegram library - pyrogram
- similarity search - usearch
- image model - uform
- json serialization - orjson