Docsite search: Add a new docsite search RAG #172

hanna-paasivirta · 2025-02-17T13:46:30Z

Add a new docsite search RAG to replace the experimental Search service. Use the Search service as an example, but rewrite to match the structure of our new embeddings services.

Leverage the Embeddings service and its SearchResult and VectorStore classes and add module to connect to docs embeddings
Add separate service to embed docs
Replace the Search service (doesn't seem to be in use) with a new docsite search service that can search by connecting to the Embeddings service
Use the same vector store and embeddings APIs as new services (Pinecone, OpenAI)
Analyse and optimise the chunking of the docsite and adaptor APIs
Include metadata for each document returned to present to the user

josephjclark · 2025-02-17T16:21:46Z

Can confirm that the existing search is not in use and we can freely rename it.

To consider (and let's call Elias): is the VectorStore abstraction helping at all or should we just use langchain directly?

Update: let's drop vector store and just use langchain

Structure:

embeddings/
    -docs_store.py <-- this will 
docs_search.py   <-- this is a connected service which will search the docsite and return useful document chunks (it might be very lightweight)

hanna-paasivirta self-assigned this Feb 17, 2025

hanna-paasivirta changed the title ~~Add a new docsite search RAG~~ Docsite search: Add a new docsite search RAG Feb 17, 2025

hanna-paasivirta linked a pull request Feb 21, 2025 that will close this issue

Docsite rag #176

Open

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Docsite search: Add a new docsite search RAG #172

Docsite search: Add a new docsite search RAG #172

hanna-paasivirta commented Feb 17, 2025 •

edited

Loading

josephjclark commented Feb 17, 2025 •

edited

Loading

Docsite search: Add a new docsite search RAG #172

Docsite search: Add a new docsite search RAG #172

Comments

hanna-paasivirta commented Feb 17, 2025 • edited Loading

josephjclark commented Feb 17, 2025 • edited Loading

hanna-paasivirta commented Feb 17, 2025 •

edited

Loading

josephjclark commented Feb 17, 2025 •

edited

Loading