Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Docsite search: Set update mechanism #173

Open
hanna-paasivirta opened this issue Feb 17, 2025 · 2 comments
Open

Docsite search: Set update mechanism #173

hanna-paasivirta opened this issue Feb 17, 2025 · 2 comments
Assignees

Comments

@hanna-paasivirta
Copy link
Contributor

Once a basic docsite RAG service is done, design a mechanism to update the embeddings at appropriate intervals. Ideally, these would update only when the docs are updated, and only the parts that are updated would be regenerated, but this may be impractical.

@hanna-paasivirta hanna-paasivirta self-assigned this Feb 17, 2025
@josephjclark
Copy link
Collaborator

Yes - I think the solution here is that we do something like POST services/embeddings/docs, with credential/database data in the payload, which triggers the server to run the docs loader and update the database.

We can then trigger this from a github action whenever docs are merged to main.

I still want to think about what happens while the update is running. Do we drop the prod vector DB to update it? How long does the update take? How many RAG requests will fail because the database is updating?

@hanna-paasivirta
Copy link
Contributor Author

More ideas from Joe here: #176 (comment)

I'm not sure how to increment parts of an online table though because it's chunked roughly by character length, and different types of changes might get complicated. It's a small dataset so regenerating should be ok.

@hanna-paasivirta hanna-paasivirta mentioned this issue Mar 6, 2025
7 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants