Once a basic docsite RAG service is done, design a mechanism to update the embeddings at appropriate intervals. Ideally, these would update only when the docs are updated, and only the parts that are updated would be regenerated, but this may be impractical.
Yes - I think the solution here is that we do something like POST services/embeddings/docs, with credential/database data in the payload, which triggers the server to run the docs loader and update the database.
We can then trigger this from a GitHub Action whenever docs are merged to main.
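To make the proposal concrete, here is a minimal sketch of the call a CI job could make after a merge. Only the `services/embeddings/docs` path comes from the comment above. The base URL, the payload field names, and the helper names `build_update_request` and `trigger_update` are assumptions for illustration, not the service's actual API.

```python
import json
import urllib.request


def build_update_request(base_url: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) the POST that asks the server to re-run the docs loader."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        # Path taken from the proposal; base_url is whatever host runs the service.
        url=f"{base_url.rstrip('/')}/services/embeddings/docs",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def trigger_update(base_url: str, payload: dict, timeout: float = 600.0) -> int:
    """Send the request and return the HTTP status code.

    The long timeout is a guess: it is not yet known how long a rebuild takes.
    """
    request = build_update_request(base_url, payload)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status
```

A workflow step that runs on pushes to main could call `trigger_update` with credentials read from repository secrets. The payload keys (for example a hypothetical `db_api_key`) would need to match whatever the server ends up expecting.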
I still want to think about what happens while the update is running. Do we drop the prod vector DB to update it? How long does the update take? How many RAG requests will fail because the database is updating?
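One possible answer to the "do we drop the prod DB" question, sketched under assumptions: build the new embeddings into a separate collection and switch a pointer only once the build succeeds, so queries never see an empty or half-built index. The in-memory `InMemoryVectorStore` below is a hypothetical stand-in. Whether the real vector DB supports an equivalent alias or namespace switch would need checking.

```python
from typing import Callable, Dict, List, Optional


class InMemoryVectorStore:
    """Stand-in for a real vector DB: named collections plus a 'live' pointer."""

    def __init__(self) -> None:
        self.collections: Dict[str, List[str]] = {}
        self.live: Optional[str] = None

    def search(self) -> List[str]:
        # A real store would run a similarity query; here we return the live rows.
        return self.collections[self.live] if self.live else []


def rebuild_and_swap(
    store: InMemoryVectorStore,
    new_name: str,
    load_docs: Callable[[], List[str]],
) -> None:
    """Build a replacement collection on the side, then switch readers to it."""
    # If loading fails, the exception propagates and nothing live has changed.
    rows = load_docs()
    store.collections[new_name] = rows
    previous = store.live
    store.live = new_name  # single pointer switch; readers now see the new data
    if previous is not None and previous != new_name:
        del store.collections[previous]  # drop the old copy only after the switch
```

With this shape, RAG requests during an update keep hitting the old collection, and a failed rebuild leaves prod untouched. The cost is holding two copies briefly, which seems acceptable for a small dataset.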
I'm not sure how to incrementally update parts of a live table, though, because the docs are chunked roughly by character length, and handling different types of changes might get complicated. It's a small dataset, so regenerating everything should be OK.
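A small sketch of why partial updates are awkward with length-based chunking. It assumes plain fixed-size character chunks and compares chunk hashes before and after an edit. The real loader's chunking may differ, and the function names here are hypothetical.

```python
import hashlib
from typing import List, Set


def chunk_by_length(text: str, size: int) -> List[str]:
    """Split text into consecutive chunks of at most `size` characters."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def digest(chunk: str) -> str:
    """Content hash used to recognise a chunk that was already embedded."""
    return hashlib.sha256(chunk.encode("utf-8")).hexdigest()


def chunks_needing_embedding(old_text: str, new_text: str, size: int) -> List[str]:
    """Return the chunks of new_text whose content was not present in old_text."""
    known: Set[str] = {digest(c) for c in chunk_by_length(old_text, size)}
    return [c for c in chunk_by_length(new_text, size) if digest(c) not in known]


# Synthetic doc: 50 distinct lines of 9 characters each.
doc = "".join(f"line {i:03d}\n" for i in range(50))

# Appending leaves every earlier chunk boundary where it was.
after_append = chunks_needing_embedding(doc, doc + "tail", 40)

# Inserting one character at the start shifts every boundary after it.
after_prefix = chunks_needing_embedding(doc, "X" + doc, 40)
```

In this example, `after_append` contains only the final chunk, while `after_prefix` contains every chunk of the new text, so an edit near the top of a page costs as much as a full regeneration. That supports simply regenerating everything while the dataset is small.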