ADR: Vector store for RAG #168

anastasds · 2024-12-19T20:33:34Z

dmartinol · 2024-12-19T21:25:14Z

What about moving adrs/ to the root folder, along with the README and the template files, and having a rag/ subfolder?
Apart from that, LGTM.

anastasds · 2024-12-19T21:27:04Z

That would probably need org-wide approval and we don't have that. I'll bring it up with the wider team next year to advocate for adoption of the format.

jwm4

I like the decomposition into Context, Decision, Status, and Consequences. It seems much smoother and more natural than the approach used in #164 which feels awkward and cumbersome to me. I have some minor change requests here, but mostly I like the direction of both the ADR template and the proposed Milvus ADR.

docs/rag/adrs/template.md

docs/rag/adrs/adr-001-vectordb.md

.gitignore

anastasds · 2024-12-20T04:06:49Z

I like the decomposition into Context, Decision, Status, and Consequences. It seems much smoother and more natural than the approach used in #164 which feels awkward and cumbersome to me.

I'm glad the approach resonates. Within reason, people should take priority over processes and not the other way around. This is a core tenet of the Agile Manifesto which, for better or for worse, has been a solid philosophical framework for ways of working for software teams since 2001:

Individuals and interactions over processes and tools

https://agilemanifesto.org/

anastasds · 2024-12-20T04:08:52Z

docs/rag/adrs/adr-vectordb.md

+
+## Context
+
+One of the first choices to make in implementing RAG is to choose an initial vector store to develop against. Though the usage of frameworks like LangChain or Haystack make it easy to swap vector databases, we need a working end to end implementation for RAG that is tested against and available to install with InstructLab. There are many options (see [here](https://docs.haystack.deepset.ai/docs/choosing-a-document-store)). 


How do I disable the below linting bot comment? It seems to me that this is a useless nitpick for markdown, which will be rendered and in which whitespace does not matter. @nathan-weinberg maybe you can help here?

jwm4 · 2024-12-20T13:59:49Z

I would also be fine with shutting off the linting, which I also find kind of tedious. However, I don't feel strongly. The markdownlint plugin in VSCode has made it a lot easier for me to comply with the linting in my ADRs since I can see the issues before submitting.

nathan-weinberg · 2025-01-09

You can run make md-lint locally to catch these issues before pushing, so you don't need to wait for the GitHub CI to run.
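The Context paragraph in the diff above says that frameworks such as LangChain or Haystack make it easy to swap vector databases. As a rough illustration of why that holds, here is a minimal sketch, assuming a hypothetical VectorStore interface. The names VectorStore, InMemoryStore, and retrieve are invented for this example and are not the LangChain, Haystack, or Milvus APIs.

```python
from dataclasses import dataclass, field
from math import sqrt
from typing import Protocol


class VectorStore(Protocol):
    """Hypothetical minimal interface (an assumption, not a real framework API)."""

    def add(self, doc_id: str, embedding: list[float]) -> None: ...

    def search(self, query: list[float], top_k: int) -> list[str]: ...


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


@dataclass
class InMemoryStore:
    """Toy stand-in for a real backend; keeps embeddings in a dict."""

    vectors: dict[str, list[float]] = field(default_factory=dict)

    def add(self, doc_id: str, embedding: list[float]) -> None:
        self.vectors[doc_id] = embedding

    def search(self, query: list[float], top_k: int) -> list[str]:
        # Rank stored document ids by similarity to the query, best first.
        ranked = sorted(
            self.vectors,
            key=lambda doc_id: cosine(query, self.vectors[doc_id]),
            reverse=True,
        )
        return ranked[:top_k]


def retrieve(store: VectorStore, query: list[float], top_k: int = 2) -> list[str]:
    """Retrieval code depends only on the interface, not on a concrete store."""
    return store.search(query, top_k)
```

Because retrieve only relies on add and search, any backend wrapped to expose those two methods could be passed in without changing the calling code. That is the sense in which the initial choice of store is an implementation detail that still needs one tested default.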

jwm4

I've added a couple more comments. Also, I do think there is a lot of controversy around the Milvus dependencies specifically, but I am OK with waiting to see if someone else wants to challenge this decision on those grounds.

I see that this PR will need an update to the spellcheck dictionary and a couple of spelling corrections to pass the spell check. Also, I see that it is not passing DCO. I tried using the DCO guidance to bring my last dev-doc into compliance with DCO and it didn't work, but I do think it can be done with enough GitHub expertise (just more than I have). So I don't think it is ready to merge. However, the only open item from me is the title line of the template.
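For readers unfamiliar with the DCO check mentioned above: it looks for a Signed-off-by trailer in each commit message, which git commit -s adds. The following is a minimal sketch of that idea, assuming a simplified rule that only compares the trailer's email with the commit author's email. The function name has_dco_signoff and the example addresses are hypothetical, and the real check used in CI may apply stricter rules.

```python
import re

# Matches one trailer line of the form: Signed-off-by: Name <email>
SIGNOFF_RE = re.compile(
    r"^Signed-off-by: (?P<name>.+) <(?P<email>[^<>]+)>$",
    re.MULTILINE,
)


def has_dco_signoff(message: str, author_email: str) -> bool:
    """Return True if any Signed-off-by trailer carries the author's email.

    Simplified assumption: only the email is compared, case-insensitively.
    """
    wanted = author_email.lower()
    return any(
        match.group("email").lower() == wanted
        for match in SIGNOFF_RE.finditer(message)
    )
```

A commit message ending in "Signed-off-by: Jane Dev <jane@example.com>" would pass for the author jane@example.com and fail for any other author, which is typically why commits made without the -s flag need to be amended or rebased with a sign-off before the check turns green.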

docs/rag/adrs/template.md

jwm4 · 2024-12-20T13:59:49Z

docs/rag/adrs/adr-vectordb.md

I would also be fine with shutting off the linting, which I also find kind of tedious. However, I don't feel strongly. The markdownlint plugin in VSCode has made it a lot easier for me to comply with the linting in my ADRs since I can see the issues before submitting.

jwm4

The issues I raised earlier are all resolved now, so I am happy with this in its current form.

Signed-off-by: Anastas Stoyanovsky <[email protected]>

cdoern

This all looks really good to me. I really appreciate the detail you put in here about ADRs in general.

anastasds · 2025-01-09T20:26:20Z

@cdoern @nathan-weinberg thanks both! I don't have a merge button, so, if you would be so kind, that would get the template and guidance merged, and a subsequent ADR will record any changes regarding vector stores based on situational / informational changes.

Signed-off-by: Anastas Stoyanovsky <[email protected]>

jwm4 suggested changes Dec 19, 2024

docs/rag/adrs/template.md (outdated)

docs/rag/adrs/adr-001-vectordb.md (outdated)

.gitignore

anastasds changed the title from "ADR 001: Vector store for RAG" to "ADR: Vector store for RAG" on Dec 20, 2024

anastasds commented Dec 20, 2024

jwm4 suggested changes Dec 20, 2024

jwm4 mentioned this pull request Dec 20, 2024

ADR for IBM Granite Embeddings #169

Merged

jwm4 approved these changes Jan 3, 2025

jwm4 mentioned this pull request Jan 6, 2025

ADR for PaRAGon dependency #175

Closed

Introduce ADR format for RAG decisions

db834ca

Signed-off-by: Anastas Stoyanovsky <[email protected]>

anastasds force-pushed the adr branch from e1a53fa to 0b566f6 on January 9, 2025 15:39

cdoern approved these changes Jan 9, 2025

nathan-weinberg approved these changes Jan 9, 2025

ADR 001: Vector store for RAG

8f2ae56

Signed-off-by: Anastas Stoyanovsky <[email protected]>

anastasds force-pushed the adr branch from b31c970 to 8f2ae56 on January 9, 2025 20:41

nathan-weinberg merged commit c71e488 into instructlab:main Jan 9, 2025
4 checks passed
