fix: add ollama embedding config and fix sqlite_vec db #1255

wukaixingxp · 2025-02-25T19:31:11Z

What does this PR do?

RAG with Ollama is not working; I hit the error: Embeddings are now served via Inference providers. Taking a closer look, it seems that the embedding model YAML config was removed here. This PR adds the config back so users can use the embedding model after they run ollama run all-minilm:latest.

sqlite_vec also raises an error: SQLite objects created in a thread can only be used in that same thread. The object was created in thread id 8349485120 and this is thread id 6325039104. I think we need to use self.connection = sqlite3.connect(self.config.db_path, check_same_thread=False) instead in:

llama-stack/llama_stack/providers/inline/vector_io/sqlite_vec/sqlite_vec.py

Line 165 in 1a044ef

self.connection = sqlite3.connect(self.config.db_path)
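
For reference, a minimal standalone sketch of the threading error quoted above, using only the Python standard library. The helper name use_connection_from_worker and the in-memory database are illustrative assumptions, not code from llama-stack.

```python
import sqlite3
import threading


def use_connection_from_worker(conn: sqlite3.Connection) -> str:
    """Run a trivial query on `conn` from a new thread and report the outcome."""
    outcome = []

    def worker() -> None:
        try:
            conn.execute("SELECT 1").fetchone()
            outcome.append("ok")
        except sqlite3.ProgrammingError as exc:
            # Raised when a connection is used outside its creating thread.
            outcome.append(f"error: {exc}")

    thread = threading.Thread(target=worker)
    thread.start()
    thread.join()
    return outcome[0]


# Default behaviour: the connection is pinned to the thread that created it.
strict_conn = sqlite3.connect(":memory:")
strict_result = use_connection_from_worker(strict_conn)
strict_conn.close()

# With check_same_thread=False the ownership check is skipped.
shared_conn = sqlite3.connect(":memory:", check_same_thread=False)
shared_result = use_connection_from_worker(shared_conn)
shared_conn.close()

print(strict_result)
print(shared_result)
```

Note that check_same_thread=False only disables the ownership check; callers are then responsible for serializing writes to the shared connection.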

Test Plan

Tested with my DocQA app

llama_stack/templates/ollama/run.yaml

llama_stack/providers/inline/vector_io/sqlite_vec/sqlite_vec.py

ashwinb

see comment inline

wukaixingxp · 2025-02-25T23:11:58Z

see comment inline

Thanks for your help! I ran pre-commit locally and everything passed, so I'm not sure why pre-commit failed in CI.

pre-commit run --all-files
check for merge conflicts................................................Passed
check for added large files..............................................Passed
fix end of files.........................................................Passed
Insert license in comments...............................................Passed
ruff.....................................................................Passed
ruff-format..............................................................Passed
blacken-docs.............................................................Passed
uv-export................................................................Passed
mypy.....................................................................Passed
Distribution Template Codegen............................................Passed

wukaixingxp · 2025-02-26T22:55:56Z

@ashwinb Can you take another look at this PR? The pre-commit check is somehow blocking it, and I am not sure how to resolve it.

ashwinb · 2025-02-26T23:22:04Z

llama_stack/providers/inline/vector_io/sqlite_vec/sqlite_vec.py

@@ -162,7 +162,7 @@ def __init__(self, config, inference_api: Api.inference) -> None:

    async def initialize(self) -> None:
        # Open a connection to the SQLite database (the file is specified in the config).
-        self.connection = sqlite3.connect(self.config.db_path)
+        self.connection = sqlite3.connect(self.config.db_path, check_same_thread=False)


Actually, I think the correct fix needs to be what @ehhuang did in 270d640.

ashwinb · 2025-02-26T23:23:43Z

You need to fetch and rebase to latest origin/main to get rid of the pre-commit error. But I think the threading error needs to be taken care of in a better way (as @ehhuang's commit shows.)

wukaixingxp · 2025-02-26T23:49:12Z

You need to fetch and rebase to latest origin/main to get rid of the pre-commit error. But I think the threading error needs to be taken care of in a better way (as @ehhuang's commit shows.)

@ashwinb Thanks for the guidance, I followed his example and tested with my RAG app. Can you take another look?

ashwinb · 2025-02-27T03:04:22Z

llama_stack/providers/inline/vector_io/sqlite_vec/sqlite_vec.py


    async def initialize(self) -> None:
        # Open a connection to the SQLite database (the file is specified in the config).
-        self.connection = sqlite3.connect(self.config.db_path)
+        self.connection = self._get_connection()


So this isn't quite right. You cannot assign whatever value you get into an instance variable. You must call conn = self._get_connection() anywhere you need to use a connection.
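
A rough sketch of the pattern being asked for, assuming _get_connection() hands back a per-thread connection (as the later commit title "fix sqlite_vec by using local thread" suggests). The class ThreadLocalStore, the chunks table, and the temp-file path are hypothetical and only illustrate the idea; this is not the actual sqlite_vec provider code.

```python
import os
import sqlite3
import tempfile
import threading


class ThreadLocalStore:
    """Hypothetical store that opens one SQLite connection per thread."""

    def __init__(self, db_path: str) -> None:
        self.db_path = db_path
        self._local = threading.local()  # each thread sees its own attributes

    def _get_connection(self) -> sqlite3.Connection:
        # Lazily open a connection for the calling thread and cache it there.
        conn = getattr(self._local, "conn", None)
        if conn is None:
            conn = sqlite3.connect(self.db_path)
            self._local.conn = conn
        return conn

    def initialize(self) -> None:
        # Fetch the connection at the point of use; never store it on self.
        conn = self._get_connection()
        conn.execute(
            "CREATE TABLE IF NOT EXISTS chunks (id INTEGER PRIMARY KEY, body TEXT)"
        )
        conn.commit()

    def add_chunk(self, body: str) -> None:
        conn = self._get_connection()
        conn.execute("INSERT INTO chunks (body) VALUES (?)", (body,))
        conn.commit()

    def count_chunks(self) -> int:
        conn = self._get_connection()
        return conn.execute("SELECT COUNT(*) FROM chunks").fetchone()[0]


# A file-backed database is used so that every thread sees the same data.
db_path = os.path.join(tempfile.mkdtemp(), "demo.db")
store = ThreadLocalStore(db_path)
store.initialize()

# The worker thread gets its own connection, so no cross-thread error occurs.
worker = threading.Thread(target=store.add_chunk, args=("hello",))
worker.start()
worker.join()

total = store.count_chunks()
print(total)
```

Storing the result of _get_connection() in self.connection would pin the instance to whichever thread ran initialize(), which brings the original error back as soon as another thread touches it.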

add ollama embedding config and fix sqlite_vec db

ed2bd60

wukaixingxp self-assigned this Feb 25, 2025

wukaixingxp requested review from ashwinb, yanxi0830, hardikjshah, dltn, raghotham, dineshyv, vladimirivic, sixianyi0721, ehhuang and terrytangyuan as code owners February 25, 2025 19:31

facebook-github-bot added the CLA Signed label Feb 25, 2025

ashwinb reviewed Feb 25, 2025

llama_stack/templates/ollama/run.yaml (resolved)

ashwinb reviewed Feb 25, 2025

llama_stack/providers/inline/vector_io/sqlite_vec/sqlite_vec.py (outdated, resolved)

pre-commit

733b9c0

ashwinb requested changes Feb 25, 2025

fix ollama.py bug

32e8919

wukaixingxp requested a review from ashwinb February 25, 2025 23:12

wukaixingxp changed the title ~~add ollama embedding config and fix sqlite_vec db~~ fix:add ollama embedding config and fix sqlite_vec db Feb 25, 2025

wukaixingxp mentioned this pull request Feb 25, 2025

Make DocQA a one-clickable app implementation meta-llama/llama-stack-apps#151

Merged

5 tasks

ashwinb changed the title ~~fix:add ollama embedding config and fix sqlite_vec db~~ fix: add ollama embedding config and fix sqlite_vec db Feb 26, 2025

ashwinb reviewed Feb 26, 2025

wukaixingxp and others added 2 commits February 26, 2025 15:26

Merge branch 'meta-llama:main' into fix-ollama-rag

f42dc48

fix sqlite_vec by using local thread

6ff7ea1

wukaixingxp requested a review from ashwinb February 26, 2025 23:47

ashwinb reviewed Feb 27, 2025
