fix: Fixing tool prompt format #196
base: main
Conversation
Signed-off-by: Daniele Martinoli <[email protected]>
@@ -58,7 +58,7 @@ def run_main(host: str, port: int, disable_safety: bool = False):
         },
         toolgroups=["builtin::rag"],
         tool_choice="auto",
-        tool_prompt_format="json",
+        tool_prompt_format="python_list",
You should be able to just remove this since we added meta-llama/llama-stack#1214
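For reference, a minimal sketch of what that would look like, assuming an AgentConfig-style setup as in the diff above (the model and instructions fields are illustrative, not from this PR):

# after meta-llama/llama-stack#1214 the server infers the tool prompt format
# from the model, so the argument can simply be dropped
agent_config = AgentConfig(
    model=model_id,                  # illustrative; as in the enclosing example
    instructions="You are a helpful assistant.",  # illustrative
    toolgroups=["builtin::rag"],
    tool_choice="auto",
    # tool_prompt_format intentionally omitted: a model-appropriate
    # default is selected server-side
)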
I see, thank you! I will update it ASAP.
BTW: is there any way to be informed of ongoing/planned changes before pushing PRs that are already out of date? 😬
I don't know if there's a better way, but I just look at open PRs periodically and check GitHub notifications.
BTW, I do plan to do a full clean-up of tool_prompt_format as well.
BTW: I tried removing the prompt format, but resolve_model fails to resolve the model, so the default prompt format is still json 😞
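If I read the behavior right, the fallback is roughly this (a sketch of the assumed logic, not the actual llama-stack code; recommended_format_for is a hypothetical helper):

# assumed shape of the server-side selection (illustrative only)
model = resolve_model(request_model)      # returns None for "llama3.2:1b"
if tool_prompt_format is not None:
    fmt = tool_prompt_format              # an explicit setting wins
elif model is not None:
    fmt = recommended_format_for(model)   # hypothetical helper
else:
    fmt = "json"                          # hence the json default I'm seeing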
These are the models I'm using:
% llama-stack-client models list
Available Models
┏━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━┓
┃ model_type ┃ identifier ┃ provider_resource_id ┃ metadata ┃ provider_id ┃
┡━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━┩
│ embedding │ all-MiniLM-L6-v2 │ all-minilm:latest │ {'embedding_dimension': │ ollama │
│ │ │ │ 384.0} │ │
├────────────┼──────────────────┼──────────────────────┼──────────────────────────┼─────────────┤
│ llm │ llama3.2:1b │ llama3.2:1b │ │ ollama │
└────────────┴──────────────────┴──────────────────────┴──────────────────────────┴─────────────┘
Total models: 2
Changing the check to compare lower-cased strings seems to work, but I'm not sure whether it might break something else:

def resolve_model(descriptor: str):
    for m in all_registered_models():
        # compare descriptors case-insensitively; huggingface_repo is matched as-is
        if descriptor.lower() in (m.descriptor().lower(), m.huggingface_repo):
            return m
    return None
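For context, a quick demonstration of why the exact match misses (the mixed-case descriptor is an assumption about the registry contents):

# hypothetical registry descriptor vs. the identifier the client sends
registered = "Llama3.2-1B"   # assumed mixed-case registry descriptor
requested = "llama3.2-1b"    # lower-cased identifier from the client
print(requested == registered)                  # False: exact match misses
print(requested.lower() == registered.lower())  # True: case-insensitive hit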
OK, let's get this in first.
@dmartinol where are you getting llama3.2:1b? It seems to be in the wrong format.
ollama run llama3.2:1b, is that wrong?
Anyhow, meta-llama/llama-stack#1360 should fix this.
I will try again and clear the setting.
What does this PR do?
Closes #195
Feature/Issue validation/testing/test plan
Start the llama-stack server, then:
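A plausible way to exercise the change (script path, host, and port are illustrative; adjust to your setup):

# start an ollama-backed llama-stack distribution
llama stack run ollama
# in another terminal, run the updated example against it
python <path-to-the-updated-example>.py localhost 8321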
Thanks for contributing 🎉!