
Don't attempt batching with InstructLab's llama-cpp-python #358

Merged: 1 commit merged into instructlab:main from no-llama-assertion-error on Nov 11, 2024

Conversation

@bbrowning (Contributor)

Try harder to check for InstructLab's default llama-cpp-python during our batching check so that we can avoid throwing assertion errors on the server when we probe for batching support.

See instructlab/instructlab#1748 for user reports of this issue.


Signed-off-by: Ben Browning <[email protected]>
@mergify mergify bot added testing Relates to testing ci-failure labels Nov 10, 2024
@bbrowning (Contributor, Author)

There doesn't appear to be a robust way to identify llama-cpp-python as an HTTP client. However, InstructLab's most common happy path has users generating data against a llama-cpp-python server managed by InstructLab, and we CAN reliably identify that. We do so by looking for the API root welcome message and disabling batching without ever sending the batching request probe. That probe is what triggers the assertion error in llama-cpp-python, so this change avoids those errors for anyone using ilab model serve to run their model.
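The detection idea described above can be sketched as follows. This is not the actual SDG code: `LLAMA_CPP_MARKER` is a hypothetical placeholder for the welcome-message text, and `probe_batching` stands in for the real batching probe.

```python
import urllib.request
import urllib.error

# Hypothetical marker text; the real welcome message served by
# InstructLab's default llama-cpp-python server may differ.
LLAMA_CPP_MARKER = "llama.cpp"

def looks_like_llama_cpp(root_body: str) -> bool:
    """Heuristic check: does the API root response look like
    InstructLab's default llama-cpp-python server?"""
    return LLAMA_CPP_MARKER in root_body

def supports_batching(api_base: str) -> bool:
    """Return False without probing when the server appears to be
    llama-cpp-python, since the batching probe would trigger a
    server-side assertion error there."""
    try:
        with urllib.request.urlopen(api_base, timeout=5) as resp:
            body = resp.read().decode("utf-8", errors="replace")
    except (urllib.error.URLError, OSError):
        body = ""
    if looks_like_llama_cpp(body):
        return False  # skip the batching probe entirely
    return probe_batching(api_base)

def probe_batching(api_base: str) -> bool:
    # Placeholder for the real probe: attempt a small batched
    # completion request and report whether the server accepts it.
    return True
```

The key design point is ordering: the cheap, side-effect-free root check runs before the probe, so servers that would assert on a batched request are never sent one.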

@mergify mergify bot added ci-failure and removed ci-failure labels Nov 10, 2024
@bbrowning bbrowning requested a review from a team November 10, 2024 02:20
@mergify mergify bot removed the ci-failure label Nov 10, 2024
@bbrowning (Contributor, Author)

We get lots of user reports, via Slack and in this and other repos, about the assertion error with llama-cpp-python. This change gets rid of it and makes for a much cleaner initial user experience. Otherwise, users often conclude that data generation is broken.

@aakankshaduggal (Member) left a comment
Thanks @bbrowning, lgtm!

@mergify mergify bot merged commit 119d309 into instructlab:main Nov 11, 2024
22 checks passed
@mergify mergify bot removed the one-approval label Nov 11, 2024
@bbrowning bbrowning deleted the no-llama-assertion-error branch November 11, 2024 15:58