
fix(ai): response stream chunking de-buffering #14079

Merged: 11 commits into master from fix/ai_streaming_chunking, Jan 9, 2025

Conversation

@oowl (Member) commented Jan 3, 2025

Summary

Fixes an issue where AI streaming responses were delivered to the client in one buffered chunk at the end of the stream, instead of being forwarded chunk-by-chunk as they arrived.
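
The patch itself isn't shown in this thread; as a rough illustration of the bug class only, here is a minimal Lua sketch (hypothetical names, not Kong's actual filter API) contrasting a filter that buffers until end-of-stream with one that forwards each chunk as it arrives:

```lua
-- Hypothetical sketch of the bug class, not the PR's code.

-- Buggy shape: accumulate every chunk and emit only at end-of-stream,
-- so the client receives the whole SSE response as a single chunk.
local function buffered_filter(state, chunk, eof)
  state.buf = (state.buf or "") .. (chunk or "")
  if eof then
    return state.buf
  end
  return nil -- nothing reaches the client until EOF
end

-- Fixed shape: forward each chunk downstream as soon as it arrives.
local function streaming_filter(state, chunk, eof)
  return chunk
end

-- Tiny driver: with the streaming filter, each chunk prints immediately.
local chunks = { 'data: {"t":"Hel"}\n\n', 'data: {"t":"lo"}\n\n', "data: [DONE]\n\n" }
local state = {}
for i, c in ipairs(chunks) do
  local out = streaming_filter(state, c, i == #chunks)
  if out then io.write(out) end
end
```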

Also fixes a related parsing issue with Bedrock, where the wrong response Content-Type was used.
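
For context (an assumption on my part, not confirmed by the diff shown here): Bedrock's streaming endpoints respond with AWS's binary event-stream encoding (`application/vnd.amazon.eventstream`) rather than SSE's `text/event-stream`, so keying the parser off the wrong Content-Type would mis-parse the frames. A hypothetical dispatch sketch:

```lua
-- Hypothetical sketch, not the PR's code: pick a stream parser from the
-- upstream response Content-Type instead of assuming SSE.

local function parse_sse(chunk)
  -- stub: extract the payload of an SSE "data: ..." line
  return chunk:match("data:%s*(.-)%s*\n")
end

local function parse_aws_eventstream(chunk)
  -- stub: real AWS event-stream frames are binary and need full decoding
  return chunk
end

local parsers = {
  ["text/event-stream"]                  = parse_sse,
  ["application/vnd.amazon.eventstream"] = parse_aws_eventstream,
}

local function get_parser(content_type)
  -- strip parameters, e.g. "text/event-stream; charset=utf-8"
  local mime = (content_type or ""):match("^%s*([^;%s]+)")
  return parsers[mime]
end

assert(get_parser("text/event-stream; charset=utf-8") == parse_sse)
```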

Checklist

  • The Pull Request has tests
  • A changelog file has been created under changelog/unreleased/kong, or the skip-changelog label has been added to the PR if a changelog is unnecessary.
  • There is a user-facing docs PR against https://github.com/Kong/docs.konghq.com - bugfix, no new docs

Issue reference

FTI-6419

@tysoekong (Contributor) left a comment

Yep, approved as discussed. I've tested all dependent AI services manually, and the automation will handle the edge cases.

@tysoekong requested a review from fffonion on January 3, 2025 at 17:26
kong/llm/plugin/base.lua (review thread: outdated, resolved)
@fffonion merged commit b7f5ed2 into master on Jan 9, 2025
25 checks passed
@fffonion deleted the fix/ai_streaming_chunking branch on January 9, 2025 at 09:01