feat: ORCA Format KV Cache Utilization in Inference Response Header #7839

Open
wants to merge 38 commits into
base: main
Commits
e6dc971
Add helper functions to pull metrics in HTTPAPIServer to pull metrics…
BenjaminBraunDev Dec 9, 2024
6d8d0ec
Add logging, examples and more detailed comments, and move feature fu…
BenjaminBraunDev Dec 13, 2024
cdff4c9
Merge branch 'main' of https://github.com/BenjaminBraunDev/server-for…
BenjaminBraunDev Jan 13, 2025
13c10fe
Merge branch 'main' into r24.10
BenjaminBraunDev Jan 13, 2025
d47a547
Merge branch 'main' into r24.10
BenjaminBraunDev Jan 16, 2025
dab2ebf
change ORCA envvar flag variable name to TRITON_ORCA_METRIC_FORMAT an…
BenjaminBraunDev Jan 21, 2025
f172906
Merge branch 'main' into r24.10
BenjaminBraunDev Jan 21, 2025
e789068
small whitespace fix in http_header.h
BenjaminBraunDev Jan 21, 2025
3e4430b
Merge branch 'r24.10' of https://github.com/BenjaminBraunDev/server-f…
BenjaminBraunDev Jan 21, 2025
8ad502c
Merge branch 'main' into r24.10
BenjaminBraunDev Jan 22, 2025
a339ca5
Remove unreachable return line in ORCA test.
BenjaminBraunDev Jan 23, 2025
397d03c
Merge branch 'r24.10' of https://github.com/BenjaminBraunDev/server-f…
BenjaminBraunDev Jan 23, 2025
7819881
Merge branch 'main' into r24.10
BenjaminBraunDev Jan 27, 2025
29f9a9f
Merge branch 'main' into r24.10
BenjaminBraunDev Jan 28, 2025
9fd1098
Merge branch 'main' into r24.10
BenjaminBraunDev Jan 29, 2025
44af8bc
Change ORCA metric response header logic to depend on request header …
BenjaminBraunDev Feb 4, 2025
e2ad896
Merge branch 'main' into r24.10
BenjaminBraunDev Feb 4, 2025
65d4531
Merge branch 'r24.10' of https://github.com/BenjaminBraunDev/server-f…
BenjaminBraunDev Feb 4, 2025
15d0ebe
Add macros in place of kv_cache block type strings.
BenjaminBraunDev Feb 6, 2025
eb79e76
Merge branch 'main' into r24.10
BenjaminBraunDev Feb 6, 2025
43fbbef
Merge branch 'main' into r24.10
BenjaminBraunDev Feb 6, 2025
908f183
Merge branch 'main' into r24.10
BenjaminBraunDev Feb 10, 2025
80d81dd
Remove unused imports and variables.
BenjaminBraunDev Feb 11, 2025
633bfbb
Merge branch 'main' into r24.10
BenjaminBraunDev Feb 11, 2025
ca53a2d
Rename endpoint-load-metrics-type to endpoint-load-metrics-format.
BenjaminBraunDev Feb 13, 2025
2fbad44
Merge branch 'main' into r24.10
BenjaminBraunDev Feb 14, 2025
e37df6f
Change the string for native http ORCA metric type from http to text …
BenjaminBraunDev Feb 14, 2025
a7d86e8
Merge branch 'main' into r24.10
BenjaminBraunDev Feb 18, 2025
6512b5e
Merge branch 'main' into r24.10
BenjaminBraunDev Feb 24, 2025
b699437
Add new files for orca metric refactor.
BenjaminBraunDev Feb 25, 2025
71655f2
Merge branch 'main' into r24.10
BenjaminBraunDev Feb 26, 2025
c16a17f
Complete orca metrics refactor and add orca_http.cc to CMakeLists.
BenjaminBraunDev Feb 26, 2025
44bc54b
Merge branch 'refactor' into r24.10
BenjaminBraunDev Feb 27, 2025
30c673d
Undo build script fix, this fix is part of a separate PR.
BenjaminBraunDev Feb 27, 2025
1b08c56
Remove whitespace.
BenjaminBraunDev Feb 27, 2025
036ea35
Merge branch 'main' into r24.10
BenjaminBraunDev Feb 27, 2025
824cbc9
Merge branch 'main' into r24.10
BenjaminBraunDev Mar 5, 2025
0ab3915
Merge branch 'main' into r24.10
BenjaminBraunDev Mar 6, 2025

Merge branch 'main' into r24.10
BenjaminBraunDev authored Jan 22, 2025
commit 8ad502cc06ada45ad8089340b39a7d06350a43f1

This merge commit was added into this branch cleanly.

There are no new changes to show, but you can still view the diff.