Are there any plans to extend the current framework to handle concurrent requests through an endpoint like llama-server, enabling production-ready serving comparable to vLLM?