Can you add better support for concurrent requests, like in this Ollama release? https://github.com/ollama/ollama/releases/tag/v0.2.0
It would be nice to spread concurrent requests across the RAM and GPU to maintain a good speed, and to be able to handle 30 concurrent requests at a slower speed rather than crashing.