You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
After we added minio, our DB became persistent but our jobs in ray aren't. That means when we restart the containers we can see what's in the database but not what is in ray, causing some issues. We need to discuss how to approach this (should we have a volume for ray to make it persistent? should we make the api fault tolerant anyway?)
Alternatives
No response
Contribution
No response
Have you searched for similar issues before submitting this one?
Yes, I have searched for similar issues
The text was updated successfully, but these errors were encountered:
Ray seems to be considering at least persistence for job state and related information, e.g. ray-project/ray#39503, but apparently this is marked as "important, but not time critical". Embedded KV storages like RocksDB or SplinterDB could play a similar role to SQLite in this space.
A lightly loaded Redis should be a few MB more than 3MB in memory, according to Redis documentation: https://redis.io/docs/latest/develop/get-started/faq/#whats-the-redis-memory-footprint. The container will be configured properly to keep this memory under control. It may also be necessary to add documentation to instruct the user to clean up the memory volumes if a sizable amount of job state is stored. The lifetime of all named volumes will be tied together so that db and ray persistent data are always consistent.
Motivation
After we added minio, our DB became persistent but our jobs in ray aren't. That means when we restart the containers we can see what's in the database but not what is in ray, causing some issues. We need to discuss how to approach this (should we have a volume for ray to make it persistent? should we make the api fault tolerant anyway?)
Alternatives
No response
Contribution
No response
Have you searched for similar issues before submitting this one?
The text was updated successfully, but these errors were encountered: