Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

e2e tests occasionally flake with out of disk space #442

Open
bbrowning opened this issue Dec 10, 2024 · 2 comments
Open

e2e tests occasionally flake with out of disk space #442

bbrowning opened this issue Dec 10, 2024 · 2 comments
Labels
bug Something isn't working CI/CD Affects CI/CD configuration

Comments

@bbrowning
Copy link
Contributor

For whatever reason, we're occasionally running out of disk space in our e2e tests, on a node with 700GB+ free before we start the e2e test. See https://github.com/instructlab/sdg/actions/runs/12261506100/job/34208780366 for an example here.

@nathan-weinberg nathan-weinberg added the CI/CD Affects CI/CD configuration label Dec 18, 2024
@nathan-weinberg
Copy link
Member

This is something @courtneypacheco has been looking into

@nathan-weinberg nathan-weinberg added the bug Something isn't working label Dec 18, 2024
@courtneypacheco
Copy link
Contributor

For reference, this is why we run out of space:

instructlab/instructlab#2417 (comment)

tl;dr - it has to do with tmpfs defaults not allowing /tmp to consume more than 50% of the total physical memory we allocate via EBS. The easiest solutions would either be to change the default save data location for the data, or increase the disk space.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working CI/CD Affects CI/CD configuration
Projects
None yet
Development

No branches or pull requests

3 participants