Add XL e2e nightly CI job #5
e2e-nvidia-l40s-x8.yml
on: push
Annotations
3 errors and 4 warnings
e2e-xlarge-test
System.IO.IOException: No space left on device : '/actions-runner/_diag/pages/55cdac43-8d15-4164-b29a-04457d1e8b1a_ae6e50a3-07a5-47f0-b470-dc27250c13a5_2.log'
at Interop.ThrowExceptionForIoErrno(ErrorInfo errorInfo, String path, Boolean isDirError)
at Microsoft.Win32.SafeHandles.SafeFileHandle.Open(String fullPath, FileMode mode, FileAccess access, FileShare share, FileOptions options, Int64 preallocationSize, UnixFileMode openPermissions, Int64& fileLength, UnixFileMode& filePermissions, Boolean failForSymlink, Boolean& wasSymlink, Func`4 createOpenException)
at System.IO.Strategies.OSFileStreamStrategy..ctor(String path, FileMode mode, FileAccess access, FileShare share, FileOptions options, Int64 preallocationSize, Nullable`1 unixCreateMode)
at System.IO.FileStream..ctor(String path, FileMode mode, FileAccess access, FileShare share)
at GitHub.Runner.Common.PagingLogger.NewPage()
at GitHub.Runner.Worker.ExecutionContext.Write(String tag, String message)
at GitHub.Runner.Worker.ExecutionContextExtension.Output(IExecutionContext context, String message)
at GitHub.Runner.Worker.JobExtension.FinalizeJob(IExecutionContext jobContext, AgentJobRequestMessage message, DateTime jobStartTimeUtc)
at GitHub.Runner.Worker.JobRunner.RunAsync(AgentJobRequestMessage message, CancellationToken jobRequestCancellationToken)
at GitHub.Runner.Worker.JobRunner.RunAsync(AgentJobRequestMessage message, CancellationToken jobRequestCancellationToken)
at GitHub.Runner.Worker.Worker.RunAsync(String pipeIn, String pipeOut)
at GitHub.Runner.Worker.Program.MainAsync(IHostContext context, String[] args)
|
e2e-xlarge-test
No space left on device : '/actions-runner/_diag/pages/55cdac43-8d15-4164-b29a-04457d1e8b1a_8e6af478-3897-5c87-c601-661c392949aa_1.log'
|
loss-graphs
Unable to download artifact(s): Artifact not found for name: phase-1-training-log-xlarge.jsonl
Please ensure that your artifact is not expired and the artifact was uploaded using a compatible version of toolkit/upload-artifact.
For more information, visit the GitHub Artifacts FAQ: https://github.com/actions/toolkit/blob/main/packages/artifact/docs/faq.md
|
start-xlarge-ec2-runner
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
|
e2e-xlarge-test
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 8 MB
|
stop-xlarge-ec2-runner
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
|
loss-graphs
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
|
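The e2e-xlarge-test annotations above show the runner hitting "No space left on device" with only 8 MB free. A pre-flight check could fail the job early with a clear message instead. The following is a minimal sketch, not part of the workflow in this run: the function names, the default path `/`, and the 10 GiB threshold are all illustrative assumptions, and the right threshold depends on how much the training job actually writes.

```python
import shutil

MIB = 1024 * 1024


def free_space_mb(path="/"):
    """Return the free space, in MiB, on the filesystem that contains `path`."""
    return shutil.disk_usage(path).free // MIB


def has_enough_space(path="/", min_free_mb=10 * 1024):
    """Return True if at least `min_free_mb` MiB are free at `path`.

    The 10 GiB default is a hypothetical threshold, not a value taken
    from the run above.
    """
    return free_space_mb(path) >= min_free_mb


if __name__ == "__main__":
    # Report only; a real workflow step would likely exit non-zero
    # when the check fails so the job stops before training starts.
    status = "ok" if has_enough_space("/") else "LOW"
    print(f"free space on /: {free_space_mb('/')} MiB ({status})")
```

Run as an early step on the self-hosted runner, a check like this would surface the disk shortage before the runner's own log writer fails. It would presumably also make the downstream loss-graphs failure easier to diagnose, since the missing phase-1-training-log-xlarge.jsonl artifact would then trace back to a skipped or failed upstream job.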