Add CI workflow that runs the full SDG pipeline #93
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR introduces a new e2e CI workflow that exercises the full SDG
pipeline. Some important details:
This job runs on a dynamic github runner spawned on AWS
The instance type has 4x NVIDIA A10G GPUs (96 GB total)
a 4-bit quantized version of Mixtral-8x7b-instruct is used
Training is currently skipped while we work out getting the new
training library functional in CI
The model is served via llama-cpp, as vllm is not yet functional in
CI
The job does not run automatically. It must be launched manually via
the GitHub UI. When you launch it against a given PR, it will
automatically add comments to that PR to make it easier to follow
the progress and results. For more info on launching this workflow
manually, see the instructions for similar workflows in the
instructlab/instructlab
repository:https://github.com/instructlab/instructlab/blob/main/docs/ci.md
Signed-off-by: Russell Bryant [email protected]