add al2keplergpu build recipe to build gpu amis for kepler arch #153
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Summary
With the release of
20230912
AMI, default Nvidia and Cuda drivers were updated in AL repos. These further later added support for P5 instances but lost support for P2 instances (Kepler architecture), for which Nvidia 470 is the latest supported driver.This PR adds a new
al2keplergpu
build recipe to support building AL2 ECS Optimized GPU AMIs with this package for instances with Kepler arch GPUs (such as P2 type). This recipe is NOT a new supported ECS platform, no new AMI types are published for this type. Rather it is a build recipe for anyone to build their own ECS AMIs for Kepler instances with GPUs on Kepler architecture.This will enable customers to create their own GPU supported AMIs for Kepler based GPU instances (such as P2.xlarge) by
release.auto.pkrvars.hcl
oroverrides.auto.pkrvars.hcl
REGION=<region> make al2keplergpu
Implementation details
al2keplergpu
and new makefile targetscripts/enable-ecs-agent-gpu-support.sh
to pin package versions to supported versions for Kepler type GPUs. Also, excludenvidia
andcuda
packages in the built AMI from yum updates to keep Kepler support for the built ami while maintaining regular updates.Testing
Make amis
REGION=us-west-2 make al2keplergpu
andREGION=us-west-2 make al2gpu
al2keplergpu
ami, run functional test suite on p2.xlarge instance, verify tests succeed.470.182.03
and Cuda11.4
is installed.ecs.capability.gpu-driver-version:470.182.03
yum update -y
does not update Nvidia/Cuda drivers./etc/yum.conf
has the exclusionexclude=*nvidia* *cuda*
al2gpu
ami, run functional test suite on g4dn.xlarge instance, verify tests succeed.535.54.03
and Cuda12.2
is installed.ecs.capability.gpu-driver-version:535.54.03
New tests cover the changes:
No new tests
Description for the changelog
add al2keplergpu build recipe to build gpu amis for kepler arch
Licensing
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.