-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug] Container fails to launch on AWS Fargate due to IO error #1033
Comments
Hi @jakeybrown92 thanks for trying out SOCI snapshotter! For your issue, I have tested and successfully launched a container using public.ecr.aws/sam/build-python3.8:latest as a base on SOCI v0.4.1. Are you able to reproduce the issue running your container in a non-Fargate environment and provide the SOCI logs to give some more insights to the error which is preventing your container from launching? Alternatively if you have an AWS Support plan you can file a technical support ticket with AWS Fargate which will enable us to investigate it with the service team. (Not a requirement by any means; just mentioning it because of the AWS Fargate reference) |
@austinvazquez im running on mac so was unable to get the soci snapshotter binaries working as expected so i have built a soci docker image that contains the binaries and pull this locally. i am then pulling the image from ecr and doing soci index create etc. Seems like there is an issue with the soci-snapshotter-gprc when running it locally on docker though, so unsure where i can look at logs. i am able to run the python images as containers stored in ECR into the soci container locally and also onto my mac locally and not sure the soci indexing would come into play here? I have raised a case with AWS in the mean time |
@jakeybrown92 , would like to touch base on this. I have reached out to the AWS Fargate service team and am working to get access to your service ticket so we can begin a root cause analysis. With respect to running SOCI on a Macbook, I am not familiar with Docker Desktop, but if you are looking for an alternative I have used Finch which allows developers to drop into the Linux VM. [Reference] Finch has SOCI integration out of the box. See Finch's Lazy Loading documentation. |
Hi @austinvazquez i did not know about finch. That looks really useful. Will use that for other things in the future! Okay so after running the vm and tailing the journalctl and running the below command i can see some errors in the logs I have also tested this works as expected on finch with another image that successfully works in fargate using soci index. (amazonlinux docker image) I can confirm this works as expected and the image gets lazy loaded from ECR. I also dont see any of the above errors in the logs |
Description
We are using docker buildx to build docker images for amd64 and arm64. We then have a script to create a soci index for both platforms and push to ECR.
This works fine for all of our other Docker images except python (same issue for all different python versions) We build on-top of the public aws sam build for python. The docker entrypoint is copied into file then executed(the same as our other working builds)
The ECS Fargate container fails to start. The task definition is running on x86. When checking the ECS logs, I can see the following error:-
exec /usr/local/bin/docker-entrypoint.sh: input/output error
as soon as we delete the soci index and image index from ECR for the particular image the issue is gone and the container gets pulled as expected. So we know there is not an issue with the image, its just when the soci index exists for the image
Steps to reproduce the bug
No response
Describe the results you expected
The container to lazy load the python images with no errors.
Host information
Any additional context or information about the bug
No response
The text was updated successfully, but these errors were encountered: