-
Notifications
You must be signed in to change notification settings - Fork 80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AWX Tasks hanging on ansible.utils.cli_parse >= 5.1.0 #382
Comments
Thank you so much for posting this, I though I was losing my mind. I am experiencing the same behavior with cli_parse when I try to run it on a target that is down. The task properly times out with I did what you suggested and downgraded the collection and had no problems. It seems to work fine if the list of hosts is small (<10) but if its >20 or so, 5.1.2 seems to choke. It doesn't even need many unreachable hosts to choke, just a single host in the batch that is down and it will hang. |
5.0.0 doesn't seem to work at all, it doesn't seem to actually make a ssh connection to the target. I think that is a bug in 5.0.0 outside of this issue. |
Here is some more detailed testing:
ansible.utils:
3.1.0 - Tested with 8dc11de reverted, works as expected Troubling thing is I can't identify a change from 4.0.0 to 4.1.0 is responsible for this changed behavior... |
I've tested on ansible-core 2.18.1 and no change But I did narrow it down a little more;
If I comment this line out, it doesn't hang on any version. |
hi @bewing @JCTechSol, To investigate further and pinpoint the root cause, I would need the following additional details: Debugging Logs: Simplified Test Case: |
SUMMARY
AWX Tasks running ansible.utils.cli_parse are hanging
ISSUE TYPE
COMPONENT NAME
ansible.utils.cli_parse
ANSIBLE VERSION
COLLECTION VERSION
CONFIGURATION
OS / ENVIRONMENT
awx-operator 2.19.1
AWX 24.6.1
STEPS TO REPRODUCE
We have several extremely large AWX jobs that collect information from the CLI and push new configs to switch devices. After upgrading from ansible.utils 5.0.0 to 5.1.0 and 5.1.2, jobs occasionally hang indefinitely, not producing anymore job output. The last log output of every hung job contains
"task_action": "ansible.utils.cli_parse"
Producing the same AWX EE image, but with ansible.utils==5.0.0, no jobs hang.
I will see if I can produce a publishable test case, but it may take some time. I did want to get this recorded in case any other operators are seeing similar issues.
The text was updated successfully, but these errors were encountered: