Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Job Variable_List dropped if qsub does not submit on first try #413

Open
brewlius-cesar opened this issue Feb 21, 2017 · 1 comment
Open

Comments

@brewlius-cesar
Copy link

A job's "Variable_List" attribute appears to get dropped if an attempt at job submission does not go through on the first try. This is likely to not be isolated to qsub, but that is the only way I have attempted to demonstrate it.

qsub makes multiple attempts at job submission in cmds/qsub_functions.c main_func(). I tripped upon this particular issue when submitting a job with a dependency that sent back a PBSE_BADDEPEND on first try but then succeeded once alternate_dependency was substituted in.

My amateur digging has led me to src/lib/Libifl/env_attropl_hash.c build_var_list(), where I can see that ATTR_v is deleted from the hash. It's not 100% clear why this is done, other than I noticed that the ATTR_v is sent last in encode_DIS_attropl_hash() and dropping ATTR_v would prevent it from being sent twice. If that is the case a simple if check in enc_DIS_attropl_hash_single() might be better than deleting from the hash table and losing state in between submission attempts.

@brewlius-cesar
Copy link
Author

brewlius-cesar commented Feb 21, 2017

I can provide a patch, would probably be easier than trying to understand my wording...

brewlius-cesar/torque@ 9197e6f

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant