-
Notifications
You must be signed in to change notification settings - Fork 309
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Backports to address b/336317519 #14561
Conversation
DAOS-14679 pool: Report on stopping sp_stopping (#14374) DAOS-15145 pool: add pool collective function (#13764) *DAOS-14105 object: collectively punch object (#13493) * partial backport, just the bitmap function Required-githooks: true Change-Id: I2b21b8121cbdecc79ae49a464a42b1d47fb9be10 Signed-off-by: Jeff Olivier <[email protected]>
Bug-tracker data: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. No errors found by checkpatch.
Tested with reintegrate loop, seems to be working still. This is the other patch of two suggested on DAOS-15386 |
@liw this is a bit complicated and required some manual work. If you have cycles and could take a look, I'd appreciate it. I was mainly trying to get your patch but the two other patches (or parts of them) seemed useful. |
src/object/srv_obj_migrate.c
Outdated
|
||
rc = ds_pool_thread_collective(pool->sp_uuid, | ||
PO_COMP_ST_NEW | PO_COMP_ST_DOWN | PO_COMP_ST_DOWNOUT, | ||
migrate_fini_one_ult, &arg, 0); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
[Request for changes] The "DAOS-15145 pool: add pool collective function #13764" commit is buggy, and requires at least (I can't find any other myself) the following fixes:
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It was recommended by @NiuYawei as something that might help with https://daosio.atlassian.net/browse/DAOS-15386
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
sorry, this was meant as a reply to the other comment.
Just curious: Why do you want my humble patch? :) It doesn't seem that critical. |
See the Jira ticket for the long story. When releasing resources, for instance, for stopping a pool, we should attempt on all targets, because the latest version of the pool map may differ from the version used to acquire the resources. (And in some cases, we no longer have the original pool map version in the context.) Required-githooks: true Change-Id: I363fbfefae01dd75cc07ded31a2e5b7033f01610 Signed-off-by: Li Wei <[email protected]>
The container might be opened when the target is up, but changed to down when closing. We need to attempt to close down/downout targets regardless; it won't take any action if it was not opened before. Failure to properly close it will result in container destruction failing with EBUSY. (See DAOS-15514) Required-githooks: true Change-Id: I82e7c507d5f165057d562455e31d7b02d19e1815 Signed-off-by: Wang Shilong <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. No errors found by checkpatch.
Bottom line
I replied to the wrong comment but ultimately, I'm just trying to take anything that might help stabilize our reintegration workflow |
Change-Id: Ib1b044f566d0aaf21e5faa40fb979448b503d7c8
Required-githooks: true Change-Id: I7a4f33eec601e33ca7d23b135b03ed1c36c3e4e4 Signed-off-by: Jeff Olivier <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. No errors found by checkpatch.
DAOS-14679 pool: Report on stopping sp_stopping (#14374)
DAOS-15514 container: fix container destroy failure (#14108)
DAOS-15672 rebuild: Fix pool destroy hangs (#14183)
DAOS-15145 pool: add pool collective function (#13764)
*DAOS-14105 object: collectively punch object (#13493)
Required-githooks: true
Change-Id: I2b21b8121cbdecc79ae49a464a42b1d47fb9be10
Before requesting gatekeeper:
Features:
(orTest-tag*
) commit pragma was used or there is a reason documented that there are no appropriate tags for this PR.Gatekeeper: