Skip to content

DAOS-6798 iv: stop the iv ns leader first during destroy #4886

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Mar 11, 2021

Conversation

kccain
Copy link
Contributor

@kccain kccain commented Mar 5, 2021

Cherry-pick of PR #4827 from daos master to daos release/1.2.

Let's stop the iv ns leader during the destory first, so
all arriving IV requests will fail and return immediately.

And with this change the mdtest_small test is re-enabled for PRs.

Signed-off-by: Di Wang di.wang@intel.com
Signed-off-by: Kenneth Cain kenneth.c.cain@intel.com

Copy link
Collaborator

@daosbuild1 daosbuild1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. No errors found by checkpatch.

Cherry-pick of PR #4827 from daos master to daos release/1.2.

Let's stop the iv ns leader during the destory first, so
all arriving IV requests will fail and return immediately.

And with this change the mdtest_small test is re-enabled for PRs.

Signed-off-by: Di Wang <di.wang@intel.com>
Signed-off-by: Kenneth Cain <kenneth.c.cain@intel.com>
@kccain kccain force-pushed the kccain/daos_6798_rel1p2 branch from 80eb3cb to 7842fbf Compare March 7, 2021 18:28
Copy link
Collaborator

@daosbuild1 daosbuild1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. No errors found by checkpatch.

@kccain
Copy link
Contributor Author

kccain commented Mar 7, 2021

The pool create test failures from build 1 are known to have been just recently fixed on release/1.2 branch. I have rebased this branch on latest release/1.2 and pushed for another round of CI testing to get rid of those issues.

@daosbuild1
Copy link
Collaborator

Test stage Scan CentOS 7 RPMs completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-4886/2/execution/node/1181/log

@kccain
Copy link
Contributor Author

kccain commented Mar 8, 2021

the other rebuild0-10 test failure in build 1 is due to an existing bug that requires its fix to be landed to release/1.2 branch

  1. https://build.hpdd.intel.com/job/daos-stack/job/daos/job/PR-4886/1/artifact/Functional_Hardware_Medium/daos_test/daos_core_test-rebuild.py/ : DAOS-6840

build 2 failures are due to existing bugs

  1. aggregation/aggregation_basic: DAOS-6790
  2. pool/create_capacity_test: DAOS-6784 (and more accurately DAOS-6976)

build 2 also showing a failure in Scan CentOS 7 RPMs that seems to be happening on other release/1.2 PRs e.g., PR 4589 and has been a recent issue on master.

@kccain
Copy link
Contributor Author

kccain commented Mar 8, 2021

I will wait for landing of PR #4881 to master that re-enables mdtest_small there, before requesting landing of this change to release/1.2 branch.

This PR being an unmodified cherry-pick of master PR 4827 + PR 4881 will not require code reviews on this release/1.2 PR. I will request landing from gatekeeper.

@kccain
Copy link
Contributor Author

kccain commented Mar 8, 2021

OK this is ready for review by @daos-stack/daos-gatekeeper .
The solution PR 4827 and re-enable mdtest_small PR #4881 have both landed to daos master. And, this PR has successfully tested including the re-enabled mdtest_small test.

@kccain kccain requested a review from a team March 8, 2021 21:44
@@ -4805,6 +4805,7 @@ ds_pool_evict_handler(crt_rpc_t *rpc)
D_GOTO(out_free, rc);

ds_pool_iv_srv_hdl_invalidate(svc->ps_pool);
ds_iv_ns_leader_stop(svc->ps_pool->sp_iv_ns);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should this be done as first operation before you proceed with tx update and hdl invalidation?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

oh, we want server handle be invalidated successfully during pool destroy, otherwise if iv_ns_leader_stop is being executed first, server handle invalidation will fail.

@kccain kccain requested a review from a team March 10, 2021 17:23
@kccain
Copy link
Contributor Author

kccain commented Mar 10, 2021

https://jira.hpdd.intel.com/browse/DAOS-6798 has just been marked approved to merge for release 1.2.
@daos-stack/daos-gatekeeper can you take a final look at it now that it is in this state?
Thanks!

@wangdi1 wangdi1 requested review from wangdi1 and frostedcmos March 10, 2021 18:50
@jolivier23 jolivier23 merged commit 34fe39b into release/1.2 Mar 11, 2021
@jolivier23 jolivier23 deleted the kccain/daos_6798_rel1p2 branch March 11, 2021 17:11
@ashleypittman ashleypittman mentioned this pull request Apr 28, 2021
@ashleypittman ashleypittman mentioned this pull request May 20, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

5 participants