-
Notifications
You must be signed in to change notification settings - Fork 185
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DFBUGS-1656: [release-4.18] core: add back the CSI_DISABLE_HOLDER_PODS key on the cm #3050
DFBUGS-1656: [release-4.18] core: add back the CSI_DISABLE_HOLDER_PODS key on the cm #3050
Conversation
@parth-gr: This pull request references [Jira Issue DFBUGS-1656](https://issues.redhat.com//browse/DFBUGS-1656), which is valid. The bug has been moved to the POST state. 3 validation(s) were run on this bug
No GitHub users were found matching the public email listed for the QA contact in Jira ([email protected]), skipping review request. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
If the OCS operator is upgraded before the rook operator, It will go and remove the CSI_DISABLE_HOLDER_PODS key from the ocs-configmap, Removing the key from the ocs-cm will re-start the rook operator pods, As Rook has not been upgraded and is still on 4.17, it looks for the env variable value which was coming from ocs-configmap key, And as the key is deleted rook operator got stuck. Revert of the pr red-hat-storage#2609 And Revert the fix in 4.19. Signed-off-by: parth-gr <[email protected]> (cherry picked from commit 843bd4c) Signed-off-by: parth-gr <[email protected]>
9de9a36
to
cabc5af
Compare
@parth-gr: This pull request references [Jira Issue DFBUGS-1656](https://issues.redhat.com//browse/DFBUGS-1656), which is valid. 3 validation(s) were run on this bug
No GitHub users were found matching the public email listed for the QA contact in Jira ([email protected]), skipping review request. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
/cc @subhamkrai |
@parth-gr |
/hold |
@parth-gr changes looks good to me but please test with upgrade case. Thanks |
No as in 4.18 rook CSV doesn’t reference to this Key after upgrade, so it just stays in the ocs-operator-config CM as a placeholder with no suage after upgrade to 4.18 is complete. |
@agarwal-mudit the holder pod is already deprecated rook/rook#14819 that's why this key was removed, For testing:
We cant control the OLM to upgrade the ocs operator first, so did 2 upgrades with @malayparida2000 but in those cases rook operator only upgraded first |
We have tested the upgrade scenarior by replacing our fix image in a affected cluster. |
Yeah it's very difficult to reproduce this issue we had 2 extra clusters and in both rook operator upgraded first so we couldn't hit the issue. We did test our image in the affected cluster we had from QE & that worked there. |
Upgrade testing(mirrored the scenario where upgrade was failing):
|
[APPROVALNOTIFIER] This PR is APPROVED Approval requirements bypassed by manually added approval. This pull-request has been approved by: malayparida2000, parth-gr The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
1f99ab6
into
red-hat-storage:release-4.18
@parth-gr: [Jira Issue DFBUGS-1656](https://issues.redhat.com//browse/DFBUGS-1656): All pull requests linked via external trackers have merged: [Jira Issue DFBUGS-1656](https://issues.redhat.com//browse/DFBUGS-1656) has been moved to the MODIFIED state. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
If the OCS operator is upgraded before the rook operator, It will go and remove the CSI_DISABLE_HOLDER_PODS key from the ocs-configmap, Removing the key from the cm will re-start the rook operator pods,
As Rook has not been upgraded and is still on 4.17, it looks for the env variable value which was coming from ocs-configmap key, And as the key is deleted rook operator got stuck.
Revert of the pr #2609
And Revert the fix in 4.19.
Signed-off-by: parth-gr [email protected]
(cherry picked from commit 843bd4c)