Skip to content

TMP: Pause kubevirt CSI e2e for OCPBUGS-77929 testing#80869

Open
jiezhao16 wants to merge 1 commit into
openshift:mainfrom
jiezhao16:pause-kubevirt-e2e
Open

TMP: Pause kubevirt CSI e2e for OCPBUGS-77929 testing#80869
jiezhao16 wants to merge 1 commit into
openshift:mainfrom
jiezhao16:pause-kubevirt-e2e

Conversation

@jiezhao16

@jiezhao16 jiezhao16 commented Jun 22, 2026

Copy link
Copy Markdown
Contributor

Summary

  • Adds a wait step to the hypershift-kubevirt-csi-e2e workflow post phase to pause the cluster for OCPBUGS-77929 debugging

Test plan

  • Verify the workflow pauses as expected during post steps
  • Use the paused cluster to investigate OCPBUGS-77929

Summary by CodeRabbit

This PR adds a temporary debugging step to the hypershift-kubevirt-csi-e2e workflow in the OpenShift CI infrastructure.

Change: A wait step has been inserted at the beginning of the workflow's post phase, before the existing cleanup and teardown chains. This will cause the workflow to pause for up to 3 hours (configurable via the TIMEOUT environment variable) after the CSI e2e tests complete.

Purpose: This is a temporary measure (indicated by the "TMP" prefix) to facilitate debugging of OCPBUGS-77929. The pause allows developers to investigate cluster state and gather additional diagnostic information using the running cluster before it is torn down.

Affected Component: The change only impacts the CI workflow configuration for the HyperShift KubeVirt CSI e2e test job (ci-operator/step-registry/hypershift/kubevirt/csi-e2e/hypershift-kubevirt-csi-e2e-workflow.yaml). The actual test logic and cluster setup/teardown remain unchanged.

@coderabbitai

coderabbitai Bot commented Jun 22, 2026

Copy link
Copy Markdown
Contributor

Walkthrough

A single wait step is inserted into the post phase of the hypershift-kubevirt-csi-e2e workflow, placed before the existing hypershift-dump, gather-core-dump, hypershift-kubevirt-destroy, and ipi-aws-post chains.

Changes

hypershift-kubevirt-csi-e2e Workflow Post Phase

Layer / File(s) Summary
Insert wait step into post cleanup sequence
ci-operator/step-registry/hypershift/kubevirt/csi-e2e/hypershift-kubevirt-csi-e2e-workflow.yaml
Adds a wait reference as the first entry in the post step list, ahead of the existing dump, destroy, and post chains.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~2 minutes

🚥 Pre-merge checks | ✅ 15
✅ Passed checks (15 passed)
Check name Status Explanation
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.
Title check ✅ Passed The title clearly and specifically describes the main change: adding a pause step to the kubevirt CSI e2e workflow for debugging purposes related to OCPBUGS-77929.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check ✅ Passed Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check ✅ Passed Check skipped because no linked issues were found for this pull request.
Stable And Deterministic Test Names ✅ Passed This PR modifies only a CI/CD workflow YAML configuration file (hypershift-kubevirt-csi-e2e-workflow.yaml), not Ginkgo test code. The custom check for stable and deterministic Ginkgo test names is...
Test Structure And Quality ✅ Passed Custom check is not applicable - PR modifies only workflow YAML configuration, not Ginkgo test code.
Microshift Test Compatibility ✅ Passed No new Ginkgo e2e tests are added in this PR. The change only modifies a CI workflow YAML configuration file by adding a wait step, not test code.
Single Node Openshift (Sno) Test Compatibility ✅ Passed The PR modifies only a CI workflow YAML file, not Ginkgo e2e test code. The SNO compatibility check applies only when new Ginkgo tests are added; this PR contains no test code.
Topology-Aware Scheduling Compatibility ✅ Passed Change is a CI workflow file adding a test utility step, not a deployment manifest, operator code, or controller. The check applies only to scheduling constraints in deployable components, which th...
Ote Binary Stdout Contract ✅ Passed This PR only modifies a CI workflow YAML file, adding a wait step reference. No Go test code or OTE binary implementations are modified, so the OTE Binary Stdout Contract check does not apply.
Ipv6 And Disconnected Network Test Compatibility ✅ Passed This PR only modifies a CI workflow YAML file (adding a wait step), not Ginkgo e2e test code. The check applies only to new tests; no test code changes are present.
No-Weak-Crypto ✅ Passed PR adds only a workflow YAML reference with no cryptographic code, weak crypto usage, or insecure secret comparisons.
Container-Privileges ✅ Passed No privileged container settings, hostPID, hostNetwork, hostIPC, SYS_ADMIN capabilities, root access, or allowPrivilegeEscalation found in the modified workflow file or referenced wait step.
No-Sensitive-Data-In-Logs ✅ Passed The PR adds a wait step for debugging. No sensitive data logging found: wait step only logs public PR numbers, duration strings, and generic debug messages—no passwords, tokens, keys, PII, session...

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@openshift-ci

openshift-ci Bot commented Jun 22, 2026

Copy link
Copy Markdown
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: jiezhao16
Once this PR has been reviewed and has the lgtm label, please assign nirarg for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci Bot requested review from orenc1 and qinqon June 22, 2026 19:26
@openshift-merge-bot

Copy link
Copy Markdown
Contributor

[REHEARSALNOTIFIER]
@jiezhao16: the pj-rehearse plugin accommodates running rehearsal tests for the changes in this PR. Expand 'Interacting with pj-rehearse' for usage details. The following rehearsable tests have been affected by this change:

Test name Repo Type Reason
pull-ci-openshift-kubevirt-csi-driver-main-e2e-kubevirt-aws-csi openshift/kubevirt-csi-driver presubmit Registry content changed
pull-ci-openshift-kubevirt-csi-driver-release-5.1-e2e-kubevirt-aws-csi openshift/kubevirt-csi-driver presubmit Registry content changed
pull-ci-openshift-kubevirt-csi-driver-release-5.0-e2e-kubevirt-aws-csi openshift/kubevirt-csi-driver presubmit Registry content changed
pull-ci-openshift-kubevirt-csi-driver-release-4.23-e2e-kubevirt-aws-csi openshift/kubevirt-csi-driver presubmit Registry content changed
pull-ci-openshift-kubevirt-csi-driver-release-4.22-e2e-kubevirt-aws-csi openshift/kubevirt-csi-driver presubmit Registry content changed
pull-ci-openshift-kubevirt-csi-driver-release-4.21-e2e-kubevirt-aws-csi openshift/kubevirt-csi-driver presubmit Registry content changed
pull-ci-openshift-kubevirt-csi-driver-release-4.20-e2e-kubevirt-aws-csi openshift/kubevirt-csi-driver presubmit Registry content changed
pull-ci-openshift-kubevirt-csi-driver-release-4.19-e2e-kubevirt-aws-csi openshift/kubevirt-csi-driver presubmit Registry content changed
pull-ci-openshift-kubevirt-csi-driver-release-4.18-e2e-kubevirt-aws-csi openshift/kubevirt-csi-driver presubmit Registry content changed
pull-ci-openshift-kubevirt-csi-driver-release-4.17-e2e-kubevirt-aws-csi openshift/kubevirt-csi-driver presubmit Registry content changed
pull-ci-openshift-kubevirt-csi-driver-release-4.16-e2e-kubevirt-aws-csi openshift/kubevirt-csi-driver presubmit Registry content changed
pull-ci-openshift-kubevirt-csi-driver-release-4.15-e2e-kubevirt-aws-csi openshift/kubevirt-csi-driver presubmit Registry content changed
pull-ci-openshift-kubevirt-csi-driver-release-4.14-e2e-kubevirt-aws-csi openshift/kubevirt-csi-driver presubmit Registry content changed
periodic-ci-openshift-hypershift-release-4.20-periodics-e2e-kubevirt-aws-ovn-csi N/A periodic Registry content changed
periodic-ci-openshift-hypershift-release-4.16-periodics-e2e-kubevirt-aws-ovn-csi N/A periodic Registry content changed
periodic-ci-openshift-hypershift-release-4.15-periodics-e2e-kubevirt-aws-ovn-csi N/A periodic Registry content changed
periodic-ci-openshift-hypershift-release-4.23-periodics-e2e-kubevirt-aws-ovn-csi N/A periodic Registry content changed
periodic-ci-openshift-hypershift-release-4.18-periodics-e2e-kubevirt-aws-ovn-csi N/A periodic Registry content changed
periodic-ci-openshift-hypershift-release-4.17-periodics-e2e-kubevirt-aws-ovn-csi N/A periodic Registry content changed
periodic-ci-openshift-hypershift-release-5.0-periodics-e2e-kubevirt-aws-ovn-csi N/A periodic Registry content changed
periodic-ci-openshift-hypershift-release-4.19-periodics-e2e-kubevirt-aws-ovn-csi N/A periodic Registry content changed
periodic-ci-openshift-hypershift-release-4.22-periodics-e2e-kubevirt-aws-ovn-csi N/A periodic Registry content changed
periodic-ci-openshift-hypershift-release-4.14-periodics-e2e-kubevirt-aws-csi N/A periodic Registry content changed
periodic-ci-openshift-hypershift-release-4.21-periodics-e2e-kubevirt-aws-ovn-csi N/A periodic Registry content changed

Prior to this PR being merged, you will need to either run and acknowledge or opt to skip these rehearsals.

Interacting with pj-rehearse

Comment: /pj-rehearse to run up to 5 rehearsals
Comment: /pj-rehearse skip to opt-out of rehearsals
Comment: /pj-rehearse {test-name}, with each test separated by a space, to run one or more specific rehearsals
Comment: /pj-rehearse more to run up to 10 rehearsals
Comment: /pj-rehearse max to run up to 25 rehearsals
Comment: /pj-rehearse auto-ack to run up to 5 rehearsals, and add the rehearsals-ack label on success
Comment: /pj-rehearse list to get an up-to-date list of affected jobs
Comment: /pj-rehearse abort to abort all active rehearsals
Comment: /pj-rehearse network-access-allowed to allow rehearsals of tests that have the restrict_network_access field set to false. This must be executed by an openshift org member who is not the PR author

Once you are satisfied with the results of the rehearsals, comment: /pj-rehearse ack to unblock merge. When the rehearsals-ack label is present on your PR, merge will no longer be blocked by rehearsals.
If you would like the rehearsals-ack label removed, comment: /pj-rehearse reject to re-block merging.

@jiezhao16

Copy link
Copy Markdown
Contributor Author

/pj-rehearse periodic-ci-openshift-hypershift-release-5.0-periodics-e2e-kubevirt-aws-ovn-csi

@openshift-merge-bot

Copy link
Copy Markdown
Contributor

@jiezhao16: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

@coderabbitai coderabbitai Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In
`@ci-operator/step-registry/hypershift/kubevirt/csi-e2e/hypershift-kubevirt-csi-e2e-workflow.yaml`:
- Line 31: The wait step reference at line 31 is using default parameter values
(PR_ONLY=false and TIMEOUT=+3 hours) which causes all job runs to block
post-cleanup for extended periods and unnecessarily holds leased infrastructure.
Add parameter overrides to the wait step by setting PR_ONLY to "true" (and
optionally set a shorter TIMEOUT value) to scope this temporary debug path so it
only applies to PR runs and completes faster.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository YAML (base), Central YAML (inherited)

Review profile: CHILL

Plan: Enterprise

Run ID: 45a73a88-b9af-4c5b-afd5-0ec5a172eb9d

📥 Commits

Reviewing files that changed from the base of the PR and between 725f9dd and fbfb165.

📒 Files selected for processing (1)
  • ci-operator/step-registry/hypershift/kubevirt/csi-e2e/hypershift-kubevirt-csi-e2e-workflow.yaml

test:
- chain: hypershift-kubevirt-run-csi-e2e
post:
- ref: wait

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Scope the debug wait so it doesn’t stall all job types.

At Line 31, wait uses defaults (PR_ONLY=false, TIMEOUT=+3 hours), so every run can block post-cleanup for hours and hold leased infra longer than needed. Please set PR_ONLY: "true" (and optionally a shorter TIMEOUT) for this temporary debug path.

Suggested change
 workflow:
   as: hypershift-kubevirt-csi-e2e
@@
   steps:
     env:
       COMPUTE_NODE_TYPE: c5n.metal
       CONTROL_PLANE_INSTANCE_TYPE: "m6i.2xlarge"
       COMPUTE_NODE_REPLICAS: "1"
       ZONES_COUNT: "3"
       SKIP_MONITOR_TEST: "true"
+      PR_ONLY: "true"
+      TIMEOUT: "+30 minutes"
@@
     post:
     - ref: wait
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In
`@ci-operator/step-registry/hypershift/kubevirt/csi-e2e/hypershift-kubevirt-csi-e2e-workflow.yaml`
at line 31, The wait step reference at line 31 is using default parameter values
(PR_ONLY=false and TIMEOUT=+3 hours) which causes all job runs to block
post-cleanup for extended periods and unnecessarily holds leased infrastructure.
Add parameter overrides to the wait step by setting PR_ONLY to "true" (and
optionally set a shorter TIMEOUT value) to scope this temporary debug path so it
only applies to PR runs and completes faster.

@openshift-ci

openshift-ci Bot commented Jun 22, 2026

Copy link
Copy Markdown
Contributor

@jiezhao16: all tests passed!

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

@jiezhao16

Copy link
Copy Markdown
Contributor Author

/pj-rehearse abort

@openshift-merge-bot

Copy link
Copy Markdown
Contributor

@jiezhao16: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant