smoke: isolate and loop flaky Chat Sessions / Agents Window tests [do not merge] #322243

Draft
alexdima wants to merge 1 commit into main from agents/smoke-tests-flakiness-investigation-0e42e6ff

@alexdima

Member

⚠️ Do not merge. This is a temporary investigation PR to flush out smoke test flakiness reported by the team in the Chat Sessions and Agents Window suites.

What this does

  1. Isolates the suspected flaky suites — in test/smoke/src/main.ts, all smoke test suites are commented out except:

    • setupChatSessionsTests (Chat Sessions)
    • setupAgentsWindowTests (Agents Window, including local AgentHost / SDK sandbox / Codex variants)
  2. Loops the Electron smoke step 10× on all three platforms (Linux, macOS, Windows). Each iteration runs the smoke tests; on the first non-zero exit the loop fails fast so the build fails (e.g. at run 4) while the existing diagnostics/artifact-upload steps still run and preserve logs + traces.

    • bash steps use set -e + for i in $(seq 1 10)
    • the PowerShell step checks $LASTEXITCODE after each run (PowerShell does not auto-fail on external command exit codes)
    • timeoutInMinutes bumped from 20 to 120 to accommodate 10 runs
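
The fail-fast behaviour of the bash steps can be sketched as below. This is a minimal illustration, not the pipeline's actual script: `flaky` is a hypothetical stand-in for the real smoke command and is hard-coded to fail on run 4, and the loop runs in a child `bash` so that `set -e` applies regardless of how the function is called.

```shell
# Sketch only: mimic a bash step that uses `set -e` plus a 10x loop.
# `flaky` is a hypothetical stand-in for the smoke test command.
run_loop_with_errexit() {
  bash -c '
    set -e                              # abort the script on the first non-zero exit
    flaky() { [ "$1" -ne 4 ]; }         # assumption: pretend run 4 is the flaky one
    for i in $(seq 1 10); do
      echo "Smoke test run $i of 10"
      flaky "$i"                        # a failure here ends the loop immediately
    done
  '
}
```

Calling `run_loop_with_errexit` prints the banner for runs 1 to 4 and then returns exit status 1, so runs 5 to 10 never start. Note that `set -e` is ignored when a command is tested by `if`, `&&`, `||` or `!`, which is one reason to keep the smoke command as a plain statement inside the loop.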

Notes for reviewers

  • The Browser and Remote smoke steps are left untouched (single run). With the suites commented out they effectively run nothing, but the flaky tests under investigation are Electron-only anyway.
  • Nothing here should be merged — once the flakiness is understood the changes will be reverted.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Temporary, do-not-merge investigation change to flush out smoke test
flakiness. Disables all smoke suites except Chat Sessions and Agents
Window in main.ts, and loops the Electron smoke test step 10 times on
Linux, macOS and Windows so a single flaky failure fails the build while
preserving uploaded logs/traces.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Copilot AI review requested due to automatic review settings June 20, 2026 21:50

Copilot AI left a comment

Pull request overview

Temporary investigation PR intended to reproduce and diagnose flakiness in the Electron smoke tests for Chat Sessions and the Agents Window by narrowing the executed smoke suites and running them repeatedly in CI.

Changes:

  • Commented out all smoke suites except Chat Sessions and Agents Window in test/smoke/src/main.ts.
  • Updated Azure Pipelines (Linux/macOS/Windows) to loop the Electron smoke test step 10 times with an increased timeout to surface intermittent failures faster.

| File | Description |
| --- | --- |
| test/smoke/src/main.ts | Disables most smoke suites, leaving only Chat Sessions + Agents Window setup enabled for Electron runs. |
| build/azure-pipelines/win32/steps/product-build-win32-test.yml | Loops Electron smoke tests 10× on Windows and increases timeout. |
| build/azure-pipelines/linux/steps/product-build-linux-test.yml | Loops Electron smoke tests 10× on Linux and increases timeout. |
| build/azure-pipelines/darwin/steps/product-build-darwin-test.yml | Loops Electron smoke tests 10× on macOS and increases timeout. |

Copilot's findings

  • Files reviewed: 4/4 changed files
  • Comments generated: 4

Comment thread test/smoke/src/main.ts
Comment thread build/azure-pipelines/win32/steps/product-build-win32-test.yml
Comment on lines +123 to +127
for i in $(seq 1 10); do
  echo "##[group]Smoke test run $i of 10"
  npm run smoketest-no-compile -- --tracing --build "$(agent.builddirectory)/VSCode-linux-$(VSCODE_ARCH)"
  echo "##[endgroup]"
done
Comment on lines +111 to +115
for i in $(seq 1 10); do
  echo "##[group]Smoke test run $i of 10"
  npm run smoketest-no-compile -- --tracing --build "$APP_ROOT/$APP_NAME"
  echo "##[endgroup]"
done
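
In both loops above, `##[endgroup]` is echoed only after the smoke command returns successfully. If the step runs under `set -e`, as the PR description says, a failing run would exit before the group is closed. One possible variant, sketched here under assumptions, captures the exit code, always closes the group, and then fails. `smoke_cmd` and `FAIL_AT` are hypothetical stand-ins for the real `npm run smoketest-no-compile` invocation, and it is not known whether the review comments asked for this.

```shell
# Hypothetical stand-in for the real smoke command: fails on run $FAIL_AT.
smoke_cmd() {
  [ "$1" -ne "${FAIL_AT:-0}" ]
}

# Run up to $1 iterations, closing each log group before failing fast.
grouped_loop() {
  local runs="$1" i status
  for i in $(seq 1 "$runs"); do
    echo "##[group]Smoke test run $i of $runs"
    status=0
    smoke_cmd "$i" || status=$?          # capture the exit code instead of aborting
    echo "##[endgroup]"                  # close the group whether the run passed or failed
    if [ "$status" -ne 0 ]; then
      echo "##[error]Smoke test run $i of $runs failed (exit code $status)"
      return "$status"                   # fail fast, same as the original loop
    fi
  done
}
```

With `FAIL_AT=4 grouped_loop 10`, four groups are opened and closed, an `##[error]` line names the failing run, and the function returns 1. The failing iteration then stays visible in the log without expanding a group.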
2 participants