feat: experimental retries #27930

AtofStryker · 2023-09-28T16:04:11Z

Closes N/A

Description

This feature branch his based of the feature/test-burn-in branch with the following commits reverted:

c428443 which introduces the experimentalBurnIn config option (PR). experimentalBurnIn is not ready for release yet and we have decided to omit the config options from the release of experimentalRetries
ae3df1a (PR) which removes the cloud burnInAction as it isn't needed for experimentalRetries release.
83842a6 Removes the update the the experimentalBurnIn snapshot which was updated with the v13 release

ALL COMMITS ON THIS BRANCH HAVE BEEN PREVIOUSLY REVIEWED (with the exception of f010aef)!

Goals / Risk

The goal of this feature branch is to release experimentalRetries ahead of experimentalBurnIn, which will allow users to try the new configuration options for themselves and report any bugs if found.

The secondary goal of releasing experimentalRetries ahead of burn in is to get users to experiment with the experimental feature and identify bugs that might occur with experimentalRetries. However, the primary goal is to detect any issues that might have occurred adapting the existing GA retries to the new retries engine implemented under the hood of the Cypress App.

It's important to understand that in order to support experimentalRetries, we needed to patch mocha in order to rerun passed test attempts. To accomplish this, we needed to change how retries fundamentally work, which means signficant changes to the retries mechanism. In short, this means that the logic within the new calculateTestStatus function backports the retries functionality into the new engine. An easy way to think about this would be:

A GA config of retries: 2 would map to a strategy of detect-flake-and-pass-on-threshold with a maxRetries of 2 and a passesRequired of 1

In addition, the new field, called _cypressTestStatusInfo, which is used by both the cypress reporter in console and in the browser, is still used in GA implementation to interpret the test outerStatus, and will fallback to the status of the last attempt.

It's important to note that there isn't currently anything signaling that the implementation changes are an issue, but we want to ship this out to a wider audience to reduce the scope of changes and have the impact be deterministic. If issues occur with the release of this feature, we can determine those issues with as little noise as possible, and if needed, rollback easily and assess what went wrong.

For more information on this, please see the following PRs that go into the implementation details more specifically:

feat: add new experimental retries configuration #27412 implements the experimental retries config options
feat: implement experimental retries #27826 implementation of experimental retries

The Feature

Experimental retries will allow users to leverage what we are calling a retry strategy. These strategies include

detect-flake-and-pass-on-threshold
detect-flake-but-always-fail

detect-flake-and-pass-on-threshold is similar to how retries work today. Users will be able to specify a retry limit and specify a required number of passes needed in order for that to be marked as passed. Today, that limit is 1, but users can set the limit to whatever they'd like. This can be useful if a test is retrying and is known to be flaky, but the test engineers wishes to make sure that the test can at least pass a certain threshold in order to be marked as passed, which might help in confidence that the test is just flaky and the failure is not legitimate. It's important to note that if the passesRequired can not be acheived by the remaining retries of the test, the test will stop retrying and the test will be marked as a failure.

The other, detect-flake-but-always-fail, will always mark a test that enters retries as failed, regardless of how many retried attempts pass. There is an option here, called stopIfAnyPassed, that will stop the test retries if any attempt passes. Without this option, a user might want to see what the probability that their test might pass (similar to detect-flake-and-pass-on-threshold, but mark the test as failed as the test engineer might not want to mark any detected flaky test as a passed test.

with `experimentalStrategy` and `experimentalOptions` enabled

with only `experimentalStrategy` enabled

with neither enabled

For reference, here are some configuration examples as to hopw the experimental options compare to what exists today, and how their outputs differ.

current implementation (GA)

retries: 2

Experimental options

Detect Flake and Pass on Threshold

experimentalStrategy: 'detect-flake-and-pass-on-threshold'
experimentalOptions: {
  maxRetries: 9,
  passesRequired: 5
}

Detect Flake but Always Fail

experimentalStrategy: 'detect-flake-but-always-fail'
experimentalOptions: {
  maxRetries: 5,
  stopIfAnyPassed: false
}

Detect Flake but Always Fail (`stopIfAnyPassed=true`)

experimentalStrategy: 'detect-flake-but-always-fail'
experimentalOptions: {
  maxRetries: 10,
  stopIfAnyPassed: true
}

Steps to test

For how this experiment is tested, please see the description of #27826, which uses several testing strategies and mechanisms to guarantee functionality

How has the user experience changed?

Users will now be able to provide experimentalStrategy and experimentalOptions in their global config and watch the new form of retries take hold in their test

PR Tasks

Have tests been added/updated?
Has a PR for user-facing changes been opened in cypress-documentation?
Have API changes been updated in the type definitions?

…test-burn-in

…st-burnin chore: merge develop into test burnin

* feat: add the burnIn Configuration to the config package. Option currently is a no-op * chore: make burn in experimental * chore: set experimentalBurnIn to false by default

* feat: implement the experimental retries configuration options to pair with test burn in * [run ci]

…chore/merge-develop-test-burn-in

…rn-in chore: merge develop test burn in

…chore/merge-develop

…rimentalflag

chore: merge develop into feature/test-burn-in

…chore/merge-develop-burnin

chore: merge develop burnin

* add burnInTestAction capability * feat: add burn in capability for cloud * chore: fix snapshot for record_spec

* chore: format the retries/runner snapshot files to make diff easier * feat: implement experimentalRetries strategies 'detect-flake-and-pass-on-threshold' and 'detect-flake-but-always-fail'. This should not be a breaking change, though it does modify mocha and the test object even when the experiment is not configured. This is to exercise the system and make sure things still work as expected even when we go GA. Test updates will follow in following commits. * chore: update snapshots from system tests and cy-in-cy tests that now have the cypress test metadata property _cypressTestStatusInfo. tests have been added in the fail-with-[before|after]each specs to visually see the suite being skipped when developing. * chore: add cy-in-cy tests to verify reporter behavior for pass/fail tests, as well as new mocha snapshots to verify attempts. New tests were needed for this as the 'retries' option in testConfigOverrides currently is and will be invalid for experiment and will function as an override. tests run in the cy-in-cy tests are using globally configured experimentalRetries for the given tested project, which showcases the different behavior between attempts/retries and pass/fail status. * chore: add unit test like driver test to verify the test object in mocha is decorated/handled properly in calculateTestStatus * chore: add sanity system tests to verify console reporter output for experimental retries logic. Currently there is a bug in the reporter where the logged status doesnt wait for the aftereach to complete, which impacts the total exitCode and printed status. * fix: aftereach console output. make sure to fail the test in the appropriate spot in runner.ts and not prematurely, which in turn updates the snapshots for cy-in-cy as the fail event comes later." * chore: address comments from code review * fix: make sure hook failures print outer status + attempts when the error is the hook itself. * chore: improve types within calculateTestStatus inside mocha.ts

…7377)" This reverts commit c428443.

This reverts commit ae3df1a.

…ure branch

…feature/experimental-retries

jennifer-shehane · 2023-09-28T17:23:41Z

system-tests/__snapshots__/results_spec.ts.js

@@ -14,6 +14,7 @@ exports['module api and after:run results'] = `
    "arch": "x64",
    "baseUrl": null,
    "blockHosts": null,
+    "experimentalBurnIn": false,


nit: Is it posssible to alphabetize this? I feel like Chris might have said it's not automatic. 😅

Regardless, approval given for this change as the codeowner of this file.

I don't think we can but I cant remember. Either way this should not be there, so I am going to revert the commit it was introduced which was bb5046c

reverted in 83842a6

… in experimentalflag" This reverts commit bb5046c.

Co-authored-by: Chris Breiding <[email protected]>

emilyrohrbough · 2023-10-23T16:06:40Z

packages/driver/src/cy/testConfigOverrides.ts

@@ -52,6 +52,22 @@ function setConfig (testConfig: ResolvedTestConfigOverride, config, localConfigO

      try {
        testConfig.applied = overrideLevel
+        // this is unique validation, not applied to the general cy config.


Should this logic be pushed up the validateOverridableAtRunTime method to ensure the stack & messaging is consistent with the existing override error messaging (found here). That is the place that ensure these levels are appropriate for the config passed in.

We can't declare invalid override levels for these keys because these are subkeys, and not defined as a primary config key. I can move the subkey detection to validateOverridableAtRunTime, but this logic will be removed in the next few weeks when we add support for setting experimental retry config at test & suite level overrides.

emilyrohrbough · 2023-10-23T16:08:46Z

packages/driver/src/cypress.ts

+        // If experimentalRetries are configured, a experimentalStrategy is present, and the retries configured is a boolean
+        // then we need to set the mocha '_retries' to 'maxRetries' present in the 'experimentalOptions' configuration.
+        if (testRetries['experimentalStrategy'] && _.isBoolean(retriesAsNumberOrBoolean) && retriesAsNumberOrBoolean) {
+          return testRetries['experimentalOptions'].maxRetries


If maxRetries are not set, does/should this fallback to the retriesAsNumberOrBoolean value? Or should this condition be verifying maxRetries is set?

This logic was previously reviewed (see PR description)

packages/driver/src/cypress.ts

ryanthemanuel · 2023-10-23T16:12:43Z

packages/driver/src/cypress/mocha.ts

+  let shouldAttemptsContinue: boolean = true
+  let outerTestStatus: 'passed' | 'failed' | undefined = undefined
+
+  const passedTests = _.filter(test.prevAttempts, (o) => o.state === 'passed')


Could probably do this and the next line in one loop and it'd be a shade more efficient. However, guessing that prevAttempts is likely not ever that large so ultimately doesn't matter. Up to you.

.github/workflows/update_v8_snapshot_cache.yml

cli/CHANGELOG.md

packages/app/src/settings/project/Experiments.vue

ryanthemanuel

Looks good! Couple of very minor things.

Co-authored-by: Ryan Manuel <[email protected]>

… experiments in Experimenets.vue

…tions to experiments in Experimenets.vue" This reverts commit b459aba.

emilyrohrbough · 2023-10-24T14:42:17Z

packages/driver/src/util/config.ts

@@ -103,7 +103,9 @@ export const validateConfig = (state: State, config: Record<string, any>, skipCo
    validateOverridableAtRunTime(config, isSuiteOverride, (validationResult) => {
      let errKey = 'config.cypress_config_api.read_only'

-      if (validationResult.supportedOverrideLevel === 'suite') {
+      if (validationResult.supportedOverrideLevel === 'global_only') {


Can def fix this later, but read_only & global_only should be the same thing. can't be updated at test-time so must be set in the configuration file.

at least that is how I intended it to be if it's not working that way 🙃

cypress-bot · 2023-10-31T16:19:20Z

Released in 13.4.0.

This comment thread has been locked. If you are still experiencing this issue after upgrading to
Cypress v13.4.0, please open a new issue.

AtofStryker and others added 19 commits July 24, 2023 11:19

chore: set up feature/test-burn-in feature branch

f6821e0

Merge branch 'develop' of github.com:cypress-io/cypress into feature/…

02ef3a3

…test-burn-in

Merge pull request #27400 from cypress-io/chore/merge-develop-into-te…

ba1a119

…st-burnin chore: merge develop into test burnin

feat: add burnIn Configuration option (currently a no-op) (#27377)

c428443

* feat: add the burnIn Configuration to the config package. Option currently is a no-op * chore: make burn in experimental * chore: set experimentalBurnIn to false by default

feat: add new experimental retries configuration (#27412)

ef84f03

* feat: implement the experimental retries configuration options to pair with test burn in * [run ci]

Merge branch 'develop' into feature/test-burn-in

334a955

fix cache invalidation [run ci]

209b719

Merge branch 'develop' of https://github.com/cypress-io/cypress into …

a802904

…chore/merge-develop-test-burn-in

Merge pull request #27538 from cypress-io/chore/merge-develop-test-bu…

f919931

…rn-in chore: merge develop test burn in

Merge branch 'develop' of https://github.com/cypress-io/cypress into …

cc0dd8a

…chore/merge-develop

fix snapshot added in v13 for module api to include test burn in expe…

bb5046c

…rimentalflag

chore: fix merge conflict

d08a7fa

Merge pull request #27751 from cypress-io/chore/merge-develop

62e6e11

chore: merge develop into feature/test-burn-in

Merge branch 'develop' of https://github.com/cypress-io/cypress into …

f4804dd

…chore/merge-develop-burnin

Merge pull request #27777 from cypress-io/chore/merge-develop-burnin

0fbfc28

chore: merge develop burnin

chore: add burnInTestAction capability (#27768)

ae3df1a

* add burnInTestAction capability * feat: add burn in capability for cloud * chore: fix snapshot for record_spec

Revert "feat: add burnIn Configuration option (currently a no-op) (#2…

5e8deb7

…7377)" This reverts commit c428443.

Revert "chore: add burnInTestAction capability (#27768)"

cde15d0

This reverts commit ae3df1a.

AtofStryker changed the title ~~Feature/experimental retries~~ feat: experimental retries Sep 28, 2023

AtofStryker added 2 commits September 28, 2023 12:53

chore: run snapshot and binary jobs against experimental retries feat…

c5abc90

…ure branch

Merge branch 'develop' of https://github.com/cypress-io/cypress into …

376c00f

…feature/experimental-retries

AtofStryker force-pushed the feature/experimental-retries branch from f010aef to 33591c1 Compare September 28, 2023 17:12

chore: add changelog entry (wip)

f866c5c

AtofStryker force-pushed the feature/experimental-retries branch from 33591c1 to f866c5c Compare September 28, 2023 17:13

AtofStryker marked this pull request as ready for review September 28, 2023 17:18

AtofStryker requested review from brian-mann and jennifer-shehane as code owners September 28, 2023 17:18

jennifer-shehane reviewed Sep 28, 2023

View reviewed changes

Revert "fix snapshot added in v13 for module api to include test burn…

83842a6

… in experimentalflag" This reverts commit bb5046c.

cacieprins and others added 2 commits October 20, 2023 16:51

Update packages/config/src/validation.ts

48c6734

Co-authored-by: Chris Breiding <[email protected]>

succinct changelog entry; links to docs for details

e89d821

chrisbreiding approved these changes Oct 23, 2023

View reviewed changes

testConfigOverride system test snapshots

067c265

emilyrohrbough reviewed Oct 23, 2023

View reviewed changes

ryanthemanuel reviewed Oct 23, 2023

View reviewed changes

packages/driver/src/cypress.ts Outdated Show resolved Hide resolved

ryanthemanuel reviewed Oct 23, 2023

View reviewed changes

.github/workflows/update_v8_snapshot_cache.yml Outdated Show resolved Hide resolved

ryanthemanuel reviewed Oct 23, 2023

View reviewed changes

cli/CHANGELOG.md Outdated Show resolved Hide resolved

ryanthemanuel reviewed Oct 23, 2023

View reviewed changes

packages/app/src/settings/project/Experiments.vue Show resolved Hide resolved

ryanthemanuel approved these changes Oct 23, 2023

View reviewed changes

cacieprins and others added 7 commits October 23, 2023 12:49

Update .github/workflows/update_v8_snapshot_cache.yml

4b0f9a2

Co-authored-by: Ryan Manuel <[email protected]>

Update cli/CHANGELOG.md

386af08

Co-authored-by: Ryan Manuel <[email protected]>

Update packages/driver/src/cypress.ts

0800b5f

Co-authored-by: Ryan Manuel <[email protected]>

updating cache-version

711b548

improve typescript usage when appending experimental retry options to…

b459aba

… experiments in Experimenets.vue

Revert "improve typescript usage when appending experimental retry op…

9c54013

…tions to experiments in Experimenets.vue" This reverts commit b459aba.

refactor test config override validation for experimental retry subkeys

577ae49

emilyrohrbough reviewed Oct 24, 2023

View reviewed changes

cacieprins and others added 6 commits October 24, 2023 13:43

account for error throw differences in browsers in system tests

1a17cad

Merge branch 'develop' into feature/experimental-retries

0ca0b19

Merge branch 'develop' into feature/experimental-retries

8d44661

bump circle cache

69e0761

bump circle cache again

5e85168

Merge branch 'develop' into feature/experimental-retries

d66f980

cacieprins merged commit 201e9f3 into develop Oct 26, 2023
8 of 9 checks passed

cacieprins deleted the feature/experimental-retries branch October 26, 2023 18:06

cypress-bot bot locked as resolved and limited conversation to collaborators Oct 31, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: experimental retries #27930

feat: experimental retries #27930

AtofStryker commented Sep 28, 2023 •

edited by cacieprins

Loading

jennifer-shehane Sep 28, 2023

AtofStryker Sep 28, 2023

AtofStryker Sep 28, 2023

emilyrohrbough Oct 23, 2023 •

edited

Loading

cacieprins Oct 23, 2023

emilyrohrbough Oct 23, 2023

cacieprins Oct 23, 2023

ryanthemanuel Oct 23, 2023

ryanthemanuel left a comment

emilyrohrbough Oct 24, 2023

emilyrohrbough Oct 24, 2023

cypress-bot bot commented Oct 31, 2023

feat: experimental retries #27930

feat: experimental retries #27930

Conversation

AtofStryker commented Sep 28, 2023 • edited by cacieprins Loading

Description

Goals / Risk

The Feature

with experimentalStrategy and experimentalOptions enabled

with only experimentalStrategy enabled

with neither enabled

current implementation (GA)

Experimental options

Detect Flake and Pass on Threshold

Detect Flake but Always Fail

Detect Flake but Always Fail (stopIfAnyPassed=true)

Steps to test

How has the user experience changed?

PR Tasks

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

emilyrohrbough Oct 23, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ryanthemanuel left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cypress-bot bot commented Oct 31, 2023

AtofStryker commented Sep 28, 2023 •

edited by cacieprins

Loading

with `experimentalStrategy` and `experimentalOptions` enabled

with only `experimentalStrategy` enabled

Detect Flake but Always Fail (`stopIfAnyPassed=true`)

emilyrohrbough Oct 23, 2023 •

edited

Loading