[SYNTH-16456] Postpone reporting results on 404 #1480

Drarig29 · 2024-10-24T11:20:58Z

What and why?

There is sometimes some latency between batches and the poll results endpoint. When this happens, the poll results endpoint may return a 404 error because some queried results cannot be found, so the request ends up returning nothing.

Currently, we retry 3 times on 404 errors for that endpoint, and if the request never succeeds, datadog-ci quits and the CI fails.

How?

Extend our concept of incomplete results to also take care of results that have a 404: the reporting is postponed, then the incomplete results are fetched in next polling cycles.
We backup polling results in case the last polling cycle encounters a 404, because this last polling cycle has to fetch all results and we'll probably have fetched and reported most results already.
Like before, incomplete results may still be incomplete at the end of the batch: in that case, the result is reported without a server result and the following log is printed: The information for result {resultId} of test {publicId} was incomplete at the end of the batch.

Review checklist

Feature or bugfix MUST have appropriate tests (unit, integration)

etnbrd · 2024-10-24T16:26:31Z

src/commands/synthetics/batch.ts

+    isNonFinal: isNonFinalResult(resultInBatch),
+    location: getLocation(resultInBatch.location, test),
+    passed: hasResultPassed(resultInBatch, false, hasTimedOut, options),
+    result: {} as ServerResult,


If we don't need this, could we rather make it optional?

I'm afraid of how it could break customer's code. I'll check internally (web-ui and CI integrations)

etnbrd · 2024-10-24T16:29:04Z

src/commands/synthetics/batch.ts

@@ -268,6 +274,31 @@ const getResultFromBatch = (
  }
 }

+const getResultWithoutPollResult = (


Given the similarity between this function and the return statement from the above, could we merge the two, and add only the difference if pollResult is not available?

const resultFromBatch = { executionRule: resultInBatch.execution_rule, initialResultId: resultInBatch.initial_result_id, isNonFinal: isNonFinalResult(resultInBatch), location: getLocation(resultInBatch.location, test), passed: hasResultPassed(resultInBatch, isUnhealthy, hasTimedOut, options), resultId: getResultIdOrLinkedResultId(resultInBatch), retries: resultInBatch.retries || 0, maxRetries: resultInBatch.max_retries || 0, selectiveRerun: resultInBatch.selective_rerun, timedOut: hasTimedOut, } if (pollResult) { return { ...resultFromBatch, result: { ...pollResult.result, ...(safeDeadlineReached ? { failure: new BatchTimeoutRunawayError().toJson() passed: false } : timedOutRetry || hasTimedOut ? { failure: new {code: 'TIMEOUT', message: 'The batch timed out before receiving the result.'} passed: false } : {}) }, test: deepExtend({}, test, pollResult), timestamp: pollResult.timestamp } } else { return { ...resultFromBatch, test: deepExtend({}, test), timestamp: Date.now() } }

This code might not be the best solution, but will hopefully give you an idea.

This way would make it more concise, but i'm not sure if having the two functions is not actually clearer and easier to understand 🤔

I prefer the 2 functions as well tbh 😁

teodor2312

Apart from the question the Etienne has raised LGTM 👍

[SYNTH-16456] Postpone reporting results on 404

788ddd5

Drarig29 added the synthetics Related to [synthetics] label Oct 24, 2024

Drarig29 marked this pull request as ready for review October 24, 2024 14:22

Drarig29 requested review from a team as code owners October 24, 2024 14:22

etnbrd reviewed Oct 24, 2024

View reviewed changes

teodor2312 approved these changes Oct 25, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYNTH-16456] Postpone reporting results on 404 #1480

[SYNTH-16456] Postpone reporting results on 404 #1480

Drarig29 commented Oct 24, 2024 •

edited

Loading

etnbrd Oct 24, 2024

Drarig29 Oct 25, 2024

etnbrd Oct 24, 2024

teodor2312 Oct 25, 2024

Drarig29 Oct 25, 2024

teodor2312 left a comment

[SYNTH-16456] Postpone reporting results on 404 #1480

Are you sure you want to change the base?

[SYNTH-16456] Postpone reporting results on 404 #1480

Conversation

Drarig29 commented Oct 24, 2024 • edited Loading

What and why?

How?

Review checklist

etnbrd Oct 24, 2024

Choose a reason for hiding this comment

Drarig29 Oct 25, 2024

Choose a reason for hiding this comment

etnbrd Oct 24, 2024

Choose a reason for hiding this comment

teodor2312 Oct 25, 2024

Choose a reason for hiding this comment

Drarig29 Oct 25, 2024

Choose a reason for hiding this comment

teodor2312 left a comment

Choose a reason for hiding this comment

Drarig29 commented Oct 24, 2024 •

edited

Loading