Erroneous extra replay command when replaying mid-workflow tasks #1670

RamyElkest · 2024-10-14T10:44:33Z

This is more of a request for information than a bug.

Expected Behavior

Replaying a downloaded workflow history ending with workflow task (started) should not fail with a [TMPRL1100] nondeterministic workflow: extra replay command

Actual Behavior

Replaying a downloaded workflow history ending with workflow task (started) fails with an [TMPRL1100] nondeterministic workflow: extra replay command

Steps to Reproduce the Problem

Reproducing test and detailed explanation: RamyElkest#1

Specifications

Version: v1.26.0
Platform: v1.23.1

The text was updated successfully, but these errors were encountered:

RamyElkest · 2024-10-14T10:48:18Z

Solution
The proposed solution here is to trim scheduled/started/completed workflow tasks with no follow-up events, this guarantees the workflow history is in a safely replayable state. For this there are three approaches:

Trim the history in GetWorkflowHistory (to be discussed with upstream)

Trim the history in our code before passing it to the Replayer

Trim the history in the Replayer (to be discussed with upstream)

Curious if you have any thoughts / preferences here.

cretz · 2024-10-15T13:56:32Z

Thanks for the report! Will confer with the team on replaying of mid-task history captures. While it makes sense to only replay up to the last completed or failed task, we may need to double check that people aren't running replays on the active task without the task failure to replicate failures (e.g. to replicate deadlock detection).

cretz · 2024-10-16T20:32:04Z

Conferred with team, we consider this a bug. If we are in fact failing a replay with history that should succeed, we need to fix. It is likely we should not be performing history matching for non-determinism checks after the last task start (that doesn't have an end). This issue will be updated when we have a solution.

RamyElkest added the potential-bug label Oct 14, 2024

RamyElkest changed the title ~~Replaying partial histories~~ Extra replay command when replaying "partial histories" Oct 14, 2024

RamyElkest changed the title ~~Extra replay command when replaying "partial histories"~~ Erroneous extra replay command when replaying mid-workflow tasks Nov 6, 2024

RamyElkest added a commit to RamyElkest/sdk-go that referenced this issue Nov 6, 2024

Skip replaying incomplete workflow tasks (temporalio#1670)

6720811

yuandrew linked a pull request Dec 9, 2024 that will close this issue

Don't replay commands from non-completed task #1750

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Erroneous extra replay command when replaying mid-workflow tasks #1670

Erroneous extra replay command when replaying mid-workflow tasks #1670

RamyElkest commented Oct 14, 2024 •

edited

Loading

RamyElkest commented Oct 14, 2024

cretz commented Oct 15, 2024 •

edited

Loading

cretz commented Oct 16, 2024

Erroneous extra replay command when replaying mid-workflow tasks #1670

Erroneous extra replay command when replaying mid-workflow tasks #1670

Comments

RamyElkest commented Oct 14, 2024 • edited Loading

Expected Behavior

Actual Behavior

Steps to Reproduce the Problem

Specifications

RamyElkest commented Oct 14, 2024

cretz commented Oct 15, 2024 • edited Loading

cretz commented Oct 16, 2024

RamyElkest commented Oct 14, 2024 •

edited

Loading

cretz commented Oct 15, 2024 •

edited

Loading