[Task]: Reenable single iteration #23043

damccorm · 2022-09-06T13:57:07Z

What needs to happen?

Single iteration was temporarily disabled in #23042 and should either be turned back on or ripped out entirely. This should also address the issue raised in #22933.

Issue Priority

Priority: 2

Issue Component

Component: sdk-go

lostluck · 2023-07-31T18:35:29Z

To replicate this locally, you need 2 terminals, starting from the root directory of the beam repo.

Terminal 1: Spark runner.

$ ./gradlew :runners:spark:3:job-server:runShadow

Terminal 2: Go SDK process.

$ cd sdks/go/test/regression
$ go1.20.4 test *.go -run ^TestLPErrorPipeline$ --environment_type=LOOPBACK --runner=universal --endpoint=localhost:8099

This sends just the test pipeline to the local Spark runner, allowing debugging from the SDK side along with whatever debugging process you like in the Go binary.

lostluck · 2023-07-31T19:51:13Z

OK, have determined what's going on.

The Spark runner always uses "multi-chunk" iterables, which isn't true of Flink or Prism (or even Dataflow, but that's harder to verify).

e.g. Spark:
[128 32 196 155 160 188 247 247 0 0 0 1 7 1 0 255 255 255 255 3 1 0 5 65 112 112 108 101 1 0 6 66 97 110 97 110 97 1 0 6 67 104 101 114 114 121 0]

vs Prism:
[128 32 196 155 160 188 247 247 0 0 0 1 15 1 0 0 0 0 3 8 1 0 5 65 112 112 108 101 9 1 0 6 66 97 110 97 110 97 9 1 0 6 67 104 101 114 114 121]

vs Flink:

[128 32 196 155 160 188 247 247 0 0 0 1 15 1 0 0 0 0 3 1 0 5 65 112 112 108 101 1 0 6 66 97 110 97 110 97 1 0 6 67 104 101 114 114 121]

That means the values always arrive with a -1 (the 255 255 255 255 in the encoded value from Spark) as the length in the chunk header, which enables the multi-chunk protocol, but without a state-backed iterable behind it. It's a bug on the Go SDK side because, outside of the state-backed case, I didn't think any runner implemented the multi-chunk protocol.

lostluck · 2023-07-31T20:29:16Z

The issue is that since the DoFn didn't drain the iterable, there were still bytes to be read when processing returned to the datasource.

So, a real bug, but on a disused path for most runners. #27762 has been filed to implement the behavior and test in prism, to allow future SDK devs to validate this behavior more easily.
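To illustrate why an undrained iterable matters, here is a small sketch under stated assumptions; it is not the change made in #27744. Two copies of the Spark iterable bytes from this thread are placed back to back, and a consumer takes only the first element of each. `lazyIter`, `firstOfEach`, and `readElem` are hypothetical names, and the element layout (two opaque bytes, then a varint-length-prefixed string) is inferred from the sample bytes.

```go
package main

import (
	"bytes"
	"encoding/binary"
	"fmt"
	"io"
)

type byteReader interface {
	io.Reader
	io.ByteReader
}

// readElem: ASSUMED layout, two opaque bytes then a varint-prefixed string.
func readElem(r byteReader) (string, error) {
	var hdr [2]byte
	if _, err := io.ReadFull(r, hdr[:]); err != nil {
		return "", err
	}
	n, err := binary.ReadUvarint(r)
	if err != nil {
		return "", err
	}
	buf := make([]byte, n)
	_, err = io.ReadFull(r, buf)
	return string(buf), err
}

// lazyIter yields the elements of one multi-chunk iterable on demand.
type lazyIter struct {
	r      byteReader
	remain uint64 // elements left in the current chunk
	done   bool   // true once the 0-sized terminator chunk was read
}

func newLazyIter(r byteReader) (*lazyIter, error) {
	var hdr [4]byte
	if _, err := io.ReadFull(r, hdr[:]); err != nil {
		return nil, err
	}
	if binary.BigEndian.Uint32(hdr[:]) != 0xFFFFFFFF {
		return nil, fmt.Errorf("expected -1 header, got %v", hdr)
	}
	return &lazyIter{r: r}, nil
}

// Next returns the next element, or ok=false at the end of the iterable.
func (it *lazyIter) Next() (s string, ok bool, err error) {
	if it.done {
		return "", false, nil
	}
	if it.remain == 0 {
		n, err := binary.ReadUvarint(it.r)
		if err != nil {
			return "", false, err
		}
		if n == 0 {
			it.done = true
			return "", false, nil
		}
		it.remain = n
	}
	it.remain--
	s, err = readElem(it.r)
	return s, err == nil, err
}

// Drain consumes whatever the caller left unread, including the terminator.
func (it *lazyIter) Drain() error {
	for {
		if _, ok, err := it.Next(); err != nil || !ok {
			return err
		}
	}
}

// firstOfEach reads only the first element of two consecutive iterables.
func firstOfEach(stream []byte, drain bool) ([]string, error) {
	r := bytes.NewReader(stream)
	var out []string
	for i := 0; i < 2; i++ {
		it, err := newLazyIter(r)
		if err != nil {
			return out, err
		}
		s, _, err := it.Next()
		if err != nil {
			return out, err
		}
		out = append(out, s)
		if drain {
			if err := it.Drain(); err != nil {
				return out, err
			}
		}
	}
	return out, nil
}

func main() {
	one := []byte{255, 255, 255, 255, 3, 1, 0, 5, 65, 112, 112, 108, 101,
		1, 0, 6, 66, 97, 110, 97, 110, 97, 1, 0, 6, 67, 104, 101, 114, 114, 121, 0}
	two := append(append([]byte{}, one...), one...)
	fmt.Println(firstOfEach(two, true))  // drained: both iterables decode
	fmt.Println(firstOfEach(two, false)) // undrained: second header is misread
}
```

Without the drain, the reader is still positioned inside the first iterable when the next header is expected, so leftover element bytes are interpreted as a header and decoding fails.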

* [#23043] Re-enable single iteration for the Go SDK.
* more debuging
* don't drop plan
* debug text.
* Fix beam23043
* clean up debugging.
* update unit test.
* go fmt

---------

Co-authored-by: lostluck <[email protected]>

damccorm added task awaiting triage labels Sep 6, 2022

github-actions bot added go P2 labels Sep 6, 2022

lostluck self-assigned this Sep 6, 2022

lostluck removed the awaiting triage label Sep 6, 2022

github-actions bot added the stale label Nov 6, 2022

damccorm removed the stale label Dec 2, 2022

lostluck mentioned this issue Jul 29, 2023

[#23043] Re-enable single iteration for the Go SDK. #27744

Merged

3 tasks

lostluck added a commit to lostluck/beam that referenced this issue Jul 31, 2023

[apache#23043] Re-enable single iteration for the Go SDK.

52ad21b

lostluck mentioned this issue Jul 31, 2023

[Feature Request][prism]: Add support for the multi-chunk iterable protocol #27762

Open

15 tasks

lostluck closed this as completed in #27744 Aug 1, 2023

github-actions bot added this to the 2.50.0 Release milestone Aug 1, 2023
