[prism] PrismRunnerTest::test_windowing - session windowing failing. #32085

lostluck · 2024-08-05T22:21:58Z

PrismRunnerTest::test_windowing is failing with

apache_beam.testing.util.BeamAssertException: Failed assert: [('k', [1, 2]), ('k', [100, 101, 102])] == [('k', [1, 2, 100, 101, 102])], unexpected elements [('k', [1, 2, 100, 101, 102])], missing elements [('k', [1, 2]), ('k', [100, 101, 102])] [while running 'assert_that/Match']

This happens because test_windowing validates session windows.

Examining this from prism's side, the issue is twofold: (1) the session merging logic is wrong, which leads to duplicated data, and (2) Python isn't encoding the timestamps properly.

For 1, the fix is to actually delete the old data references from the window map after extracting them, and to ensure the final merged data is put into the map afterwards.
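To make the shape of that fix concrete, here is a minimal sketch in plain Python. It is not prism's actual code (prism is written in Go); `merge_sessions`, the `window_map` layout, and the half-open `(start, end)` window tuples are all assumptions made for illustration. The two points it tries to show are that each old entry is removed from the map as it is extracted, and that the last merged window is written back once the loop ends.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical representation: a window is a half-open interval [start, end).
Window = Tuple[int, int]


def merge_sessions(window_map: Dict[Window, List[int]]) -> None:
    """Merge overlapping session windows in place (illustrative only)."""
    merged: Dict[Window, List[int]] = {}
    current: Optional[Window] = None
    current_data: List[int] = []

    # sorted() copies the keys, so popping from the map while looping is safe.
    for win in sorted(window_map):
        # Extract the data AND delete the old reference, so the same
        # elements cannot be emitted again under the pre-merge window.
        data = window_map.pop(win)
        if current is not None and win[0] < current[1]:
            # Overlaps the session being built: extend it.
            current = (current[0], max(current[1], win[1]))
            current_data.extend(data)
        else:
            # Gap found: flush the finished session and start a new one.
            if current is not None:
                merged[current] = current_data
            current = win
            current_data = list(data)

    # Make sure the final session is stored too, not only the earlier ones.
    if current is not None:
        merged[current] = current_data

    window_map.update(merged)


# Elements 1, 2, 100, 101, 102 with an assumed session gap of 10.
gap = 10
windows = {(t, t + gap): [t] for t in [1, 2, 100, 101, 102]}
merge_sessions(windows)
print(windows)
```

Skipping the `pop` would leave the pre-merge windows in the map next to the merged one, which is one way duplicated data could show up. Skipping the final write-back would drop the last session entirely.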

For 2, I can override the existing test just for prism for now while we figure out the type problems. The other runner suites don't seem to be overriding it, though. This might be a Python version thing I'm not familiar with.

I can also then enhance the test so there's a "middle" grouping, which will be a better test of the merging logic anyway.
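As a rough illustration of what a "middle" grouping adds, here is a sketch in plain Python that computes the expected sessions without running a pipeline. The extra timestamps 50 and 51, the gap of 10, and the `sessionize` helper are hypothetical, chosen only to show three separate sessions. They are not taken from the actual test change.

```python
from typing import List, Optional


def sessionize(timestamps: List[int], gap: int) -> List[List[int]]:
    """Group timestamps into sessions separated by at least `gap` (illustrative)."""
    groups: List[List[int]] = []
    end: Optional[int] = None  # exclusive end of the session being built
    for ts in sorted(timestamps):
        if end is not None and ts < end:
            # Falls inside the current session: join it and extend the end.
            groups[-1].append(ts)
            end = max(end, ts + gap)
        else:
            # Far enough from the previous element: start a new session.
            groups.append([ts])
            end = ts + gap
    return groups


# Hypothetical input with a middle group between the two original ones.
elements = [1, 2, 50, 51, 100, 101, 102]
expected = [("k", group) for group in sessionize(elements, gap=10)]
print(expected)
```

With a middle group, a merge bug that only handles the first or the last session correctly is more likely to surface, since the middle session has to be both flushed and followed by another one.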

It's not clear whether this would succeed in an unbounded context rather than a batch context, though. Sessions are similar to triggers in that respect.

lostluck mentioned this issue Aug 5, 2024

[Tracking Umbrella] Prism Runner areas for contribution. #29650

Open

lostluck added a commit to lostluck/beam that referenced this issue Aug 5, 2024

[apache#32085][prism] Fix session windowing.

6972e0c

lostluck mentioned this issue Aug 5, 2024

[#32085][prism] Fix session windowing. #32086

Merged

3 tasks

lostluck added a commit that referenced this issue Aug 6, 2024

[#32085][prism] Fix session windowing. (#32086)

e3e4454

lostluck closed this as completed in #32086 Aug 6, 2024

github-actions bot added this to the 2.59.0 Release milestone Aug 6, 2024
