Adding Iceberg Support #29569
Great -- thanks for digging in on this @byronellis!
…dedScan. Swapped test to use that.
…it into a new implementation instead.
…'s BatchLoad implementation since this is a pretty close analog. Has the beginnings of dynamic destination support, though it doesn't do triggered windows yet (pretty mechanical, just haven't done it yet). Successfully writes files and updates the catalog using a keyed PCollection to collect catalog updates. This appears to work much better than just doing it on bundle close, even in the test that was causing collisions and performance issues.
…for right now ("failed writes" are really spilled writes, not failures)
…o defer conversion to record and eliminates the need to pass through Row
…tifier from table.name(). If it matches the namespace of our catalog, remove the catalog part of the namespace first so things will work properly.
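The last commit message above is truncated, so the actual change isn't visible here. As a rough sketch of the idea it describes (dropping a leading catalog segment from the string returned by `table.name()` when it matches the catalog's own name), something like the following plain-string version could work. The class name, the `stripCatalogPrefix` helper, and the sample names are hypothetical and are not taken from the PR. The sketch deliberately avoids Iceberg types, so it only illustrates the prefix check.

```java
// Hypothetical sketch, not the PR's code: works on plain strings only.
class CatalogNameSketch {

    /**
     * Returns fullTableName without a leading "catalogName." segment.
     * If the name does not start with that segment, it is returned unchanged.
     */
    static String stripCatalogPrefix(String catalogName, String fullTableName) {
        String prefix = catalogName + ".";
        // Only strip when something remains after the prefix.
        if (fullTableName.startsWith(prefix) && fullTableName.length() > prefix.length()) {
            return fullTableName.substring(prefix.length());
        }
        return fullTableName;
    }

    public static void main(String[] args) {
        // Name qualified with the catalog: the catalog segment is removed.
        System.out.println(stripCatalogPrefix("prod", "prod.db.events"));
        // Name without the catalog segment: left as-is.
        System.out.println(stripCatalogPrefix("prod", "db.events"));
    }
}
```

Matching on `catalogName + "."` rather than on the bare catalog name keeps a namespace such as `production.events` from being mistaken for a `prod`-qualified name.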
This pull request has been marked as stale due to 60 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the [email protected] list. Thank you for your contributions.
This PR adds support for reads and writes via Iceberg to the Java SDK. At the moment this isn't intended to be used as a standalone IO, but rather to be integrated into a forthcoming catalog representation that should make it easier to work with more structured sources.