You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently we do not really care about large datasets, and there are few places where index/offsets computations would fail because of int32 overflow. The reason is that pyarrow uses int32 arrays for offsets.
There are two possible solutions for this:
Use pyarrow.LargeList
Smarter and wider usage of chunks.
The text was updated successfully, but these errors were encountered:
Currently we do not really care about large datasets, and there are few places where index/offsets computations would fail because of int32 overflow. The reason is that pyarrow uses int32 arrays for offsets.
There are two possible solutions for this:
pyarrow.LargeList
The text was updated successfully, but these errors were encountered: