Using or not using Pandas #147

SemyonSinchenko · 2023-11-16T16:16:59Z

SemyonSinchenko
Nov 16, 2023
Maintainer

Using pandas and pyarrow may improve the performance of collect operations (like columns_to_list). On the other side, both pandas and pyarrow are optional dependencies for PySpark SQL. Should we use them or not? And if we should, is it a good idea to separate any calls to these libs to allow other functions to work well?

@MrPowers @jeffbrennan

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using or not using Pandas #147

{{title}}

Replies: 0 comments

Select a reply

Using or not using Pandas #147

SemyonSinchenko Nov 16, 2023 Maintainer

Replies: 0 comments

SemyonSinchenko
Nov 16, 2023
Maintainer