Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can find_all_pandas treat list of struct as nested dataframe? #170

Open
anentropic opened this issue Oct 12, 2023 · 1 comment
Open

Can find_all_pandas treat list of struct as nested dataframe? #170

anentropic opened this issue Oct 12, 2023 · 1 comment

Comments

@anentropic
Copy link

I have a mongo document which has a list field containing child documents.

Pandas data frames can be nested. And PyArrow has Table and RecordBatch types.

I would like to avoid having to call pandas.json_normalize on the child list and instead have find_all_pandas return directly a nested dataframe.

Would it be possible to use Table or RecordBatch type in the schema to get this behaviour?

@blink1073
Copy link
Member

Hi @anentropic, I've created a ticket to track this request: https://jira.mongodb.org/browse/ARROW-185

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants