New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

containers:apply_to_tensors fails to return (or test) the application result on PackedSequence #996

Open

crutcher opened this issue May 26, 2022 · 1 comment

Assignees

Labels

FSDP

Contributor

crutcher commented May 26, 2022

At this point in apply_to_tensors(), the PackedSequence case drops the result tensors, unlike the other cases
https://github.com/facebookresearch/fairscale/blob/main/fairscale/utils/containers.py#L27

and thus fully_sharded_data_parallel is going to fail to capture the tensors for hooks here:
https://github.com/facebookresearch/fairscale/blob/main/fairscale/nn/data_parallel/fully_sharded_data_parallel.py#L1545

or properly yield casting results here:
https://github.com/facebookresearch/fairscale/blob/main/fairscale/nn/data_parallel/fully_sharded_data_parallel.py#L2490

The text was updated successfully, but these errors were encountered:

crutcher self-assigned this

min-xu-ai added the FSDP label

Contributor

min-xu-ai commented May 26, 2022

This is a care that our unit tests failed to cover, right?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment