Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Temperature unit test hang with multiple threads and ranks #279

Open
streeve opened this issue Jan 8, 2024 · 2 comments
Open

Temperature unit test hang with multiple threads and ranks #279

streeve opened this issue Jan 8, 2024 · 2 comments
Labels
bug Something isn't working

Comments

@streeve
Copy link
Collaborator

streeve commented Jan 8, 2024

ExaCA_Temperature_test_OPENMP_np_2_nt_2 failure - only seen once, but probably indicates an unlikely race condition

@streeve streeve added the bug Something isn't working label Jan 8, 2024
@MattRolchigo MattRolchigo reopened this Jan 30, 2024
@MattRolchigo
Copy link
Collaborator

Still unsure of the exact bug, but I've narrowed it down to testReadTemperatureData - some MPI ranks do not seem to store any of the appropriate temperature data (might not be reading the file at all?)

@MattRolchigo
Copy link
Collaborator

Observed an occurrence today of a similar hanging unit test, and an outright failure in checkTemperatureResults - this seemed to be Finch-ExaCA coupling specific, but could also be related

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants