Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[RNG] Split up some tests to prevent out-of-memory and device_lost crashes #1923

Merged
merged 7 commits into from
Nov 5, 2024

Conversation

egrabovskaya
Copy link
Contributor

@egrabovskaya egrabovskaya commented Oct 25, 2024

Here is the clarification from Matthew and a driver team member:
"This issue showed up as a regression after correcting the thread-EU ratio reported for BMG & LNL GTs. It directly affects the code which is used to calculate the private memory size needed by the kernels.
The ratio was increased from 8 to 16 to fit actual hardware, and by doing so a few SYCL CTS test failures were fixed (since they now have enough private memory for their kernels to use)."

It looks like kernels are reserved more scratch memory with new drivers leading to out-of-memory crashes when a large number of kernels are compiled and cached on the device in the tests.

ElenaTyuleneva
ElenaTyuleneva previously approved these changes Oct 30, 2024
Copy link
Contributor

@ElenaTyuleneva ElenaTyuleneva left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Changes looks good! Just a few minor comments from my side

@egrabovskaya
Copy link
Contributor Author

I removed extra code from geometric_distribution_test.pass as well.
From my point of view for bernoulli_distribution_test it makes no sense because of using std::vector inside of test and test_portion functions

@egrabovskaya egrabovskaya merged commit cbef9b5 into main Nov 5, 2024
22 checks passed
@egrabovskaya egrabovskaya deleted the dev/egrabovs/test_failures_fix branch November 5, 2024 18:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants