Pull requests: intelligent-machine-learning/dlrover

Pull requests list

Optimize diagnosis process
#1405 opened Dec 23, 2024 by samplise
Reduce the sleep time to speed up UT.
#1403 opened Dec 22, 2024 by workingloong
[DO NOT MERGE]Fix code coverage
#1379 opened Dec 6, 2024 by BalaBalaYi
no need to detect cpu hang in torch training,
#1373 opened Dec 4, 2024 by majieyue
add customized step collector into dlrover (label: enhancement)
#1352 opened Nov 20, 2024 by majieyue (v0.4.0)
fix a bug in infer method (label: investigating)
#1340 opened Nov 16, 2024 by jlsong01
Handle GPU lost in resource monitor
#1335 opened Nov 15, 2024 by samplise (v0.4.0)
Add sockct close v2 (labels: enhancement, wip)
#1168 opened Jun 26, 2024 by yangrudan (v0.4.0)
add util for loss spike save and decode. (label: wait response)
#1044 opened Mar 21, 2024 by haikuotiankong1212