Avoid spawning an extra CUDA block
xqft committed Jul 11, 2023
1 parent fd90466 commit 1f0eff1
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion math/src/fft/gpu/cuda/ops.rs
@@ -29,7 +29,7 @@ where
     for stage in 0..order {
         const WARP_SIZE: usize = 32;
         let block_size = WARP_SIZE;
-        let block_count = (input.len() + block_size) / block_size;
+        let block_count = (input.len() + block_size - 1) / block_size;

         function.launch(block_count, block_size)?;
     }
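
The one-line change swaps `(len + block_size) / block_size` for the usual integer ceiling division `(len + block_size - 1) / block_size`. The old expression adds a surplus block whenever `len` is an exact multiple of `block_size`. Below is a minimal standalone sketch of the difference. The helper names `block_count_before` and `block_count_after` are hypothetical, made up for illustration, and are not part of `ops.rs`.

```rust
/// Formula used before the commit. It rounds up one step too far when
/// `len` is an exact multiple of `block_size`.
/// (Hypothetical helper, for illustration only.)
fn block_count_before(len: usize, block_size: usize) -> usize {
    (len + block_size) / block_size
}

/// Formula used after the commit: ceiling division, i.e. the smallest
/// number of blocks whose threads cover `len` elements.
/// (Hypothetical helper, for illustration only.)
fn block_count_after(len: usize, block_size: usize) -> usize {
    (len + block_size - 1) / block_size
}

fn main() {
    const WARP_SIZE: usize = 32;

    // Compare both formulas around multiples of the block size.
    for len in [1usize, 31, 32, 33, 64] {
        println!(
            "len = {:2}: before = {}, after = {}",
            len,
            block_count_before(len, WARP_SIZE),
            block_count_after(len, WARP_SIZE),
        );
    }
}
```

With 32 threads per block, an input of 32 elements yields 2 blocks under the old formula and 1 under the new one, while 33 elements yields 2 blocks under both. If the input length is a power of two of at least 32, it is always a multiple of the block size, so the old formula would launch one unneeded block at every stage.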