Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Enable AVX implementations for 64 bit width inputs
The XMM registers are 128 bits, so that's only 2 rows per round. I implemented merging 4x 2 bits together so we can write back a byte. For that I increased the number of rounds for 64 bit inputs to 4 (instead of the default 2).
- Loading branch information