upgrade to cuda9(Volta) GPU arch #118

Open
wants to merge 1 commit into base: master

dzhwinter
Collaborator

This PR fixes the CUDA 9 error and some issues in #117. Our branch has diverged from the original repo, so this PR is cherry-picked from my repo and can now be merged safely. Please review it when you have time, thanks!

dzhwinter mentioned this pull request Apr 28, 2018

t-vi commented Jul 4, 2018

@dzhwinter @wangkuiyi Could you shed some light on the mask you are using (0, if I read this correctly)? In most other places, shfl seems to be replaced by shfl_sync with all mask bits set. I would be most grateful!

@Diamondfan

This does not seem to work for me with CUDA 9.2 and a V100. I get "./test_cpu: error while loading shared libraries: libwarpctc.so: cannot open shared object file: No such file or directory". Any suggestions?

ThomasDelteil commented Oct 31, 2018

Can this PR be merged? WarpCTC fails on V100 otherwise.

Edit: with this branch I get:

./test_gpu
Running GPU tests
Some or all tests fail

aonotas commented Apr 8, 2019

@Diamondfan
Maybe you need to set LD_LIBRARY_PATH before running the tests:

$ cd build
$ export LD_LIBRARY_PATH=./:$LD_LIBRARY_PATH
$ ./test_gpu

I also get the same error as @ThomasDelteil:

$ ./test_gpu
Running GPU tests
Some or all tests fail

I am using CUDA 9.0 on a Volta V100.
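
For the libwarpctc.so loading error reported earlier in this thread, the following is a minimal sketch of a slightly more robust variant of the LD_LIBRARY_PATH export. The helper name prepend_ld_library_path is hypothetical (it is not part of warp-ctc), and the build directory name is assumed from the commands above. It resolves the directory to an absolute path, so the setting keeps working after a cd, and it avoids leaving an empty entry when LD_LIBRARY_PATH was previously unset.

```shell
# Hypothetical helper (not part of warp-ctc): prepend a directory to
# LD_LIBRARY_PATH as an absolute path, without adding an empty entry
# when the variable is unset or empty.
prepend_ld_library_path() {
    dir=$(cd "$1" && pwd) || return 1
    LD_LIBRARY_PATH="${dir}${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}"
    export LD_LIBRARY_PATH
}

# Usage, assuming the library was built in ./build as in the commands above:
#   prepend_ld_library_path build
#   ldd build/test_gpu | grep warpctc   # reports "not found" if still unresolved
#   build/test_gpu
```

This only addresses the loader error. It would not be expected to change the "Some or all tests fail" result, which appears after the binary has already started.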

PapaMadeleine2022 commented Apr 22, 2019

Use CUDA 10 and modify the code in CMakeLists.txt:

IF (CUDA_VERSION GREATER 7.6)
    set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_60,code=sm_60")
    set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_61,code=sm_61")
    set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_62,code=sm_62")
    set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_70,code=sm_70")
ENDIF()

It works.
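
The flags in the snippet above follow one pattern per target: compute_XY names the virtual architecture and sm_XY the real one, with sm_70 being the Volta (V100) target, which nvcc supports from CUDA 9 onward. As a rough sketch of that pattern, the hypothetical helper below (gencode_flags is an illustrative name, not something from warp-ctc or CMake) builds the same flag string from a list of compute capabilities.

```shell
# Hypothetical helper: print one -gencode flag per compute capability given,
# separated by single spaces, matching the pattern in the CMake snippet above.
gencode_flags() {
    flags=""
    for cc in "$@"; do
        flags="${flags:+$flags }-gencode arch=compute_${cc},code=sm_${cc}"
    done
    printf '%s\n' "$flags"
}

# Same targets as in the CMakeLists.txt change quoted above.
gencode_flags 60 61 62 70
```

Listing only the architectures you actually run on (for example, just 70 for a V100) should shorten compile time, at the cost of a binary that carries no native code for other GPUs.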
