Update fbgemm
xuzhao9 committed Jul 2, 2024
1 parent 4d4e4a5 commit 40f0450
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion submodules/FBGEMM
Submodule FBGEMM updated 59 files
+8 −6 .github/scripts/fbgemm_gpu_install.bash
+4 −0 .github/scripts/fbgemm_gpu_test.bash
+13 −0 .github/scripts/utils_base.bash
+1 −5 .github/scripts/utils_build.bash
+9 −0 .github/scripts/utils_cuda.bash
+11 −0 cmake/modules/CudaSetup.cmake
+2 −1 fbgemm_gpu/FbgemmGpu.cmake
+162 −0 fbgemm_gpu/bench/quantize_ops_benchmark.py
+1 −1 fbgemm_gpu/codegen/inference/embedding_forward_quantized_cpu_template.cpp
+1 −1 fbgemm_gpu/codegen/training/backward/embedding_backward_split_cpu_approx_template.cpp
+1 −1 fbgemm_gpu/codegen/training/backward/embedding_backward_split_cpu_template.cpp
+1 −1 fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu.cpp
+42 −3 fbgemm_gpu/codegen/training/python/split_embedding_codegen_lookup_invoker.template
+47 −31 fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu.cpp
+2 −1 fbgemm_gpu/experimental/example/CMakeLists.txt
+29 −0 fbgemm_gpu/experimental/gemm/test/fp8_gemm_test.py
+212 −136 fbgemm_gpu/experimental/gemm/triton_gemm/fp8_gemm.py
+2 −1 fbgemm_gpu/experimental/gen_ai/CMakeLists.txt
+37 −1 fbgemm_gpu/experimental/gen_ai/bench/quantize_ops.py
+22 −16 fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_blockwise_gemm.hip
+22 −12 fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_gemm.hip
+307 −42 fbgemm_gpu/experimental/gen_ai/src/quantize/cutlass_extensions.cu
+1,597 −0 fbgemm_gpu/experimental/gen_ai/src/quantize/fp8_blockwise_cutlass_helpers.h
+29 −29 fbgemm_gpu/experimental/gen_ai/src/quantize/quantize.cpp
+15 −9 fbgemm_gpu/experimental/gen_ai/test/quantize/quantize_test.py
+70 −0 fbgemm_gpu/fbgemm_gpu/sparse_ops.py
+6 −0 fbgemm_gpu/fbgemm_gpu/tbe/ssd/training.py
+2 −0 fbgemm_gpu/fbgemm_gpu/tbe/utils/__init__.py
+1 −0 fbgemm_gpu/include/fbgemm_gpu/embedding_backward_template_helpers.cuh
+1 −0 fbgemm_gpu/include/fbgemm_gpu/embedding_forward_template_helpers.cuh
+4 −461 fbgemm_gpu/include/fbgemm_gpu/fbgemm_cuda_utils.cuh
+18 −0 fbgemm_gpu/include/fbgemm_gpu/sparse_ops.h
+0 −0 fbgemm_gpu/include/fbgemm_gpu/utils/bench_utils.cuh
+58 −0 fbgemm_gpu/include/fbgemm_gpu/utils/binary_search_range.cuh
+105 −0 fbgemm_gpu/include/fbgemm_gpu/utils/bitonic_sort.cuh
+0 −0 fbgemm_gpu/include/fbgemm_gpu/utils/cpu_utils.h
+177 −0 fbgemm_gpu/include/fbgemm_gpu/utils/cuda_prelude.cuh
+93 −0 fbgemm_gpu/include/fbgemm_gpu/utils/fixed_divisor.cuh
+109 −0 fbgemm_gpu/include/fbgemm_gpu/utils/inclusive_sum_scan.cuh
+0 −0 fbgemm_gpu/include/fbgemm_gpu/utils/topology_utils.h
+1 −0 fbgemm_gpu/src/input_combine_ops/input_combine.cu
+3 −0 fbgemm_gpu/src/jagged_tensor_ops/common.cuh
+25 −3 fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp
+1 −0 fbgemm_gpu/src/metric_ops/metric_ops.cu
+1 −1 fbgemm_gpu/src/quantize_ops/mx_common.cuh
+30 −40 fbgemm_gpu/src/quantize_ops/quantize_mx.cuh
+1 −0 fbgemm_gpu/src/sparse_ops/common.cuh
+23 −0 fbgemm_gpu/src/sparse_ops/sparse_ops_cpu.cpp
+22 −0 fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu
+2 −0 fbgemm_gpu/src/split_embeddings_cache/common.cuh
+1 −0 fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.cu
+1 −0 fbgemm_gpu/src/ssd_split_embeddings_cache/ssd_split_embeddings_cache_cuda.cu
+1 −1 fbgemm_gpu/src/topology_utils.cpp
+13 −0 fbgemm_gpu/test/merge_pooled_embeddings_test.py
+0 −1 fbgemm_gpu/test/tbe/training/backward_adagrad_common.py
+1 −1 fbgemm_gpu/test/tbe/training/backward_optimizers_test.py
+0 −1 fbgemm_gpu/test/tbe/training/backward_sgd_test.py
+1 −2 fbgemm_gpu/test/tbe/training/forward_test.py
+3 −4 fbgemm_gpu/test/tbe/utils/split_embeddings_utils_test.py
