├── .gitignore ├── .gitmodules ├── CMakeLists.txt ├── README.md ├── compile_sass.py ├── memory ├── cache_linesize.cu ├── global_memory_bandwidth.cu ├── memory_bandwidth.cu └── memory_latency.cu ├── miscellany ├── reg_bankconflict.cu └── shared_bankconflict.cu ├── sass_cubin ├── cache_linesize.sass ├── memory_bandwidth_block.sass ├── memory_bandwidth_thread.sass ├── memory_latency.sass ├── reg_reuse_bankconflict.sass ├── reg_reuse_double.sass ├── reg_with_bankconflict.sass ├── reg_without_bankconflict.sass ├── shared_bankconflict.sass └── warp_schedule.sass ├── schedule ├── block_schedule.cu └── warp_schedule.cu └── utils ├── format_print.cuh ├── macro.cuh ├── ptx_export.cuh ├── sass_kernel.cuh └── utils.cuh /.gitignore: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/.gitignore -------------------------------------------------------------------------------- /.gitmodules: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/.gitmodules -------------------------------------------------------------------------------- /CMakeLists.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/CMakeLists.txt -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/README.md -------------------------------------------------------------------------------- /compile_sass.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/compile_sass.py -------------------------------------------------------------------------------- /memory/cache_linesize.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/memory/cache_linesize.cu -------------------------------------------------------------------------------- /memory/global_memory_bandwidth.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/memory/global_memory_bandwidth.cu -------------------------------------------------------------------------------- /memory/memory_bandwidth.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/memory/memory_bandwidth.cu -------------------------------------------------------------------------------- /memory/memory_latency.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/memory/memory_latency.cu -------------------------------------------------------------------------------- /miscellany/reg_bankconflict.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/miscellany/reg_bankconflict.cu -------------------------------------------------------------------------------- /miscellany/shared_bankconflict.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/miscellany/shared_bankconflict.cu -------------------------------------------------------------------------------- /sass_cubin/cache_linesize.sass: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/sass_cubin/cache_linesize.sass -------------------------------------------------------------------------------- /sass_cubin/memory_bandwidth_block.sass: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/sass_cubin/memory_bandwidth_block.sass -------------------------------------------------------------------------------- /sass_cubin/memory_bandwidth_thread.sass: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/sass_cubin/memory_bandwidth_thread.sass -------------------------------------------------------------------------------- /sass_cubin/memory_latency.sass: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/sass_cubin/memory_latency.sass -------------------------------------------------------------------------------- /sass_cubin/reg_reuse_bankconflict.sass: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/sass_cubin/reg_reuse_bankconflict.sass -------------------------------------------------------------------------------- /sass_cubin/reg_reuse_double.sass: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/sass_cubin/reg_reuse_double.sass -------------------------------------------------------------------------------- /sass_cubin/reg_with_bankconflict.sass: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/sass_cubin/reg_with_bankconflict.sass -------------------------------------------------------------------------------- /sass_cubin/reg_without_bankconflict.sass: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/sass_cubin/reg_without_bankconflict.sass -------------------------------------------------------------------------------- /sass_cubin/shared_bankconflict.sass: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/sass_cubin/shared_bankconflict.sass -------------------------------------------------------------------------------- /sass_cubin/warp_schedule.sass: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/sass_cubin/warp_schedule.sass -------------------------------------------------------------------------------- /schedule/block_schedule.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/schedule/block_schedule.cu -------------------------------------------------------------------------------- /schedule/warp_schedule.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/schedule/warp_schedule.cu -------------------------------------------------------------------------------- /utils/format_print.cuh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/utils/format_print.cuh -------------------------------------------------------------------------------- /utils/macro.cuh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/utils/macro.cuh -------------------------------------------------------------------------------- /utils/ptx_export.cuh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/utils/ptx_export.cuh -------------------------------------------------------------------------------- /utils/sass_kernel.cuh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/utils/sass_kernel.cuh -------------------------------------------------------------------------------- /utils/utils.cuh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sjfeng1999/gpu-arch-microbenchmark/HEAD/utils/utils.cuh --------------------------------------------------------------------------------