├── .gitignore
├── Atomic_ubench
    └── Atomic_add
    │   ├── Atomic_add_bw
    │       ├── Makefile
    │       └── atomic_add_bw.cu
    │   ├── Atomic_add_bw_conflict
    │       ├── Makefile
    │       └── atomic_add_bw.cu
    │   └── Atomic_add_lat
    │       ├── Makefile
    │       └── atomic_add_lat.cu
├── MSHR
    └── mshr_v2.cu
├── Makefile
├── MaxFlops
    ├── Makefile
    └── MaxFlops.cu
├── MaxFlops_16
    ├── Makefile
    ├── MaxFlops_16
    └── MaxFlops_16.cu
├── MaxFlops_64
    ├── Makefile
    └── MaxFlops_64.cu
├── MaxFlops_int32
    ├── Makefile
    └── MaxFlops_int32.cu
├── MaxFlops_int64
    ├── Makefile
    └── MaxFlops_int64.cu
├── alu_lat_double
    ├── Makefile
    └── alu_lat_double.cu
├── alu_lat_float
    ├── Makefile
    └── alu_lat_float.cu
├── alu_lat_half
    ├── Makefile
    └── alu_lat_half.cu
├── alu_lat_int32
    ├── Makefile
    └── alu_lat_int32.cu
├── coalescer
    ├── Makefile
    └── l1_stride.cu
├── deviceQuery
    ├── Makefile
    ├── NsightEclipse.xml
    ├── deviceQuery.cpp
    └── readme.txt
├── kernel_lat
    ├── Makefile
    └── kernel_lat.cu
├── l1_bw_128
    ├── Makefile
    └── l1_bw_128.cu
├── l1_bw_32f
    ├── Makefile
    └── l1_bw_32f.cu
├── l1_bw_64f
    ├── Makefile
    └── l1_bw_64f.cu
├── l1_lat
    ├── Makefile
    └── l1_lat.cu
├── l1_sector
    ├── Makefile
    ├── data.csv
    ├── l1_sector.cu
    └── sass.txt
├── l1_shared_bw
    ├── Makefile
    └── l1_shared_bw.cu
├── l2_associativity
    └── l2_associativity.cu
├── l2_bw_128
    ├── Makefile
    └── l2_bw_128.cu
├── l2_bw_32f
    ├── Makefile
    └── l2_bw_32f.cu
├── l2_bw_64f
    ├── Makefile
    └── l2_bw_64f.cu
├── l2_copy_engine
    ├── Makefile
    └── l2_copy_engine.cu
├── l2_lat
    ├── Makefile
    └── l2_lat.cu
├── mem_bw
    ├── Makefile
    └── mem_bw.cu
├── mem_lat
    ├── Makefile
    └── mem_lat.cu
├── shared_bw
    ├── Makefile
    └── shared_bw.cu
├── shared_bw_64
    ├── Makefile
    └── shared_bw_64.cu
├── shared_lat
    ├── Makefile
    └── shared_lat.cu
├── tensor_lat_half
    ├── Makefile
    └── alu_lat_half.cu
└── write_policy
    ├── Makefile
    └── l2_write_policy.cu

/.gitignore:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/.gitignore
--------------------------------------------------------------------------------
/Atomic_ubench/Atomic_add/Atomic_add_bw/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/Atomic_ubench/Atomic_add/Atomic_add_bw/Makefile
--------------------------------------------------------------------------------
/Atomic_ubench/Atomic_add/Atomic_add_bw/atomic_add_bw.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/Atomic_ubench/Atomic_add/Atomic_add_bw/atomic_add_bw.cu
--------------------------------------------------------------------------------
/Atomic_ubench/Atomic_add/Atomic_add_bw_conflict/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/Atomic_ubench/Atomic_add/Atomic_add_bw_conflict/Makefile
--------------------------------------------------------------------------------
/Atomic_ubench/Atomic_add/Atomic_add_bw_conflict/atomic_add_bw.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/Atomic_ubench/Atomic_add/Atomic_add_bw_conflict/atomic_add_bw.cu
--------------------------------------------------------------------------------
/Atomic_ubench/Atomic_add/Atomic_add_lat/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/Atomic_ubench/Atomic_add/Atomic_add_lat/Makefile
--------------------------------------------------------------------------------
/Atomic_ubench/Atomic_add/Atomic_add_lat/atomic_add_lat.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/Atomic_ubench/Atomic_add/Atomic_add_lat/atomic_add_lat.cu
--------------------------------------------------------------------------------
/MSHR/mshr_v2.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/MSHR/mshr_v2.cu
--------------------------------------------------------------------------------
/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/Makefile
--------------------------------------------------------------------------------
/MaxFlops/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/MaxFlops/Makefile
--------------------------------------------------------------------------------
/MaxFlops/MaxFlops.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/MaxFlops/MaxFlops.cu
--------------------------------------------------------------------------------
/MaxFlops_16/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/MaxFlops_16/Makefile
--------------------------------------------------------------------------------
/MaxFlops_16/MaxFlops_16:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/MaxFlops_16/MaxFlops_16
--------------------------------------------------------------------------------
/MaxFlops_16/MaxFlops_16.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/MaxFlops_16/MaxFlops_16.cu
--------------------------------------------------------------------------------
/MaxFlops_64/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/MaxFlops_64/Makefile
--------------------------------------------------------------------------------
/MaxFlops_64/MaxFlops_64.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/MaxFlops_64/MaxFlops_64.cu
--------------------------------------------------------------------------------
/MaxFlops_int32/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/MaxFlops_int32/Makefile
--------------------------------------------------------------------------------
/MaxFlops_int32/MaxFlops_int32.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/MaxFlops_int32/MaxFlops_int32.cu
--------------------------------------------------------------------------------
/MaxFlops_int64/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/MaxFlops_int64/Makefile
--------------------------------------------------------------------------------
/MaxFlops_int64/MaxFlops_int64.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/MaxFlops_int64/MaxFlops_int64.cu
--------------------------------------------------------------------------------
/alu_lat_double/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/alu_lat_double/Makefile
--------------------------------------------------------------------------------
/alu_lat_double/alu_lat_double.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/alu_lat_double/alu_lat_double.cu
--------------------------------------------------------------------------------
/alu_lat_float/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/alu_lat_float/Makefile
--------------------------------------------------------------------------------
/alu_lat_float/alu_lat_float.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/alu_lat_float/alu_lat_float.cu
--------------------------------------------------------------------------------
/alu_lat_half/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/alu_lat_half/Makefile
--------------------------------------------------------------------------------
/alu_lat_half/alu_lat_half.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/alu_lat_half/alu_lat_half.cu
--------------------------------------------------------------------------------
/alu_lat_int32/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/alu_lat_int32/Makefile
--------------------------------------------------------------------------------
/alu_lat_int32/alu_lat_int32.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/alu_lat_int32/alu_lat_int32.cu
--------------------------------------------------------------------------------
/coalescer/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/coalescer/Makefile
--------------------------------------------------------------------------------
/coalescer/l1_stride.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/coalescer/l1_stride.cu
--------------------------------------------------------------------------------
/deviceQuery/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/deviceQuery/Makefile
--------------------------------------------------------------------------------
/deviceQuery/NsightEclipse.xml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/deviceQuery/NsightEclipse.xml
--------------------------------------------------------------------------------
/deviceQuery/deviceQuery.cpp:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/deviceQuery/deviceQuery.cpp
--------------------------------------------------------------------------------
/deviceQuery/readme.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/deviceQuery/readme.txt
--------------------------------------------------------------------------------
/kernel_lat/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/kernel_lat/Makefile
--------------------------------------------------------------------------------
/kernel_lat/kernel_lat.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/kernel_lat/kernel_lat.cu
--------------------------------------------------------------------------------
/l1_bw_128/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l1_bw_128/Makefile
--------------------------------------------------------------------------------
/l1_bw_128/l1_bw_128.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l1_bw_128/l1_bw_128.cu
--------------------------------------------------------------------------------
/l1_bw_32f/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l1_bw_32f/Makefile
--------------------------------------------------------------------------------
/l1_bw_32f/l1_bw_32f.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l1_bw_32f/l1_bw_32f.cu
--------------------------------------------------------------------------------
/l1_bw_64f/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l1_bw_64f/Makefile
--------------------------------------------------------------------------------
/l1_bw_64f/l1_bw_64f.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l1_bw_64f/l1_bw_64f.cu
--------------------------------------------------------------------------------
/l1_lat/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l1_lat/Makefile
--------------------------------------------------------------------------------
/l1_lat/l1_lat.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l1_lat/l1_lat.cu
--------------------------------------------------------------------------------
/l1_sector/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l1_sector/Makefile
--------------------------------------------------------------------------------
/l1_sector/data.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l1_sector/data.csv
--------------------------------------------------------------------------------
/l1_sector/l1_sector.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l1_sector/l1_sector.cu
--------------------------------------------------------------------------------
/l1_sector/sass.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l1_sector/sass.txt
--------------------------------------------------------------------------------
/l1_shared_bw/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l1_shared_bw/Makefile
--------------------------------------------------------------------------------
/l1_shared_bw/l1_shared_bw.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l1_shared_bw/l1_shared_bw.cu
--------------------------------------------------------------------------------
/l2_associativity/l2_associativity.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l2_associativity/l2_associativity.cu
--------------------------------------------------------------------------------
/l2_bw_128/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l2_bw_128/Makefile
--------------------------------------------------------------------------------
/l2_bw_128/l2_bw_128.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l2_bw_128/l2_bw_128.cu
--------------------------------------------------------------------------------
/l2_bw_32f/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l2_bw_32f/Makefile
--------------------------------------------------------------------------------
/l2_bw_32f/l2_bw_32f.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l2_bw_32f/l2_bw_32f.cu
--------------------------------------------------------------------------------
/l2_bw_64f/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l2_bw_64f/Makefile
--------------------------------------------------------------------------------
/l2_bw_64f/l2_bw_64f.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l2_bw_64f/l2_bw_64f.cu
--------------------------------------------------------------------------------
/l2_copy_engine/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l2_copy_engine/Makefile
--------------------------------------------------------------------------------
/l2_copy_engine/l2_copy_engine.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l2_copy_engine/l2_copy_engine.cu
--------------------------------------------------------------------------------
/l2_lat/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l2_lat/Makefile
--------------------------------------------------------------------------------
/l2_lat/l2_lat.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/l2_lat/l2_lat.cu
--------------------------------------------------------------------------------
/mem_bw/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/mem_bw/Makefile
--------------------------------------------------------------------------------
/mem_bw/mem_bw.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/mem_bw/mem_bw.cu
--------------------------------------------------------------------------------
/mem_lat/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/mem_lat/Makefile
--------------------------------------------------------------------------------
/mem_lat/mem_lat.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/mem_lat/mem_lat.cu
--------------------------------------------------------------------------------
/shared_bw/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/shared_bw/Makefile
--------------------------------------------------------------------------------
/shared_bw/shared_bw.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/shared_bw/shared_bw.cu
--------------------------------------------------------------------------------
/shared_bw_64/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/shared_bw_64/Makefile
--------------------------------------------------------------------------------
/shared_bw_64/shared_bw_64.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/shared_bw_64/shared_bw_64.cu
--------------------------------------------------------------------------------
/shared_lat/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/shared_lat/Makefile
--------------------------------------------------------------------------------
/shared_lat/shared_lat.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/shared_lat/shared_lat.cu
--------------------------------------------------------------------------------
/tensor_lat_half/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/tensor_lat_half/Makefile
--------------------------------------------------------------------------------
/tensor_lat_half/alu_lat_half.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/tensor_lat_half/alu_lat_half.cu
--------------------------------------------------------------------------------
/write_policy/Makefile:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/write_policy/Makefile
--------------------------------------------------------------------------------
/write_policy/l2_write_policy.cu:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/shen203/GPU_Microbenchmark/HEAD/write_policy/l2_write_policy.cu
--------------------------------------------------------------------------------