├── .gitignore ├── LICENSE ├── README.md ├── assets ├── gpu_config_comparison.png ├── header.png ├── ip_title.png ├── ip_v2.png ├── pipeline.png └── speedup_and_example.png ├── eval └── eval_cuda.py ├── more_baselines ├── cuda_graph.json └── cudnn.json └── optimized_cuda_code ├── 3090.json ├── a100.json ├── codes ├── 3090.json ├── a100.json ├── h100.json ├── h20.json └── l40.json ├── h100.json ├── h20.json └── l40.json /.gitignore: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/.gitignore -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/LICENSE -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/README.md -------------------------------------------------------------------------------- /assets/gpu_config_comparison.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/assets/gpu_config_comparison.png -------------------------------------------------------------------------------- /assets/header.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/assets/header.png -------------------------------------------------------------------------------- /assets/ip_title.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/assets/ip_title.png -------------------------------------------------------------------------------- /assets/ip_v2.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/assets/ip_v2.png -------------------------------------------------------------------------------- /assets/pipeline.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/assets/pipeline.png -------------------------------------------------------------------------------- /assets/speedup_and_example.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/assets/speedup_and_example.png -------------------------------------------------------------------------------- /eval/eval_cuda.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/eval/eval_cuda.py -------------------------------------------------------------------------------- /more_baselines/cuda_graph.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/more_baselines/cuda_graph.json -------------------------------------------------------------------------------- /more_baselines/cudnn.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/more_baselines/cudnn.json -------------------------------------------------------------------------------- /optimized_cuda_code/3090.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/optimized_cuda_code/3090.json -------------------------------------------------------------------------------- /optimized_cuda_code/a100.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/optimized_cuda_code/a100.json -------------------------------------------------------------------------------- /optimized_cuda_code/codes/3090.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/optimized_cuda_code/codes/3090.json -------------------------------------------------------------------------------- /optimized_cuda_code/codes/a100.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/optimized_cuda_code/codes/a100.json -------------------------------------------------------------------------------- /optimized_cuda_code/codes/h100.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/optimized_cuda_code/codes/h100.json -------------------------------------------------------------------------------- /optimized_cuda_code/codes/h20.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/optimized_cuda_code/codes/h20.json -------------------------------------------------------------------------------- /optimized_cuda_code/codes/l40.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/optimized_cuda_code/codes/l40.json -------------------------------------------------------------------------------- /optimized_cuda_code/h100.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/optimized_cuda_code/h100.json -------------------------------------------------------------------------------- /optimized_cuda_code/h20.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/optimized_cuda_code/h20.json -------------------------------------------------------------------------------- /optimized_cuda_code/l40.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/deepreinforce-ai/CUDA-L1/HEAD/optimized_cuda_code/l40.json --------------------------------------------------------------------------------