├── README.md
├── Talk 1 - A Crash course on GPU Optimization - Mark Saroufim - Meta corporation
    ├── Presentation Slides - A Crash course on GPU optimization.pdf
    ├── Speaker Q&A - A Crash course in GPU optimization.pdf
    └── Summary Notes - A Crash Course on GPU Optimization.pdf
├── Talk 2 - High Performance LLM Serving on Nvidia GPUs - Sharan Chetlur -Nvidia
    ├── Presentation Slides - High Performance LLM Serving on Nvidia GPUs.pdf
    ├── Speaker Q&A - High Performance LLM Serving on Nvidia GPUs.pdf
    └── Summary Notes - High Performance LLM Serving on Nvidia GPUs.pdf
├── Talk 3 - Block Based GPU Programming with Triton - Phil Tillet - OpenAI
    ├── Presentation Slides - Block Based GPU Programming with Triton.pdf
    ├── Speaker Q&A - Block Based GPU Programming using Triton.pdf
    └── Summary Notes - Block-based GPU Programming with Triton.pdf
├── Talk 4 - Scaling data processing from CPU to distributed GPU - William Malpica - Voltron Data
    ├── Presentation Slides - Scaling data processing from CPU to distributed GPU.pdf
    ├── Speaker Q&A - Scaling data processing from CPU to distributed GPUs.pdf
    ├── Speaker Q&A - Scaling data processing from CPUs to distributed GPUs.pdf
    └── Summary Notes - Scaling data processing from CPU to distributed GPUs.pdf
└── community-note.md

/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/README.md

--------------------------------------------------------------------------------
/Talk 1 - A Crash course on GPU Optimization - Mark Saroufim - Meta corporation/Presentation Slides - A Crash course on GPU optimization.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/Talk 1 - A Crash course on GPU Optimization - Mark Saroufim - Meta corporation/Presentation Slides - A Crash course on GPU optimization.pdf

--------------------------------------------------------------------------------
/Talk 1 - A Crash course on GPU Optimization - Mark Saroufim - Meta corporation/Speaker Q&A - A Crash course in GPU optimization.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/Talk 1 - A Crash course on GPU Optimization - Mark Saroufim - Meta corporation/Speaker Q&A - A Crash course in GPU optimization.pdf

--------------------------------------------------------------------------------
/Talk 1 - A Crash course on GPU Optimization - Mark Saroufim - Meta corporation/Summary Notes - A Crash Course on GPU Optimization.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/Talk 1 - A Crash course on GPU Optimization - Mark Saroufim - Meta corporation/Summary Notes - A Crash Course on GPU Optimization.pdf

--------------------------------------------------------------------------------
/Talk 2 - High Performance LLM Serving on Nvidia GPUs - Sharan Chetlur -Nvidia/Presentation Slides - High Performance LLM Serving on Nvidia GPUs.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/Talk 2 - High Performance LLM Serving on Nvidia GPUs - Sharan Chetlur -Nvidia/Presentation Slides - High Performance LLM Serving on Nvidia GPUs.pdf

--------------------------------------------------------------------------------
/Talk 2 - High Performance LLM Serving on Nvidia GPUs - Sharan Chetlur -Nvidia/Speaker Q&A - High Performance LLM Serving on Nvidia GPUs.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/Talk 2 - High Performance LLM Serving on Nvidia GPUs - Sharan Chetlur -Nvidia/Speaker Q&A - High Performance LLM Serving on Nvidia GPUs.pdf

--------------------------------------------------------------------------------
/Talk 2 - High Performance LLM Serving on Nvidia GPUs - Sharan Chetlur -Nvidia/Summary Notes - High Performance LLM Serving on Nvidia GPUs.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/Talk 2 - High Performance LLM Serving on Nvidia GPUs - Sharan Chetlur -Nvidia/Summary Notes - High Performance LLM Serving on Nvidia GPUs.pdf

--------------------------------------------------------------------------------
/Talk 3 - Block Based GPU Programming with Triton - Phil Tillet - OpenAI/Presentation Slides - Block Based GPU Programming with Triton.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/Talk 3 - Block Based GPU Programming with Triton - Phil Tillet - OpenAI/Presentation Slides - Block Based GPU Programming with Triton.pdf

--------------------------------------------------------------------------------
/Talk 3 - Block Based GPU Programming with Triton - Phil Tillet - OpenAI/Speaker Q&A - Block Based GPU Programming using Triton.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/Talk 3 - Block Based GPU Programming with Triton - Phil Tillet - OpenAI/Speaker Q&A - Block Based GPU Programming using Triton.pdf

--------------------------------------------------------------------------------
/Talk 3 - Block Based GPU Programming with Triton - Phil Tillet - OpenAI/Summary Notes - Block-based GPU Programming with Triton.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/Talk 3 - Block Based GPU Programming with Triton - Phil Tillet - OpenAI/Summary Notes - Block-based GPU Programming with Triton.pdf

--------------------------------------------------------------------------------
/Talk 4 - Scaling data processing from CPU to distributed GPU - William Malpica - Voltron Data/Presentation Slides - Scaling data processing from CPU to distributed GPU.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/Talk 4 - Scaling data processing from CPU to distributed GPU - William Malpica - Voltron Data/Presentation Slides - Scaling data processing from CPU to distributed GPU.pdf

--------------------------------------------------------------------------------
/Talk 4 - Scaling data processing from CPU to distributed GPU - William Malpica - Voltron Data/Speaker Q&A - Scaling data processing from CPU to distributed GPUs.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/Talk 4 - Scaling data processing from CPU to distributed GPU - William Malpica - Voltron Data/Speaker Q&A - Scaling data processing from CPU to distributed GPUs.pdf

--------------------------------------------------------------------------------
/Talk 4 - Scaling data processing from CPU to distributed GPU - William Malpica - Voltron Data/Speaker Q&A - Scaling data processing from CPUs to distributed GPUs.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/Talk 4 - Scaling data processing from CPU to distributed GPU - William Malpica - Voltron Data/Speaker Q&A - Scaling data processing from CPUs to distributed GPUs.pdf

--------------------------------------------------------------------------------
/Talk 4 - Scaling data processing from CPU to distributed GPU - William Malpica - Voltron Data/Summary Notes - Scaling data processing from CPU to distributed GPUs.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/Talk 4 - Scaling data processing from CPU to distributed GPU - William Malpica - Voltron Data/Summary Notes - Scaling data processing from CPU to distributed GPUs.pdf

--------------------------------------------------------------------------------
/community-note.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/mlops-discord/gpu-optimization-workshop/HEAD/community-note.md

--------------------------------------------------------------------------------