Understanding LLM Architecture and GPU Utilization Strategies for AI Beginners
hyunjinkim
Understand Transformer-based LLM architectures and GPU utilization strategies, and gain hands-on experience with the actual serving process using vLLM. This course covers the entire practical workflow, from building AI system pipelines to monitoring and multi-GPU utilization, and is designed for intuitive understanding through diagrams and practice without complex formulas.
Basic
GPU, attention-model, AI











