<Building and Learning LLM from Scratch> Code Commentary
haesunpark
This lecture walks through the code for <Building and Learning LLM from Scratch> (Gilbut, 2025). The code is available on GitHub (https://github.com/rickiepark/llm-from-scratch/). <Building and Learning LLM from Scratch> is the Korean translation of the bestseller <Build a Large Language Model (from Scratch)> (Manning, 2024) by Sebastian Raschka. The book teaches how large language models work, and how to use them, by building a complete model from scratch, starting with OpenAI's GPT-2 model.
Beginner
PyTorch, gpt-2, transformer