This introductory course covers the concepts and application methods of Vision-Language Models (VLMs), with hands-on practice running the LLaVA model in an Ollama-based environment and integrating it with the Model Context Protocol (MCP). The course walks through the principles of multimodal models, quantization, serving, and integrated demo development, offering a balanced mix of theory and practice.
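As a taste of the hands-on portion, the sketch below shows one way to query a locally served LLaVA model through Ollama's HTTP generate endpoint. It assumes Ollama is running on its default port (11434) and that the `llava` model has already been pulled; the image filename is a hypothetical placeholder.

```python
import base64
import requests

# Minimal sketch: send an image plus a prompt to a local LLaVA model via Ollama.
# Assumes Ollama is running locally and `ollama pull llava` has been done.
with open("sample.jpg", "rb") as f:  # hypothetical image path
    image_b64 = base64.b64encode(f.read()).decode("utf-8")

response = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "llava",
        "prompt": "Describe this image in one sentence.",
        "images": [image_b64],  # base64-encoded image for the multimodal model
        "stream": False,        # return a single JSON object instead of a stream
    },
)
print(response.json()["response"])
```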