실습 중심

This is an introductory course for understanding the concept and application methods of Vision-Language Models (VLM), and practicing running the LLaVA model in an Ollama-based environment while integrating it with MCP (Model Context Protocol).

This course covers the principles of multimodal models, quantization, service development, and integrated demo development, providing a balanced mix of theory and hands-on practice.

dreamingbumblebee

입문

초급

중급이상

비전 트랜스포머

Vision Transformer

transformer

메타 대규모 언어 모델

Llama

Model Context Protocol

[VLM101] Building a Multimodal Chatbot with Fine-tuning (feat.MCP / RunPod)

[VLM101] Building a Multimodal Chatbot with Fine-tuning (feat.MCP / RunPod)

News