Practical PyTorch Computer Vision: A Complete Guide from CNN to the Latest DETR

Name: Practical PyTorch Computer Vision: A Complete Guide from CNN to the Latest DETR
Price: 57750 KRW
Rating: 5 (7 reviews)

Are you feeling overwhelmed by practical application and interview preparation? Based on my industry experience, I will help you firmly grasp everything from CNN to DETR, with a strong focus on code.

(5.0) 7 reviews

98 learners

Level Intermediate

Course period Unlimited

YoungJea Oh

PyTorch

Computer Vision(CV)

CNN

PyTorch

Computer Vision(CV)

CNN

Reviews from Early Learners

5.0

Star Gu

100% enrolled

It was an informative class with kind and detailed explanations!!!

5.0

sunny75

100% enrolled

25/09/17/Wed 21:50 After listening to the lecture, I understood a lot about object recognition. I was always curious about how recognition works when watching object recognition videos... You've really created an excellent lecture. I usually don't listen to lectures on weekdays, but I listened to this lecture even on a weekday. ^^;; Thank you for creating such a great lecture.

5.0

원래그런거임

27% enrolled

I am a university student studying in a computer vision-related department. The lectures were meticulous, and above all, they helped me greatly by explaining in detail so that no ambiguous parts remained. While taking the course, I became interested in other lectures as well. However, it is true that the price range of the lectures is somewhat high, making it burdensome for me as a student. Regarding videos, it would be even better if there were additional updates on various computer vision technologies beyond object recognition. I will continue to diligently take the remaining lectures. Thank you for creating such excellent lectures.

What you will gain after the course

Capability to design and optimize high-performance CNN models based on PyTorch
Practical implementation skills for state-of-the-art object detection algorithms such as YOLO and DETR
Problem-solving techniques using data augmentation and transfer learning
Precision segmentation practice based on U-Net and Mask R-CNN

Latest Deep Learning-Based Image and Object Recognition Master Class

This course is a comprehensive program that systematically covers everything from basic concepts to the latest research achievements, focusing on the implementation of deep learning-based image and object recognition models. Students will learn step-by-step how to process image data, understand and implement Convolutional Neural Networks (CNN), and master transfer learning, object detection, and image segmentation using PyTorch.

First, we begin with the basics of PyTorch, a deep learning framework. You will understand the structure and operations of Tensors and the automatic differentiation feature, then use them to implement a basic neural network. Next, you will learn the concepts of Computer Vision, including image data structures, color representation methods (RGB, RGBA), and image augmentation techniques. Through this, you will prepare the model to learn robustly across various data environments.

In the core model training section, you will learn the structure of CNNs (Convolutional Neural Networks), convolution and pooling operations, and the concepts of padding and striding, followed by hands-on image classification practice using real-world datasets like CIFAR-10. Afterward, you will understand the evolution of major architectures such as AlexNet, VGG, ResNet, and EfficientNet, and cover transfer learning methods using pre-trained models. In particular, you will develop practical application skills through a transfer learning project using a COVID-19 X-ray dataset.

In the Object Detection course, you will compare and learn various algorithms such as the R-CNN family (Fast/Faster/Mask R-CNN), YOLO (You Only Look Once), SSD (Single Shot Detector), and DETR (Detection Transformer). You will understand the criteria for technology selection by studying the structural characteristics, differences in speed and accuracy, and real-world application cases of each model. By also covering the latest models like YOLOv11 and DETR, you can keep up with the trends in the rapidly evolving field of object detection.

Finally, you will learn segmentation techniques. You will learn the differences between Semantic Segmentation, Instance Segmentation, and Panoptic Segmentation, and experience pixel-level object segmentation through hands-on practice using U-Net and Mask R-CNN. By covering applications in various fields such as medical image analysis, autonomous driving, and satellite imagery, you will understand how the models you have learned are utilized in real-world industrial settings.

This course does not stop at simply listing theories; it is conducted in a way that involves directly executing code and practicing within the Google Colab environment. Therefore, upon completing the course, students will be equipped with the practical skills to handle real datasets and build, train, and evaluate models.

👉 Through this course, students will fully understand the core computer vision pipeline—from "Image Classification → Object Detection → Segmentation"—and gain the ability to apply the latest deep learning models.

After completing the course, you will be able to

You can directly implement the core pipelines of computer vision, from deep learning-based image classification → object detection → segmentation.
You will experience the entire process of loading actual datasets and training, evaluating, and improving models using PyTorch.
Beyond simple theoretical understanding, you will gain the practical skills to apply the latest object detection models, such as YOLO and DETR.
You will gain the ability to apply these skills across various industrial fields, such as medical imaging, autonomous driving, and satellite imagery.
You can build a strong advantage in job searching or research activities by adding your own practice code and project outcomes to your portfolio.

Key Strengths of This Course (2)

Balance between theory and practice: First, understand basic theories such as CNN's convolution and pooling concepts, and then proceed with hands-on practice using real datasets.
Connection to Practical Application: Covers cases applicable to industrial fields such as medical imaging, autonomous driving, and satellite image analysis.
Portfolio Creation Possible: You can build a personal portfolio through the practical project results, which directly helps with employment and research.

Recommended for
these people

Who is this course right for?

Learners who need a professional-level vision portfolio that goes beyond theory
Job seekers preparing for deep learning technical interviews and demonstrating practical skills.
Working developers who need to directly apply image recognition models to their services
Those who dream of becoming a professional vision engineer after learning the basics of PyTorch

Need to know before starting?

Python Programming Basics
Basic Knowledge of Vector and Matrix Operations
Basic Concepts of Machine Learning

Hello
This is YoungJea Oh

4,753

Learners

433

Reviews

158

Answers

4.7

Rating

Courses

I am a Senior Developer with extensive development experience. I would like to share the knowledge and experience I have accumulated over 30 years in the IT field, having worked at Hyundai Engineering & Construction's IT department, Samsung SDS, the e-commerce company Xmetrics, and Citibank's IT department. Currently, I am lecturing on Artificial Intelligence and Python.

Homepage Address:

https://ironmanciti.github.io/

Curriculum

All

44 lectures ∙ (11hr 5min)

Course Materials:

Lecture resources

Section 1. Preparing the Learning Environment

7 lectures ∙ (50min)

Section 2. Computer Vision / CNN Fundamentals

11 lectures ∙ (3hr 32min)

8. Computer Vision Overview 1 - Image Representation Methods, What is Object Detection?
14:48
9. Computer Vision Overview 2 - Semantic Segmentation, Image Augmentation Techniques
09:45
10. Practice - Image Augmentation Techniques
18:25
11. CNN (Convolutional Neural Network) 기초 1 - Convolution
16:46
12. CNN (Convolutional Neural Network) 기초 2 - Padding, Striding, Locality
16:32
13. CNN (Convolutional Neural Network) 기초 3 - Parameter Sharing, Pooling, Flattening
16:22
14. Practice - Color Image (CIFAR-10) Classification Using CNN
46:06
15. CNN Advanced - Introduction to EfficientNet
06:36
16. Practice - Using Pre-trained EfficientNet
12:10
17. Transfer Learning Theory
19:35
18. Practice - Creating a Transfer Learning Model (COVID-19 X-Ray Detection)
35:38

Section 3. 객체 탐지/분할(Object Detection/Segmentation) - Faster R-CNN / Mask R-CNN

8 lectures ∙ (2hr 14min)

19. Introduction to Object Detection Models
13:05
20. R-CNN Model Understanding - Region Proposal, Selective Search Technique Explanation
13:35
21. Practice - Selective Search Technique
08:20
22. Fast R-CNN 이해 - ROI (Region of Interest) 개념
12:33
23. R-CNN, Fast R-CNN, Faster R-CNN 차이점 이해 - RPN, Anchor Box
14:45
24. NMS (Non-max Suppression), IOU (Intersection Over Union) 개념 이해, Faster R-CNN 구조 요약
07:46
25. Mask R-CNN 이해
27:29
26. Practice - Fine-tuning Pre-trained Faster R-CNN / Mask R-CNN
36:42

Section 4. 객체 탐지/분할(Object Detection/Segmentation) - YOLO, SSD, U-Net

9 lectures ∙ (2hr 39min)

Section 5. Transformer-based Object Detection - DETR (Detection with Transformers)

9 lectures ∙ (1hr 48min)

Published: 09/05/2025

Last updated: 09/04/2025

Reviews

All

7 reviews

5.0

7 reviews

jyj1206
Reviews 2
∙
Average Rating 5.0
09/16/2025
Edited
5
27% enrolled
I am a university student studying in a computer vision-related department. The lectures were meticulous, and above all, they helped me greatly by explaining in detail so that no ambiguous parts remained. While taking the course, I became interested in other lectures as well. However, it is true that the price range of the lectures is somewhat high, making it burdensome for me as a student. Regarding videos, it would be even better if there were additional updates on various computer vision technologies beyond object recognition. I will continue to diligently take the remaining lectures. Thank you for creating such excellent lectures.
- trimurti
  Instructor
  09/16/2025
  Thank you for the good review. If you're facing financial burden as a student, please let me know which lecture you'd like to watch and I'll send you a discount coupon.
aceoftop1975
Reviews 122
∙
Average Rating 5.0
09/17/2025
5
100% enrolled
25/09/17/Wed 21:50 After listening to the lecture, I understood a lot about object recognition. I was always curious about how recognition works when watching object recognition videos... You've really created an excellent lecture. I usually don't listen to lectures on weekdays, but I listened to this lecture even on a weekday. ^^;; Thank you for creating such a great lecture.
- trimurti
  Instructor
  09/17/2025
  Thank you for the good course review.
lovesome994824
Reviews 3
∙
Average Rating 3.7
09/15/2025
5
61% enrolled
I found a lecture I really needed after such a long time Rather than various complex and difficult mathematical perspectives, showing it code-centered is refreshing. I can quickly learn what CNN is and how to utilize it The months I spent looking at various books and online lectures feel like such a waste As a bonus, if you had simply included image labeling work, etc., I think this would be the best vision lecture for many people struggling with vision
- trimurti
  Instructor
  09/15/2025
  Thank you for the positive feedback. I will also consider the advice you provided for the next course update.
starirene95758
Reviews 9
∙
Average Rating 4.6
04/16/2026
5
100% enrolled
It was an informative class with kind and detailed explanations!!!
yonsoo6259
Reviews 14
∙
Average Rating 5.0
03/27/2026
5
32% enrolled

YoungJea Oh's other courses

Check out other courses by the instructor!

From Introduction to Reinforcement Learning to Deep Q-learning/Policy Gradient

YoungJea Oh

Recently, all the remarkable achievements in the field of artificial intelligence are being announced in the area of reinforcement learning. This covers reinforcement learning technology—which is bringing about true innovation in AI such as robotics, autonomous driving, and humanoid machines—from basic to advanced levels in an easy-to-understand way for beginners.

Intermediate

Python, Deep Learning(DL), Reinforcement Learning(RL)

From Introduction to Reinforcement Learning to Deep Q-learning/Policy Gradient

YoungJea Oh

Understanding the Fundamentals and Operating Principles of Generative AI

YoungJea Oh

Understand the operating principles of generative AI models using deep learning and acquire practical application skills through hands-on exercises.

Intermediate

Python, openai, multimodal

Understanding the Fundamentals and Operating Principles of Generative AI

YoungJea Oh

Hands-on! Building Intermediate AI Agent Services with LangChain and LangGraph: From RAG to Multi-Agents

YoungJea Oh

Simple tutorials alone make it difficult to apply in practice. I will clearly pass on my professional know-how, covering complex state management and multi-agent design methods.

Basic

AI, ChatGPT, prompt engineering

Hands-on! Building Intermediate AI Agent Services with LangChain and LangGraph: From RAG to Multi-Agents

YoungJea Oh

Time series data processing using Python and deep learning

YoungJea Oh

Time is constantly flowing, and data is constantly accumulating. In this dynamic world, time series data plays a vital role in every aspect of our lives. From the volatility of financial markets to subtle signals of climate change, time series data captures it all. Now learn how to interpret and leverage this powerful data using the power of Python and deep learning!

Intermediate

AI, python3, Deep Learning(DL)

Time series data processing using Python and deep learning

YoungJea Oh

Introduction to Machine Learning and Deep Learning Using Python

YoungJea Oh

Do you want to take your first step into the world of data? Learn machine learning and deep learning, the core technologies of AI, with Python. This course will guide you step by step, from the basics of machine learning and deep learning to practical applications. Traditional machine learning and deep learning are based on many of the same principles and technical systems. Therefore, this course does not separate the two into separate subjects, but rather organizes them into one connected course so that beginners can increase their understanding of machine learning as a whole.

Beginner

Machine Learning(ML), Deep Learning(DL), Python

Introduction to Machine Learning and Deep Learning Using Python

YoungJea Oh

Practical LangChain Agent: AI Agent Workflows and Service Building for Intermediates

YoungJea Oh

If you only know LangChain, it's just a chatbot; but if you know LangGraph and Multi-Agent patterns, it becomes a service. An active AI instructor will share the know-how to build operational AI Agent workflows from start to finish—going beyond simple prompt calls to handling tools, maintaining memory, and incorporating human-in-the-loop verification—using 17 practice notebooks and 4 multi-agent patterns (Subagents, Handoffs, Router, and Skills).

Intermediate

Python, AI, AI Agent

Practical LangChain Agent: AI Agent Workflows and Service Building for Intermediates

YoungJea Oh

Follow along unconditionally from basic to advanced Python coding

YoungJea Oh

Are you new to coding or looking to take your coding skills to the next level? This course will introduce you to Python, a powerful and flexible language, designed for all levels of learners. Taught in a “follow-along” fashion, this course will allow you to learn by actually writing and running code. You’ll discover why Python is one of the most popular programming languages. From artificial intelligence to web development, learning the Python language offers endless possibilities, and it’s an opportunity to greatly expand your personal potential.

Beginner

Python, Pandas, Anaconda

Follow along unconditionally from basic to advanced Python coding

YoungJea Oh

Practical OpenAI SDK: AI Agent Workflows and Service Building for Intermediate Users

YoungJea Oh

Are you feeling stuck trying to implement an agent that goes beyond a simple chatbot? Based on my own trial and error and know-how from official documentation, I will show you how to build intermediate-level agents that can be immediately applied to real-world tasks.

Basic

AI, ChatGPT, prompt engineering

Practical OpenAI SDK: AI Agent Workflows and Service Building for Intermediate Users

YoungJea Oh

OpenAI API Practical Mastery: Designing and Deploying High-Performance AI Services for Intermediates

YoungJea Oh

Are you feeling stuck on how to apply what you've learned to real-world tasks even after mastering the basics? I will help you complete complex RAG and agent designs using my professional industry know-how.

Basic

Python, NLP, AI

OpenAI API Practical Mastery: Designing and Deploying High-Performance AI Services for Intermediates

YoungJea Oh

Hands-on! Building a Deep Learning-Based Recommendation System

YoungJea Oh

This course covers everything from the basic concepts of recommendation systems to the principles of applying deep learning. Develop your practical skills for recommendation service development by learning various recommendation algorithms, including collaborative filtering, content-based filtering, and hybrid recommendation systems!

Intermediate

Python, Machine Learning(ML), Deep Learning(DL)

Hands-on! Building a Deep Learning-Based Recommendation System

YoungJea Oh

Understanding the latest big data and artificial intelligence trends

YoungJea Oh

Find your way through the sea of data and explore the future of artificial intelligence. This course explores the two massive technological trends of Big Data and Artificial Intelligence (AI), examining how they are bringing innovation to our lives and work.

Beginner

Big Data, AI

Understanding the latest big data and artificial intelligence trends

YoungJea Oh

Hands-on! Machine Learning/Deep Learning Fraud Detection Master Class

YoungJea Oh

Do you know the theory but feel lost when applying it to real-world data? I will pass on practical techniques to solve complex fraudulent transactions directly with code, incorporating my professional know-how.

Intermediate

Machine Learning(ML), Deep Learning(DL)

Hands-on! Machine Learning/Deep Learning Fraud Detection Master Class

YoungJea Oh

Practical Financial Machine Learning: Building Intermediate Investment Strategies with Python

YoungJea Oh

Basic knowledge is not enough for real-world investing; build your own profit model using the machine learning know-how of a 10-year veteran quant.

Basic

Python, Machine Learning(ML)

Practical Financial Machine Learning: Building Intermediate Investment Strategies with Python

YoungJea Oh

Advanced Practical Deep Learning NLP: LLM Architecture and Fine-tuning in Practice

YoungJea Oh

Do you know the basics but feel stuck when it comes to practical application? I will clearly break down complex LLM structures by incorporating real-world industry experience.

Intermediate

Deep Learning(DL), Tensorflow, NLP

Advanced Practical Deep Learning NLP: LLM Architecture and Fine-tuning in Practice

YoungJea Oh

Introduction to Machine Learning/Deep Learning and Python Introductory Course for Learning

YoungJea Oh

You can acquire an overview of machine learning and deep learning, how to use basic tools, and the Python language knowledge necessary for learning in a short period of time.

Beginner

Machine Learning(ML), Deep Learning(DL), Python

Introduction to Machine Learning/Deep Learning and Python Introductory Course for Learning

YoungJea Oh

[Pytorch] Building Deep Learning Models Using PyTorch

YoungJea Oh

Learn how to build deep learning models yourself using intuitive and Pythonic PyTorch. This reflects the latest PyTorch version.

Basic

Deep Learning(DL), Artificial Neural Network, PyTorch

[Pytorch] Building Deep Learning Models Using PyTorch

YoungJea Oh

Machine Learning with JavaScript and Tensorflow.js

YoungJea Oh

JavaScript, which is known to all web developers, now explore the world of machine learning with this powerful language! This course learns how to build and deploy machine learning models using JavaScript and the powerful machine learning library Tensorflow.js. It guides you step-by-step through all the technologies required to develop web-based machine learning applications. Through this course, learners will systematically understand the core principles of machine learning. In addition, you will learn how to develop deep learning models using JavaScript and Tensorflow.js APIs, how to utilize transfer learning based on pre-trained models, and how to apply all this knowledge interactively in a browser environment.

Basic

Machine Learning(ML), Deep Learning(DL), JavaScript

Machine Learning with JavaScript and Tensorflow.js

YoungJea Oh

Similar courses

Explore other courses in the same field!

Deep Learning Next Generation Innovation Technology - Introduction to Physical Information Neural Networks and Pytorch Practice

dlbro

This is a lecture that studies the physical information neural network, one of the next-generation innovative technologies of deep learning, and implements it directly using Pytorch. Let's learn the next-generation innovative technology of artificial intelligence with me!

Basic

PyTorch, Deep Learning(DL), Machine Learning(ML)

Deep Learning Next Generation Innovation Technology - Introduction to Physical Information Neural Networks and Pytorch Practice

dlbro

From the concept of the latest deep learning technology Vision Transformer to Pytorch implementation

dlbro

This is a lecture that studies Vision Transformer, one of the latest deep learning technologies, and implements a paper using Pytorch. Come experience the new future of the vision field with me!

Intermediate

Vision Transformer, Deep Learning(DL), PyTorch

From the concept of the latest deep learning technology Vision Transformer to Pytorch implementation

dlbro

Deep Learning and PyTorch Bootcamp for Beginners (Easy! From Basics to ChatGPT's Core Transformer) [Data Analysis/Science Part 3]

funcoding

This is a newly designed course that allows you to gradually learn the mathematics, theory, PyTorch-based implementation, transfer learning, and GPT's core transformer needed to understand deep learning, based on the instructor's own failed experiences when first learning deep learning.

Basic

Deep Learning(DL), PyTorch, Machine Learning(ML)

Deep Learning and PyTorch Bootcamp for Beginners (Easy! From Basics to ChatGPT's Core Transformer) [Data Analysis/Science Part 3]

funcoding

Latest deep learning technology and object recognition

dlbro

This course will teach you from the early YOLO model, a real-time object recognition model, to the latest model. In addition, you will learn various deep learning techniques along with object recognition.

Basic

Deep Learning(DL), Computer Vision(CV)

Latest deep learning technology and object recognition

dlbro

License Plate Recognition Project and Deep Learning Image Recognition All-in-One with TensorFlow

AISchool

This is an all-in-one course where you can learn the entire process from the basics of deep learning, TensorFlow, and computer vision to practical applications through a real-world license plate recognition project. Through various hands-on exercises, you can develop the practical skills to apply the latest deep learning models to custom datasets.

Basic

Tensorflow, Deep Learning(DL), Machine Learning(ML)

License Plate Recognition Project and Deep Learning Image Recognition All-in-One with TensorFlow

AISchool

OpenCV + WebApp (Making a Face Eye Detection Web App)

Jeonghyun Kim

Let's create a web app that uploads an image and uses OpenCV to process it and detect faces.

Basic

Django, Web Application, OpenCV

OpenCV + WebApp (Making a Face Eye Detection Web App)

Jeonghyun Kim

[Mobile] Deep Learning Computer Vision Practical Project

nomad

AI Artificial Intelligence, it goes without saying that mobile is the trend. But how do you learn machine learning and deep learning and use them on mobile? Now, let's learn useful mobile Computer Vision projects that are used in daily life.

Intermediate

IONIC, Machine Learning(ML), Deep Learning(DL)

[Mobile] Deep Learning Computer Vision Practical Project

nomad

[Tensorflow2] Complete conquest of Python machine learning - Marathon record prediction project

nomad

This is a comprehensive machine learning project course that learns various useful machine learning regression and classification projects along with theory using Python and TensorFlow 2 based on Boston Marathon big data.

Intermediate

Tensorflow, Machine Learning(ML), Keras

[Tensorflow2] Complete conquest of Python machine learning - Marathon record prediction project

nomad

What if I were on the Titanic?! Building a Survival Probability Prediction AI Web Service with PyTorch & Next.js

dakgangjung123

Starting with the question, "If I were on the Titanic, would I have survived?", this course completes a full-stack project where you develop an AI model to predict survival probabilities based on real data and deploy it as a web service. You will gain hands-on experience in the entire process of AI and web development, from deep learning modeling using PyTorch to building a backend server with FastAPI and implementing a user interface with Next.js.

Intermediate

Python, Deep Learning(DL), PyTorch

What if I were on the Titanic?! Building a Survival Probability Prediction AI Web Service with PyTorch & Next.js

dakgangjung123

Building OCR that actually works in real-world scenarios, here's how to do it.

nexthumans

If you want to properly learn OCR technology that's truly used in practice, this one course is all you need! Aiming for over 98% accuracy even with unstructured documents and complex layouts, based on the latest SOTA models and real-world know-how, we build enterprise-level OCR projects together.

Basic

Python, AI, openai

Building OCR that actually works in real-world scenarios, here's how to do it.

nexthumans

Large Language Models, Just the Essentials!

haesunpark

This is a lecture covering LLM theory and practical examples based on <Large Language Models, Just the Essentials!> (Insight, 2025).

Beginner

Artificial Neural Network, PyTorch, LLM

Large Language Models, Just the Essentials!

haesunpark

Learning Transformer Through Implementation

dooleyz3525

From Multi-Head Attention to the Original Transformer model, BERT, the Encoder-Decoder based MarianMT translation model, and even Vision Transformer, you'll learn Transformer inside and out by implementing them directly in code.

Intermediate

Deep Learning(DL), PyTorch, encoder-decoder

Learning Transformer Through Implementation

dooleyz3525

Practical PyTorch Computer Vision: A Complete Guide from CNN to the Latest DETR

5.0

What you will gain after the course

Latest Deep Learning-Based Image and Object Recognition Master Class

Recommended for these people

After completing the course, you will be able to

Features of this course

Please introduce the key features and points of differentiation.

What you will learn

The creator of this course - Youngje Oh

Notes before taking the course

Practice Environment

Learning Materials

Prerequisites and Important Notes

Recommended for
these people

Hello
This is YoungJea Oh

Curriculum

Reviews

YoungJea Oh's other courses

Similar courses

Practical PyTorch Computer Vision: A Complete Guide from CNN to the Latest DETR

5.0

What you will gain after the course

Latest Deep Learning-Based Image and Object Recognition Master Class

Recommended for these people

After completing the course, you will be able to

Features of this course

Please introduce the key features and points of differentiation.

What you will learn

The creator of this course - Youngje Oh

Notes before taking the course

Practice Environment

Learning Materials

Prerequisites and Important Notes

Recommended for these people

HelloThis is YoungJea Oh

Curriculum

Reviews

YoungJea Oh's other courses

Similar courses

Recommended for
these people

Hello
This is YoungJea Oh