Practical PyTorch Computer Vision: A Complete Guide from CNN to the Latest DETR

Are you feeling overwhelmed by practical application and interview preparation? Based on my industry experience, I will help you firmly grasp everything from CNN to DETR, with a strong focus on code.

(5.0) 7 reviews

95 learners

Level Intermediate

Course period Unlimited

PyTorch
PyTorch
Computer Vision(CV)
Computer Vision(CV)
CNN
CNN
PyTorch
PyTorch
Computer Vision(CV)
Computer Vision(CV)
CNN
CNN

Reviews from Early Learners

5.0

5.0

Star Gu

100% enrolled

It was an informative class with kind and detailed explanations!!!

5.0

sunny75

100% enrolled

25/09/17/Wed 21:50 After listening to the lecture, I understood a lot about object recognition. I was always curious about how recognition works when watching object recognition videos... You've really created an excellent lecture. I usually don't listen to lectures on weekdays, but I listened to this lecture even on a weekday. ^^;; Thank you for creating such a great lecture.

5.0

원래그런거임

27% enrolled

I am a university student studying in a computer vision-related department. The lectures were meticulous, and above all, they helped me greatly by explaining in detail so that no ambiguous parts remained. While taking the course, I became interested in other lectures as well. However, it is true that the price range of the lectures is somewhat high, making it burdensome for me as a student. Regarding videos, it would be even better if there were additional updates on various computer vision technologies beyond object recognition. I will continue to diligently take the remaining lectures. Thank you for creating such excellent lectures.

What you will gain after the course

  • Capability to design and optimize high-performance CNN models based on PyTorch

  • Practical implementation skills for state-of-the-art object detection algorithms such as YOLO and DETR

  • Problem-solving techniques using data augmentation and transfer learning

  • Precision segmentation practice based on U-Net and Mask R-CNN

Latest Deep Learning-Based Image and Object Recognition Master Class

This course is a comprehensive program that systematically covers everything from basic concepts to the latest research achievements, focusing on the implementation of deep learning-based image and object recognition models. Students will learn step-by-step how to process image data, understand and implement Convolutional Neural Networks (CNN), and master transfer learning, object detection, and image segmentation using PyTorch.

First, we begin with the basics of PyTorch, a deep learning framework. You will understand the structure and operations of Tensors and the automatic differentiation feature, then use them to implement a basic neural network. Next, you will learn the concepts of Computer Vision, including image data structures, color representation methods (RGB, RGBA), and image augmentation techniques. Through this, you will prepare the model to learn robustly across various data environments.

In the core model training section, you will learn the structure of CNNs (Convolutional Neural Networks), convolution and pooling operations, and the concepts of padding and striding, followed by hands-on image classification practice using real-world datasets like CIFAR-10. Afterward, you will understand the evolution of major architectures such as AlexNet, VGG, ResNet, and EfficientNet, and cover transfer learning methods using pre-trained models. In particular, you will develop practical application skills through a transfer learning project using a COVID-19 X-ray dataset.

In the Object Detection course, you will compare and learn various algorithms such as the R-CNN family (Fast/Faster/Mask R-CNN), YOLO (You Only Look Once), SSD (Single Shot Detector), and DETR (Detection Transformer). You will understand the criteria for technology selection by studying the structural characteristics, differences in speed and accuracy, and real-world application cases of each model. By also covering the latest models like YOLOv11 and DETR, you can keep up with the trends in the rapidly evolving field of object detection.

Finally, you will learn segmentation techniques. You will learn the differences between Semantic Segmentation, Instance Segmentation, and Panoptic Segmentation, and experience pixel-level object segmentation through hands-on practice using U-Net and Mask R-CNN. By covering applications in various fields such as medical image analysis, autonomous driving, and satellite imagery, you will understand how the models you have learned are utilized in real-world industrial settings.

This course does not stop at simply listing theories; it is conducted in a way that involves directly executing code and practicing within the Google Colab environment. Therefore, upon completing the course, students will be equipped with the practical skills to handle real datasets and build, train, and evaluate models.

👉 Through this course, students will fully understand the core computer vision pipeline—from "Image Classification → Object Detection → Segmentation"—and gain the ability to apply the latest deep learning models.

Recommended for these people

Who should take this course (1)

  • Those who are curious about deep learning and computer vision but feel overwhelmed about where to start

  • Those who want to systematically learn concepts like CNN, transfer learning, and object detection from the very beginning

Who should take this course (2)

  • Those who want to directly implement models with real-world datasets using PyTorch

  • Those who want to learn through practice-oriented sessions rather than just theory and "verify through code"

Who should take this course (3)

  • Those who want to apply the latest object detection models (YOLO, DETR, etc.) and segmentation techniques to practical work.

  • Students, developers, and researchers considering career expansion into the AI/Machine Learning field

After completing the course, you will be able to

  • You can directly implement the core pipelines of computer vision, from deep learning-based image classification → object detection → segmentation.

  • You will experience the entire process of loading actual datasets and training, evaluating, and improving models using PyTorch.

  • Beyond simple theoretical understanding, you will gain the practical skills to apply the latest object detection models, such as YOLO and DETR.

  • You will gain the ability to apply these skills across various industrial fields, such as medical imaging, autonomous driving, and satellite imagery.

  • You can build a strong advantage in job searching or research activities by adding your own practice code and project outcomes to your portfolio.

Features of this course

Please introduce the key features and points of differentiation.

특징이미지_1

Key Strengths of This Course (1)

  • Practice-oriented: Instead of staying within theory, you will practice by writing code directly in the Google Colab environment.

  • Easy and Systematic Explanations: Learn step-by-step from PyTorch basics to CNN, object detection, and segmentation, making it easy for even beginners to follow along.

  • Covers the latest models: Including the latest research achievements such as YOLO and DETR, you won't miss out on the rapidly evolving trends in computer vision.

Key Strengths of This Course (2)

  • Balance between theory and practice: First, understand basic theories such as CNN's convolution and pooling concepts, and then proceed with hands-on practice using real datasets.

  • Connection to Practical Application: Covers cases applicable to industrial fields such as medical imaging, autonomous driving, and satellite image analysis.

  • Portfolio Creation Possible: You can build a personal portfolio through the practical project results, which directly helps with employment and research.

What you will learn

The creator of this course - Youngje Oh

  • 2019 ~ Present: Professional AI Instructor

  • 2001~2019: IT development and operations in the field

  • 2020–Present: Currently teaching online/offline courses

  • Currently operating 14 artificial intelligence courses on Inflearn

Notes before taking the course

Practice Environment

  • The lectures use Google Colab, so they can be taken regardless of whether you use Windows or MacOS.


Learning Materials

  • Provided as PDF files and GitHub links!

Prerequisites and Important Notes

  • Basic Python syntax

  • Basic knowledge of machine learning

  • This course is for intermediate learners.

Recommended for
these people

Who is this course right for?

  • Learners who need a professional-level vision portfolio that goes beyond theory

  • Job seekers preparing for deep learning technical interviews and demonstrating practical skills.

  • Working developers who need to directly apply image recognition models to their services

  • Those who dream of becoming a professional vision engineer after learning the basics of PyTorch

Need to know before starting?

  • Python Programming Basics

  • Basic Knowledge of Vector and Matrix Operations

  • Basic Concepts of Machine Learning

Hello
This is YoungJea Oh

4,679

Learners

423

Reviews

158

Answers

4.7

Rating

18

Courses

I am a Senior Developer with extensive development experience. I would like to share the knowledge and experience I have accumulated over 30 years in the IT field, having worked at Hyundai Engineering & Construction's IT department, Samsung SDS, the e-commerce company Xmetrics, and Citibank's IT department. Currently, I am lecturing on Artificial Intelligence and Python.

Homepage Address:

https://ironmanciti.github.io/

More

Curriculum

All

44 lectures ∙ (11hr 5min)

Course Materials:

Lecture resources
Published: 
Last updated: 

Reviews

All

7 reviews

5.0

7 reviews

  • jyj1206님의 프로필 이미지
    jyj1206

    Reviews 2

    Average Rating 5.0

    Edited

    5

    27% enrolled

    I am a university student studying in a computer vision-related department. The lectures were meticulous, and above all, they helped me greatly by explaining in detail so that no ambiguous parts remained. While taking the course, I became interested in other lectures as well. However, it is true that the price range of the lectures is somewhat high, making it burdensome for me as a student. Regarding videos, it would be even better if there were additional updates on various computer vision technologies beyond object recognition. I will continue to diligently take the remaining lectures. Thank you for creating such excellent lectures.

    • trimurti
      Instructor

      Thank you for the good review. If you're facing financial burden as a student, please let me know which lecture you'd like to watch and I'll send you a discount coupon.

  • lovesome994824님의 프로필 이미지
    lovesome994824

    Reviews 3

    Average Rating 3.7

    5

    61% enrolled

    I found a lecture I really needed after such a long time Rather than various complex and difficult mathematical perspectives, showing it code-centered is refreshing. I can quickly learn what CNN is and how to utilize it The months I spent looking at various books and online lectures feel like such a waste As a bonus, if you had simply included image labeling work, etc., I think this would be the best vision lecture for many people struggling with vision

    • trimurti
      Instructor

      Thank you for the positive feedback. I will also consider the advice you provided for the next course update.

  • aceoftop1975님의 프로필 이미지
    aceoftop1975

    Reviews 121

    Average Rating 5.0

    5

    100% enrolled

    25/09/17/Wed 21:50 After listening to the lecture, I understood a lot about object recognition. I was always curious about how recognition works when watching object recognition videos... You've really created an excellent lecture. I usually don't listen to lectures on weekdays, but I listened to this lecture even on a weekday. ^^;; Thank you for creating such a great lecture.

    • trimurti
      Instructor

      Thank you for the good course review.

  • starirene95758님의 프로필 이미지
    starirene95758

    Reviews 9

    Average Rating 4.6

    5

    100% enrolled

    It was an informative class with kind and detailed explanations!!!

    • yonsoo6259님의 프로필 이미지
      yonsoo6259

      Reviews 14

      Average Rating 5.0

      5

      32% enrolled

      YoungJea Oh's other courses

      Check out other courses by the instructor!

      Similar courses

      Explore other courses in the same field!