How to Build OCR That Actually Works in Real-World Scenarios

If you want to properly learn the OCR technology that is actually used in practice, this course covers it. Using the latest SOTA models and real-world know-how, we build an enterprise-level OCR project together, aiming for over 98% accuracy even on unstructured documents and complex layouts.

(4.5) 2 reviews

52 learners

  • nexthumans
Hands-on focused
AI coding
OCR
Document recognition
Azure
Python
AI
OpenAI
OpenAI API
Computer Vision (CV)

What you will learn!

  • A hands-on, end-to-end OCR project, from data cleaning to AI service integration

  • A study of deep learning-based SOTA OCR models

Introduction to the course

If you want to learn the OCR technology that is actually used in the field, this is the course for you!
Together we build an enterprise-level OCR project based on the latest SOTA models and practical know-how, aiming for over 98% accuracy even on non-standard documents and complex layouts.

  • “From Flyers to Chatbots, Catch Up on Real OCR Practice”

  • “This is how we make the 98% accuracy OCR that the industry wants!”

  • “Complex documents are OK too! Complete mastery of the latest OCR technology!”

  • “Start your hands-on OCR project today!”

#Python, #Artificial Intelligence (AI), #openai, #openAI API, #computer vision

Lecture Description

OCR for license plates, business cards, and contracts is now standard.
In practice, though, we face complex, irregular documents such as E-Mart flyers.

In this course, we go beyond simply reading the text:
📦 Cleaning the data
🧠 Connecting to AI services
📊 Deriving insights
This hands-on course covers the full flow of an end-to-end OCR project.

It runs from deep learning-based SOTA OCR models,
through building a project with real flyer data,
to RAG, chatbots, and marketing insights.

Learn everything about OCR that can be applied directly in practice.
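
As an illustration of the "cleaning the data" step described above, here is a minimal sketch that turns raw OCR lines from a flyer into structured records. It is not the course's code: the `FlyerItem` type, the `PRICE_LINE` pattern, and the assumption that each product line ends with a price in won ("원") are all hypothetical choices made for this example.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Hypothetical pattern: a product name followed by a price such as "3,980원" or "2500 원".
PRICE_LINE = re.compile(r"^(?P<name>.+?)\s+(?P<price>\d{1,3}(?:,\d{3})*|\d+)\s*원$")


@dataclass
class FlyerItem:
    name: str
    price_krw: int


def parse_flyer_line(line: str) -> Optional[FlyerItem]:
    """Turn one raw OCR line into a structured item, or None if it does not match."""
    text = " ".join(line.split())  # collapse runs of whitespace left by OCR
    match = PRICE_LINE.match(text)
    if match is None:
        return None
    price = int(match.group("price").replace(",", ""))
    return FlyerItem(name=match.group("name"), price_krw=price)


def clean_flyer_text(lines: list[str]) -> list[FlyerItem]:
    """Keep only the lines that parse as (name, price) pairs."""
    items = (parse_flyer_line(line) for line in lines)
    return [item for item in items if item is not None]
```

Calling `clean_flyer_text` on a list of OCR lines drops slogans and other text that does not look like a priced product, leaving records that could be loaded into a table or passed on to a downstream service.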

Target audience

  • Practitioners and data engineers who need to handle complex document recognition.

  • Anyone who wants to design a real project using OCR technology

  • Anyone preparing for or interested in RAG-based AI services

  • Students and job seekers who need a practical project for their portfolio

Learning Objectives

  • Understand and compare the characteristics and application range of various OCR models.

  • Extract the information you need, in a cleaned form, from unstructured documents (flyers, advertisements, etc.).

  • Design a practical flow that links OCR data to RAG and AI services.

  • Gain the know-how to solve problems encountered in real-world documents (distortion, backgrounds, fonts, etc.).

  • Understand strategies for achieving the level of accuracy required in business practice.
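
The page does not say how the 98% accuracy target is measured. One common way to score OCR output is character-level accuracy, computed as 1 minus the character error rate (CER). The sketch below assumes that definition; the function names are hypothetical and chosen for this example.

```python
def edit_distance(reference: str, hypothesis: str) -> int:
    """Levenshtein distance: minimum insertions, deletions, and substitutions."""
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            substitution = previous[j - 1] + (ref_char != hyp_char)
            current.append(min(previous[j] + 1, current[j - 1] + 1, substitution))
        previous = current
    return previous[-1]


def character_accuracy(reference: str, hypothesis: str) -> float:
    """Return 1 - CER, where CER = edit distance / reference length (assumed metric)."""
    if not reference:
        return 1.0 if not hypothesis else 0.0
    cer = edit_distance(reference, hypothesis) / len(reference)
    return max(0.0, 1.0 - cer)
```

Under this definition, a single misread character in a 12-character string (for example the letter "O" read in place of a final "0") already brings accuracy down to about 92%, which shows how demanding a 98% target is on short fields such as prices.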

Notes before taking the course

Prerequisites

  • Python Basic Grammar

  • Not required, but helpful to know:

    • Experience using Pandas and NumPy

    • Machine Learning and Deep Learning Basics

    • Interest in computer vision or OCR technology

Tools and Libraries Used

  • Key technology stacks used in the course:

    • Python 3.10 or later

    • OpenCV and NumPy for image preprocessing and conversion

    • OpenAI API
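
The page does not list which preprocessing steps the course applies. As one plausible example of image preprocessing before OCR, the sketch below converts a BGR image to grayscale and binarizes it with Otsu's method, using NumPy only. In OpenCV the equivalent calls would be `cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)` and `cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY | cv2.THRESH_OTSU)`. The function names here are hypothetical.

```python
import numpy as np


def to_grayscale(image_bgr: np.ndarray) -> np.ndarray:
    """Convert an H x W x 3 BGR uint8 image to grayscale with standard luma weights."""
    weights = np.array([0.114, 0.587, 0.299])  # B, G, R order, as OpenCV loads images
    gray = image_bgr.astype(np.float64) @ weights
    return np.clip(np.rint(gray), 0, 255).astype(np.uint8)


def otsu_threshold(gray: np.ndarray) -> int:
    """Pick the threshold that maximizes between-class variance (Otsu's method)."""
    hist = np.bincount(gray.ravel(), minlength=256).astype(np.float64)
    prob = hist / hist.sum()
    omega = np.cumsum(prob)                 # probability of the class with values <= t
    mu = np.cumsum(prob * np.arange(256))   # cumulative mean up to t
    denom = omega * (1.0 - omega)
    sigma_b = np.zeros(256)
    valid = denom > 0                       # skip thresholds that leave one class empty
    sigma_b[valid] = (mu[-1] * omega[valid] - mu[valid]) ** 2 / denom[valid]
    return int(np.argmax(sigma_b))


def binarize(image_bgr: np.ndarray) -> np.ndarray:
    """Return a black-and-white image: pixels above the Otsu threshold become 255."""
    gray = to_grayscale(image_bgr)
    threshold = otsu_threshold(gray)
    return np.where(gray > threshold, 255, 0).astype(np.uint8)
```

Binarization of this kind tends to help on flyers with colored backgrounds, although real documents often need additional steps such as deskewing or denoising.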

Practice environment

  • You can practice in a local environment; no separate GPU is required.

  • Practice code is provided along with the lecture materials.

Lecture materials

  • Flyer images and code samples for all exercises are provided.

Other Notes

  • The course is structured around practical, project-based learning, not just theory-based lectures.

  • Since hands-on practice is included, we encourage you to follow along and run Python code yourself during the course.

  • If you have questions during the course, please ask them through the community Q&A or the instructor feedback channel; this feedback also helps improve the course.

Hello, this is nexthumans.

106 learners ∙ 10 reviews ∙ 16 answers ∙ 4.9 rating ∙ 3 courses

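I currently lead development and provide consulting, mainly for large enterprises, on projects like those listed below. I am still active in the field.

I also serve as an adjunct professor in artificial intelligence at the Korea University graduate school.

My goal is to teach field-tested programming skills that you can put to use right away. I look forward to building enjoyable classes together with many of you.

  • Enterprise AI architecture and service design

  • Machine learning service implementation

  • Backend service development

  • Database construction and service development in cloud (Azure) environments, including Databricks, ETL, and Fabric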

Curriculum

22 lectures ∙ (11hr 47min)

Course Materials:

Lecture resources

Reviews

2 reviews ∙ Average rating 4.5

  • sjoh7998 (13 reviews, average rating 5.0): rated 5, 64% enrolled

  • fin4444 (1 review, average rating 4.0): rated 4, 32% enrolled

Price: $84.70
