강의

멘토링

로드맵

Inflearn brand logo image
BEST
AI Development

/

Deep Learning & Machine Learning

Practical Data Science Part 3. Understanding Machine Learning

The digital transformation (DT) and introduction of artificial intelligence (AI) in companies begin with the construction of machine learning models. However, the scope of machine learning technology is very broad, and in order to select the optimal method, it is necessary to clearly understand the basic concepts. In this lecture, we will introduce the core contents necessary to clearly understand the basic concepts of machine learning, focusing on five examples.

(4.7) 31 reviews

350 learners

  • hjkim3
Machine Learning(ML)

Reviews from Early Learners

What you will learn!

  • Understand the basics of what machine learning is and how it works.

  • Understand how to implement machine learning models in Python and various performance metrics to evaluate the performance of the model.

  • Understand the difference between traditional statistical analysis and machine learning, and learn key statistical techniques such as probability distributions, independence tests, and chi-square tests through examples.

Only the essential points are included!
Understanding Machine Learning Fundamentals for Model Building

What is Machine Learning? 👩‍💻

Machine learning refers to software that performs tasks such as predicting numbers (regression), classifying categories, and making optimal recommendations. It refers to software that gradually improves its performance by observing and learning from data.

Machine learning is currently the most common method for implementing artificial intelligence. The core function of machine learning is to create a machine learning "model" that performs intelligent actions .


Machine Learning Models 📖

It refers to software that obtains the optimal output (y) from input data (X), and the optimal output means predicting the correct answer (label, target) well.

Model types include linear models, logistic regression, support vector machines (SVMs), decision trees, random forests, k-NN, Bayesian models, and deep learning models (MLPs, CNNs, and RNNs). While this lecture does not cover the specifics of these algorithms, it will teach you the basic and common methods for implementing machine learning models using linear models. The characteristics of each model will be covered in other lectures.


Machine Learning Model Components 🚦

1️⃣ Input data

To implement the optimal model, you must prepare the training data required to train the model and the validation data required to verify the operation of the trained model.

The process of creating appropriate training and validation data from raw data is data preprocessing , and data preprocessing greatly affects the performance of machine learning models.

2️⃣ Purpose of machine learning models

The purpose of using machine learning models is divided into four categories:


What you will learn in this course 👨‍🏫

✅ Machine learning model

Learn an overview of machine learning and explore key concepts for understanding machine learning through five examples.

✅ Performance Evaluation

First, you will learn how to implement, train, and validate regression models, as well as model performance evaluation metrics such as R-squared, MAE, and RMSE.

Next, we'll learn how to implement a classification model, as well as the concepts of decision boundaries, confusion matrices, accuracy, precision, recall, and the f-1 score. Evaluating classification performance requires a clear understanding of the confusion matrix, which we'll explain in detail through examples.

To comprehensively evaluate the performance of a classification model, the predicted ranking must be evaluated. To this end, we will explain how to use ROC-AUC and precision-recall curves.

✅ Finding classification thresholds

In real-world applications, classification models often have minimum precision or recall requirements, requiring selection of an optimal classification threshold that satisfies these requirements. This article details how to find the optimal threshold using the Precision-Recall curve.

✅ The Difference Between Statistical Analysis and Machine Learning

Machine learning While learning most Curious thing middle One With statistical analysis The difference Understanding Statistical analysis is divided into descriptive statistics, estimation , and hypothesis testing.

  • descriptive statistics
    • It refers to obtaining some meaning or insight contained in the data by understanding the average, standard deviation, probability distribution, correlation, etc. of the data.
  • estimation
    • There are point estimates that estimate specific numerical values such as the mean and standard deviation by looking at a sample and interval estimates that estimate confidence intervals.
  • testing hypothesis
    • This refers to testing whether the alternative hypothesis is correct by looking at sample data.

Statistics emphasizes explaining theoretical foundations, dealing with hypotheses, probabilities, confidence intervals, and margins of error. In contrast, machine learning focuses on creating software models that excel at prediction and classification, rather than providing theoretical foundations.

If the data to be analyzed is small, it is necessary to rely on statistical analysis for explanation, estimation, hypothesis testing, etc. However, if the data is sufficiently large, it is more useful to create a machine learning model that can be used in practice.

This lecture introduces the fundamentals of statistical analysis, including the characteristics of the normal distribution. For reference, the normal distribution is the probability distribution function of accumulated samples that converges and no longer changes (see figure below).




🌟 View Core Data Science Lectures

Python, the foundational language of data science.
This course is designed to provide basic knowledge of Python.
For those who do not have basic knowledge of the Python language,
Practical Data Science Part 1. Through an introductory Python lecture.
I recommend learning player knowledge.
Practical Data Science Part 2: Data Preprocessing
The most frequently used data analysis performance in businesses
This is the lecture on data preprocessing that has the greatest impact.
For those who want to learn more about data preprocessing,
Practical Data Science Part 2. Data Preprocessing Lecture Learning
I recommend it.

Recommended for
these people

Who is this course right for?

  • For those who are learning the working principles of machine learning for the first time

  • If you need to apply machine learning to your work but find it difficult to invest a lot of time, this will be helpful for those who want to learn the core of machine learning in a short period of time.

Need to know before starting?

  • Basic knowledge of Python is required.

Hello
This is

919

Learners

77

Reviews

11

Answers

4.8

Rating

3

Courses

"고장난 라디오 고칠 수 있어?"

제가 전자공학과에 입학한 후 친구로부터 받은 질문입니다. 뭐, 대답은 했습니다. "전자공학과에서는 라디오 만드는 원리를 배우는 것이지 고장난 전자제품 고치는 것은 우리 일이 아니고..." 

이론으로 무장한 전문가보다 문제 해결사가 필요한 경우가 더 많습니다. 저는 실전 문제 해결이 더 중요하다고 생각합니다.

최근에는 머신러닝으로 금융, 에너지, 전자, 중장비, 물류, 신약개발, 식품 등 산업 영역의 문제를 해결하는 일을 하고 있는데, 정말 배울 것도 많고 할 일도 무궁무진한 영역인 것 같습니다. 본업은 교수지만 (강원대 컴퓨터공학과), 현장의 문제해결에 관심이 많아 여러 겸직을 하고 있습니다. AI신약개발지원센터장, KAIST 겸임교수, 그리고 데이터사이언스랩 대표를 맡고 있습니다.

AI 시대에 가장 필요한 인재는 실전 문제를 해결할 수 있는 데이터 사이언티스트라고 믿으며 여러분 모두  인기 있는 데이터 사이언티스트가 되기를 바랍니다.

Curriculum

All

20 lectures ∙ (4hr 45min)

Published: 
Last updated: 

Reviews

All

31 reviews

4.7

31 reviews

  • 박언상님의 프로필 이미지
    박언상

    Reviews 4

    Average Rating 5.0

    5

    100% enrolled

    데이터 분석과 머신 러닝에 대한 상세한 설명이 좋았습니다.

    • 최승민님의 프로필 이미지
      최승민

      Reviews 2

      Average Rating 5.0

      5

      100% enrolled

      • 김현철님의 프로필 이미지
        김현철

        Reviews 7

        Average Rating 5.0

        5

        30% enrolled

        • 이우광님의 프로필 이미지
          이우광

          Reviews 1

          Average Rating 5.0

          5

          70% enrolled

          좋아요.

          • 정원태님의 프로필 이미지
            정원태

            Reviews 1

            Average Rating 5.0

            5

            60% enrolled

            상세한 설명 덕분에 전체적인 흐름을 이해하는데 도움이 되었습니다.

            $51.70

            hjkim3's other courses

            Check out other courses by the instructor!

            Similar courses

            Explore other courses in the same field!