inflearn logo

AI Development Part 3: Practical Machine Learning Projects

"Beyond Data Analysis: Mastering Predictive Modeling with 5 Real-World Projects (45 Lectures Total)" Have you finished learning data analysis but feel stuck when it's time to actually build a model? Beyond simply learning how to call libraries, this course will help you fully master the inner workings of algorithms and optimal model validation strategies—ranging from Titanic survival prediction to spam text classification. Systematically conquer projects that wield the most power in the industry, from linear models to the latest ensemble algorithms and the basics of Natural Language Processing (NLP). Step into the world of AI modeling and start predicting the future based on analyzed data.

2 learners are taking this course

Level Basic

Course period Unlimited

Machine Learning(ML)
Machine Learning(ML)
NLP
NLP
Algorithm
Algorithm
AI
AI
Machine Learning(ML)
Machine Learning(ML)
NLP
NLP
Algorithm
Algorithm
AI
AI

What you will gain after the course

  • - Ability to extract insights from raw data through Exploratory Data Analysis (EDA)

  • - Ability to select the optimal algorithm for specific problem types, such as regression, classification, and clustering.

  • - Realistic data imbalance resolution skills using oversampling techniques such as SMOTE

  • - Basic NLP skills to clean text data and integrate it into AI models

  • - Understanding the entire flow of the machine learning pipeline, from data cleaning to prediction

Course Introduction


: "Beyond Data Analysis: Mastering Predictive Modeling through 5 Real-World Projects (45 Lectures in Total)"

Have you finished data analysis but feel lost when it's time to actually build a model? Beyond simply learning how to call libraries, this course will help you master everything from the working principles of each algorithm to optimal model validation strategies, covering projects ranging from Titanic survival prediction to spam message classification.

From linear models to the latest ensemble algorithms and the basics of Natural Language Processing (NLP), you will systematically master the projects that exert the most powerful influence in the field. Now, step into the world of AI modeling, where you predict the future based on analyzed data.

 

Key points unique to this course

* 5 Major Real-world Projects: Step-by-step learning using Titanic, bike rental, medical, consumption patterns, and spam data

* Mastering Imbalanced Data: Learn the 'Oversampling (SMOTE)' strategy, a real-world challenge, using surgical patient data

* Feature Engineering: Mastering core feature extraction and data cleaning strategies that determine model performance

* Unsupervised Learning in Practice: Finding the optimal cluster (K) and customer segmentation using demographic data

* Introduction to Natural Language Processing (NLP): Techniques for tokenizing and vectorizing text data to apply to machine learning models



 

📱 Curriculum & Project Preview


✒ Section 1. Project Overview and Development Environment Setup: Lecture 1 ~ Lecture 4

We will outline the course roadmap and the big picture of machine learning projects. After installing the necessary libraries, we will set up the optimal Python environment for hands-on practice.

Key Learning: Introduction to courses and projects, setting up the development environment, and installing essential libraries.


✒Section 2. [Project 1] Classification - Titanic Survivor Prediction (Lectures 5 ~ 16)

Through the most famous datasets, we cover the entire machine learning process in depth, from EDA to prediction. You will gain hands-on experience with the importance of data cleaning and correlation analysis.

Key Technologies: EDA and Visualization, Data Cleaning/Transformation, Correlation Analysis, Feature Selection, and Model Training/Prediction

 

✒ Section 3. [Project 2] Regression - Predicting Bicycle Rental Demand (Lectures 17 ~ 27)

Train a regression model to predict continuous numerical values. Learn how to analyze data with time-series characteristics and improve model performance.

Key Learning: Advanced EDA, data transformation strategies, regression model training, and multi-step prediction optimization


✒ Section 4. [Project 3] Handling Imbalanced Data - Surgical Patient Data Analysis (Lectures 28 ~ 33)

You will learn techniques to enable normal prediction by resolving the imbalance issue in surgical patient data through oversampling.

Key Technologies: Oversampling, Model Evaluation


✒ Section 5. [Project 4] Unsupervised Learning (Clustering) - Customer Segmentation (Lecture 34 ~ 38)

You will learn clustering, a technique for grouping data without predefined labels, using customer spending habit data.

Key Technologies: Clustering, Searching for the optimal K (Elbow Method, etc.)


✒ Section 6. [Project 5] Natural Language Processing (NLP) - Spam Text Classification (Lectures 39 ~ 45)

You will learn how to handle unstructured text data. You will go through the entire natural language processing (NLP) process to convert it into numerical data that machine learning models can understand.

Key Learning: Tokenization, Noise Removal, Stopword Removal, Stemming/Lemmatization, Feature Vectorization, and ML Classification



✒ About the Instructor

Jaesung Yoon (Lead Instructor of Data Analysis at Like Lion)


Development Experience
• Developed and launched SKT "Island Adventure" mobile content
• Developed and launched KT "Quiz Soccer" mobile content
• Launched SK "Mobile Real Estate Agent"
• Developed iPhone "Hanjatong" app
• Developed iPhone "Health Training" app
• Developed content for KT/SK Japan Namco "Tales of Commons"
• Developed KT mini-games (Yageum Yageum Land Grab, Aladdin's Magic Lamp, Mystery Block Detective Agency, BUZZ and BUZZ)

Teaching Experience
A veteran instructor with 19 years of experience in teaching and development for current employees and job seekers at famous domestic companies and institutions, including Samsung Multi Campus, Busan IT Industry Promotion Agency, Jeonju IT Convergence Agency, Incheon IT Promotion Agency, Korea Radio Promotion Association, SK C&C, T Academy, Korea Institute for Robot Industry Advancement, Daejeon ETRI, Samsung Electronics, nica Education Center, Korea Productivity Center, Hanwha S&C, LG Electronics, and more.

Teaching Fields
I teach in fields such as Java, Android, Frameworks, Databases, UML, iPhone, Big Data processing and analysis, Python, IoT, data analysis using R/Python, Deep Learning, Machine Learning AI, and Spark. I structure my lectures to explain concepts as easily as possible by incorporating my diverse experiences, and I create examples so that students can apply them to hands-on practice. Since this is not an offline class, please use the Q&A section for anything you don't understand. I will make sure to resolve your questions.

Recommended for
these people

Who is this course right for?

  • - For those who want to step into the world of professional AI modeling: Recommended for those who have completed the basics of data analysis but are curious about how to create predictive models using real-world data.

  • - Those who are eager for practical preprocessing techniques: This is essential for aspiring data scientists who need techniques that directly impact model performance, ranging from handling missing values and feature selection to text data cleaning.

  • - For those who want to work with data from various domains: Highly recommended for those who want to experience data in diverse fields such as social (Titanic), environment (bicycles), medical (surgery), economy (consumption), and telecommunications (spam).

  • - For those who want to increase model reliability: This is suitable for those who want to achieve both 'interpretability' and 'accuracy' through correlation analysis and feature selection, rather than just simple implementation.

Need to know before starting?

  • Basic knowledge of Python syntax, data analysis libraries (NumPy, Pandas), and the fundamentals of machine learning is required.

  • If you are unfamiliar with data handling, we recommend taking the [Required Prerequisite] Python Data Analysis Master course first.

Hello
This is softcampus

15,961

Learners

821

Reviews

592

Answers

4.7

Rating

43

Courses

Soft Campus is an education center that supports online and offline lectures and content sales.

For inquiries regarding AI-related fields and the purchase of various lectures and content, please contact us at raputa@nate.com or by phone at 02-553-0824.

Thank you.

More

Curriculum

All

47 lectures ∙ (13hr 2min)

Course Materials:

Lecture resources
Published: 
Last updated: 

Reviews

Not enough reviews.
Please write a valuable review that helps everyone!

softcampus's other courses

Check out other courses by the instructor!

Similar courses

Explore other courses in the same field!

Limited time deal

$27,500.00

50%

$42.90