BEST

Spark Machine Learning Complete Guide - Part 1

Name: Spark Machine Learning Complete Guide - Part 1
Price: 99000 KRW
Rating: 4.9 (29 reviews)

If you want to be recognized as a machine learning expert in large-scale data environments—from understanding the core framework of Spark Machine Learning to SQL-based data processing through high-difficulty practical problems, and the ability to implement optimized machine learning models through business domain analysis—join this course.

(4.9) 29 reviews

952 learners

Level Intermediate

Course period Unlimited

dooleyz3525

Apache Spark

Machine Learning(ML)

Big Data

Data Engineering

Apache Spark

Machine Learning(ML)

Big Data

Data Engineering

Reviews from Early Learners

4.9

5.0

freedom07

93% enrolled

I first got to know Professor Kwon Chul-min through the Complete Guide to Python Machine Learning. Thanks to that lecture, I, a non-major, was able to not give up on this field that I had been thinking of giving up on. I am currently working in this field and studying steadily by taking Infraon lectures. I wanted to thank the teacher, so I first thanked the teacher in the Q&A session, and the teacher encouraged me that if I continued to study, I would be able to achieve what I had worked for. I plan to continue to listen to the teacher's lectures in the future. ^^ㅎㅎ He really teaches so well. Professor Kwon Chul-min, I would like to take this opportunity to sincerely thank you.

5.0

egs41

10% enrolled

It was good to focus on the instructor's diction and voice, and the content was solid. Please continue to make good lectures. Thank you.

5.0

밑바닥개발자

54% enrolled

I am a student who has been attending Kwon Chul-min's lecture series! Thank you for continuing to provide high-quality lectures! And I have seen several Spark lectures in Scala and Java, but this is the first time I have seen a lecture that teaches Spark in Python, so I think it was even better! Although I have not completed the course yet, I still like how he tries to teach simple grammar as easily as possible! And I also like how he provides various practice materials to encourage repeated mastery! I look forward to other lectures in the future!

What you will gain after the course

Implementing Machine Learning Models in Spark
A detailed understanding of DataFrame, the foundation of Spark's data processing
Understanding various technical elements that constitute the Spark Machine Learning Framework
Mastering Spark's Machine Learning Pipelines
SQL proficiency for data analysis
SQL-based Feature Engineering Techniques
Implementing models with XGBoost and LightGBM in Spark
Model hyperparameter tuning method based on Bayesian optimization
Simultaneously improve data analysis and ML model implementation skills through high-difficulty practical problems.
Data analysis method based on analysis domains
Various data visualization techniques

[Notice] Databricks Community Edition, which was provided for free as the practice environment for this course, is no longer accepting new sign-ups. Therefore, please be advised that the practice environment will be changed to a local Spark and Jupyter environment as of December 5, 2025.

Since the changes to the practice code due to the transition to a local environment are limited to certain parts, most lecture videos from Section 1 to Section 10 will continue to use the existing recordings from Databricks Community, while new lecture videos in the local Spark environment have been added only for major changes. From Section 11 onwards, many lectures have been replaced with practice videos in the local Spark environment.

Please note when choosing lectures that the current course is composed of a mix of existing recorded videos based on Databricks Community and new videos based on local Spark.

Data analysis + feature engineering + ML implementation,
master all three skills at once.

The encounter between Apache Spark and
Machine Learning.

Apache Spark, the leader in open-source large-scale distributed processing solutions, has met Machine Learning.

Many large corporations and financial institutions in Korea utilize Apache Spark to analyze massive amounts of data and build machine learning models. Since Spark is based on a distributed data processing framework, it can process large-scale data and create ML models while scaling capacity across anywhere from a few to dozens of servers. Therefore, it allows you to overcome the limitations of Scikit-learn, which can only implement machine learning models on a single server.

We will help you grow into a machine learning expert
who is also proficient in
data processing and analysis.

The 'Spark Machine Learning Complete Guide - Part 1' course goes beyond learning how to implement machine learning models in Spark and will help you grow into a machine learning expert who is also proficient in data processing and analysis.

To grow into a true machine learning expert, it is crucial to possess not only the ability to implement ML models but also the skill to process and combine business data to create those models. To this end, you will learn how to process data using SQL, which is most commonly used for large-scale data processing in practice, and data analysis techniques based on business domain analysis through hands-on exercises.

_{The curriculum is designed to help you build data processing/analysis and ML implementation skills through detailed theoretical explanations and hands-on practice.}

We will solve the problems
you will face.

Implementing machine learning models on Spark is not easy. This is because you encounter many problems that existing data scientists or machine learning experts have not experienced, such as unique machine learning APIs and frameworks based on the specificities of the Spark architecture, and data processing based on SQL.

Through this course, Spark Machine Learning Perfect Guide, I will help you develop the ability to solve the problems you encounter.

The first half of the 'Spark Machine Learning Perfect Guide - Part 1' course is

The first half of the lecture consists of detailed theoretical explanations and extensive hands-on practice regarding various elements that make up the Spark Machine Learning Framework, such as DataFrame, SQL, Estimator, Transformer, Pipeline, and Evaluator. Through this, you will be able to easily and quickly implement ML models in Spark.

Additionally, I will provide detailed explanations on how to use LightGBM in Spark and how to tune hyperparameters using HyperOpt based on Bayesian optimization.

The latter half of the 'Spark Machine Learning Guide - Part 1' course is

The latter half of the lecture consists of a hands-on practice of Kaggle's Instacart Market Basket Analysis competition.

Through the model implementation of Kaggle's Instacart Market Basket Analysis competition, a highly challenging contest, we will simultaneously improve your practical data processing/analysis skills and machine learning model implementation capabilities.

Through this dataset, you will learn in detail how to process and analyze business data and perform feature engineering based on SQL, how to derive analysis domains from business operations, and how to create models based on these derived features.

💻 Please check before taking the course!

All practice codes in this course are based on Python. Please note that Scala is not covered before choosing this course.

Please check the
practice environment.

This course uses Docker to set up a practice environment based on local Spark and Jupyter. The practice environment is configured by installing Docker Desktop on your local PC, and the course is designed so that you will have no problem setting up the environment even if you are not familiar with Docker.

Lecture practice codes and lecture explanatory materials can be downloaded from '실습코드와 설명자료 다운로드 받기'.

Prior knowledge is
required for this course.

This course is designed with the assumption that students possess knowledge of Chapter 5 (Regression) of the Python Machine Learning Guide or equivalent expertise, as well as a very basic understanding of SQL, so please keep this in mind when choosing the course.

It is helpful if you know the basics of Spark, but you should have no trouble following the lecture even if you don't.

Please check the prerequisite courses!

Python Machine Learning Guide

Stop theory-oriented machine learning lectures,
learn everything from core machine learning concepts to practical skills easily and accurately.

_{Curious about the instructor's interview? (Click)}

Recommended for
these people

Who is this course right for?

Those who wish to implement machine learning using Spark
Those who wish to implement machine learning based on large-scale data
Those who wish to improve data processing techniques for machine learning using SQL
Those who want to master the entire process of processing data into a desired format and building ML models based on it in a practical setting.
Those who want to simultaneously improve their data analysis, feature engineering skills, and ML implementation.

Need to know before starting?

Understanding up to Chapter 5 (Regression) of "Python Machine Learning Perfect Guide" or equivalent prerequisite knowledge.
Basic Understanding of SQL

Hello
This is dooleyz3525

Inflearn Verified

28,150

Learners

1,531

Reviews

4,077

Answers

4.9

Rating

Courses

(Former) Encore Consulting | (Former) Oracle Korea | Author of "Python Machine Learning Perfect Guide"

AI Freelance Consultant

Curriculum

All

132 lectures ∙ (25hr 1min)

Course Materials:

Lecture resources

Section 1. Course introduction and setting up the practice environment.

9 lectures ∙ (59min)

Section 2. Spark Overview

4 lectures ∙ (52min)

10. Distributed Data Architecture and Spark Overview
15:14
11. Spark Architecture and RDD Overview
12:43
12. Comparison of Spark's RDD, DataFrame, and SQL
11:48
13. Spark Machine Learning Overview
12:28

Section 3. Understanding Spark DataFrames - 01

10 lectures ∙ (2hr 15min)

Section 4. Understanding Spark DataFrames - 02

9 lectures ∙ (1hr 57min)

Section 5. Spark SQL Overview

5 lectures ∙ (43min)

Section 6. Understanding Spark Machine Learning - 01

14 lectures ∙ (2hr 59min)

Section 7. Understanding Spark Machine Learning - 02

9 lectures ∙ (1hr 50min)

Section 8. Spark ML Classification - 01

12 lectures ∙ (2hr 5min)

Section 9. Spark ML - Classification - 02

7 lectures ∙ (1hr 22min)

Section 10. Spark ML Regression

4 lectures ∙ (44min)

Section 11. Understanding Data Analysis for Practical Machine Learning

6 lectures ∙ (1hr 9min)

Section 12. Practical Machine Learning - Kaggle Instacart Market Basket Analysis Competition Overview

6 lectures ∙ (59min)

Section 13. Practical Machine Learning - SQL-based EDA Analysis of Kaggle Instacart Data

18 lectures ∙ (3hr 16min)

Section 14. Practical Machine Learning - Kaggle Instacart Feature Engineering, Model Training, Evaluation, Tuning - 01

13 lectures ∙ (2hr 28min)

Section 15. Practical Machine Learning - Kaggle Instacart Feature Engineering, Model Training, Evaluation, Tuning - 02

6 lectures ∙ (1hr 17min)

Published: 12/08/2021

Last updated: 03/25/2026

Reviews

All

29 reviews

4.9

29 reviews

indizz4933
Reviews 1
∙
Average Rating 5.0
01/03/2022
5
100% enrolled
Thank you for explaining it step by step.
freedom07
Reviews 7
∙
Average Rating 5.0
02/04/2022
5
93% enrolled
I first got to know Professor Kwon Chul-min through the Complete Guide to Python Machine Learning. Thanks to that lecture, I, a non-major, was able to not give up on this field that I had been thinking of giving up on. I am currently working in this field and studying steadily by taking Infraon lectures. I wanted to thank the teacher, so I first thanked the teacher in the Q&A session, and the teacher encouraged me that if I continued to study, I would be able to achieve what I had worked for. I plan to continue to listen to the teacher's lectures in the future. ^^ㅎㅎ He really teaches so well. Professor Kwon Chul-min, I would like to take this opportunity to sincerely thank you.
- dooleyz3525
  Instructor
  02/04/2022
  I am even more impressed that you left such a touching review. I think I should be the one to thank you for the writing that instantly rewards the hard work you put into creating the lecture. If you continue to work hard like this, you will definitely achieve everything you want. Thank you.
iamcodingcat
Reviews 13
∙
Average Rating 5.0
02/07/2022
5
54% enrolled
I am a student who has been attending Kwon Chul-min's lecture series! Thank you for continuing to provide high-quality lectures! And I have seen several Spark lectures in Scala and Java, but this is the first time I have seen a lecture that teaches Spark in Python, so I think it was even better! Although I have not completed the course yet, I still like how he tries to teach simple grammar as easily as possible! And I also like how he provides various practice materials to encourage repeated mastery! I look forward to other lectures in the future!
egs41
Reviews 54
∙
Average Rating 5.0
02/09/2022
5
10% enrolled
It was good to focus on the instructor's diction and voice, and the content was solid. Please continue to make good lectures. Thank you.
gomjong
Reviews 8
∙
Average Rating 4.9
01/14/2022
5
100% enrolled
Thanks to you, I learned about Spark and gained confidence in Kaggle challenges. Thank you!

dooleyz3525's other courses

Check out other courses by the instructor!

FastAPI Complete Guide

dooleyz3525

This course is designed to help you learn the core functions of FastAPI, as well as the entire process of web service development. Through this course, we will help you become a FastAPI expert developer needed in the field.

Intermediate

Python, FastAPI, SQL

FastAPI Complete Guide

dooleyz3525

The Complete Guide to Airflow - Part 1

dooleyz3525

This course is a practical Airflow master program designed to help you understand "why Airflow works this way" and enable you to design and debug data pipelines on your own. It covers everything from Apache Airflow's core mechanisms to detailed theory and hands-on practice involving DAGs, Operators, Hooks, Scheduling, Timezones, Idempotency, and Templates.

Intermediate

Data Engineering, airflow, orchestration

The Complete Guide to Airflow - Part 1

dooleyz3525

Learning Transformer Through Implementation

dooleyz3525

From Multi-Head Attention to the Original Transformer model, BERT, the Encoder-Decoder based MarianMT translation model, and even Vision Transformer, you'll learn Transformer inside and out by implementing them directly in code.

Intermediate

Deep Learning(DL), PyTorch, encoder-decoder

Learning Transformer Through Implementation

dooleyz3525

Deep Learning CNN Complete Guide - Pytorch Version

dooleyz3525

From core theories of deep learning and CNN to various CNN model implementation methods, and practical deep learning development know-how through real-world problems, If you want to become a deep learning CNN technology expert based on Pytorch, join us in this lecture :)

Basic

Deep Learning(DL), PyTorch, Computer Vision(CV)

Deep Learning CNN Complete Guide - Pytorch Version

dooleyz3525

Kafka Complete Guide - ksqlDB

dooleyz3525

This course is designed to help you learn the use of ksqlDB and its core mechanisms through various hands-on exercises. After completing the course, you will be able to easily and quickly build a real-time streaming data analysis system based on Kafka.

Intermediate

Kafka, ksqlDB, Data Engineering

Kafka Complete Guide - ksqlDB

dooleyz3525

Kafka Complete Guide - Connect

dooleyz3525

Through in-depth theoretical explanations of Kafka Connect and detailed practical training that can be used immediately in the field, we will help you grow into an expert in building data linkage and data pipelines based on Kafka Connect that are needed in the field.

Intermediate

Kafka, Data Engineering

Kafka Complete Guide - Connect

dooleyz3525

Kafka Complete Guide - Core

dooleyz3525

From the core of Kafka to in-depth content on internal mechanisms, the course is structured so that even beginners can quickly reach the expert level through detailed theoretical explanations, hands-on practice, and practical Kafka application development practice.

Intermediate

Kafka, Data Engineering

Kafka Complete Guide - Core

dooleyz3525

SQL data analysis learned through various cases

dooleyz3525

By implementing various practical data analysis cases using SQL, you can simultaneously improve your data analysis and SQL utilization skills.

Intermediate

SQL, PostgreSQL, DBMS/RDBMS

SQL data analysis learned through various cases

dooleyz3525

Data Analysis SQL Fundamentals

dooleyz3525

Through detailed lectures and hands-on practice on the core elements of SQL, we will provide you with a solid foundation to grow into a SQL analysis expert.

Basic

SQL, PostgreSQL, DBMS/RDBMS

Data Analysis SQL Fundamentals

dooleyz3525

A Complete Guide to Deep Learning CNN - TensorFlow Keras Version

dooleyz3525

From core theories of Deep Learning and CNN to implementation methods of various CNN models, and practical Deep Learning development know-how through real-world problems, If you want to become a Deep Learning CNN technology expert, join us in this lecture :)

Basic

Deep Learning(DL), CNN, Tensorflow

A Complete Guide to Deep Learning CNN - TensorFlow Keras Version

dooleyz3525

Oracle Performance Analysis and Instance Tuning Core Guide

dooleyz3525

It provides a key guide to understanding the internal mechanisms of Oracle DB architecture and growing as a performance tuning and performance analysis expert.

Intermediate

Oracle, DBMS/RDBMS

Oracle Performance Analysis and Instance Tuning Core Guide

dooleyz3525

Kaggle Advanced Machine Learning Practical Crash Course

dooleyz3525

This course is designed to upgrade your skills as a practical machine learning development expert by implementing the machine learning model of the Home Credit Default Risk competition on Kaggle.

Intermediate

Machine Learning(ML), Kaggle

Kaggle Advanced Machine Learning Practical Crash Course

dooleyz3525

[Revised Edition] Deep Learning Computer Vision: The Complete Guide

dooleyz3525

This course will help you grow into a deep learning-based computer vision expert needed in the field through in-depth theoretical explanations of Object Detection and Segmentation, along with practical examples at a level that can be immediately applied in the industry.

Intermediate

Python, Machine Learning(ML), Deep Learning(DL)

[Revised Edition] Deep Learning Computer Vision: The Complete Guide

dooleyz3525

[Revised Edition] The Complete Guide to Python Machine Learning

dooleyz3525

We will help you easily understand the core concepts of machine learning and acquire the ability to implement practical machine learning applications by moving away from theory-based machine learning courses.

Basic

Python, Machine Learning(ML), Statistics

[Revised Edition] The Complete Guide to Python Machine Learning

dooleyz3525

Similar courses

Explore other courses in the same field!

The Complete Guide to Airflow - Part 1

dooleyz3525

Intermediate

Data Engineering, airflow, orchestration

The Complete Guide to Airflow - Part 1

dooleyz3525

Learning Spark through Practice Part 1

nexthumans

Through this course, you will be able to immediately carry out corporate Apache Spark projects.

Basic

Apache Spark, Big Data, Machine Learning(ML)

Learning Spark through Practice Part 1

nexthumans

[Management Course #3] DE, DBA (SSIS, SSAS, MachineLearning, BI, ETL)

vmproductor0202

SSIS, SSAS, MachineLearning, BI, ETL. You can learn important technologies that cannot be found in domestic books, YouTube, lectures, blogs, and academies. I also recommend it to those who are interested in employment at domestic large companies, American large companies, and American state funding agencies.

Basic

Big Data, ssis, ssas

[Management Course #3] DE, DBA (SSIS, SSAS, MachineLearning, BI, ETL)

vmproductor0202

ChatGPT Business Innovation BIBLE - The Era of Citizen Developers Using Vibe Coding

kimw24072

This course is a practical, hands-on guide to automating repetitive tasks and maximizing work productivity using ChatGPT and Python. You will learn step-by-step how to use AI to automate daily repetitive tasks such as Excel work, report writing, sending emails, and data organization. Starting as a non-developer myself, I have taught AI automation to thousands of students, and I have compiled only the methods that can be used immediately in actual business environments. This course focuses not on complex theories, but on **"automation techniques you can use right today."** 👉 The goal is to turn ChatGPT into more than just a tool, but into **your own personal work assistant (Jarvis)**.

Beginner

Python, Big Data, Pandas

ChatGPT Business Innovation BIBLE - The Era of Citizen Developers Using Vibe Coding

kimw24072

Business Data Analysis Part 1 - Python and Descriptive Statistics

softcampus

"From environment setup to practical business projects, a data science journey to find revenue answers through data (39 lectures in total)." Go beyond simply learning how to use libraries and learn how to cook real-world e-commerce data through the data science process. Starting from development environment setup and Numpy/Pandas basics to data preprocessing, statistical analysis, visualization, and business scenario-based RFM segmentation and hypothesis testing! Systematically master the core competencies of a data scientist through practical projects.

Basic

Python, Data Engineering, Numpy

Business Data Analysis Part 1 - Python and Descriptive Statistics

softcampus

Java Machine Learning Weka Intermediate

javaraml

This is the second lecture for popularizing Java machine learning. We introduce Weka, which provides UI and API so that both design and coding can be implemented. We have included cases that are completely suitable for practical application in the lecture.

Intermediate

Java, Machine Learning(ML), Weka

Java Machine Learning Weka Intermediate

javaraml

Database - SQL

kjlee

This course is for those who are learning database programming for the first time or those who have some knowledge but want to learn systematically. From concepts to practical exercises, you will master the database query language by organizing examples of types that frequently occur in real life.

Basic

SQL, Data Engineering

Database - SQL

kjlee

Data Analysis SQL Fundamentals

dooleyz3525

Through detailed lectures and hands-on practice on the core elements of SQL, we will provide you with a solid foundation to grow into a SQL analysis expert.

Basic

SQL, PostgreSQL, DBMS/RDBMS

Data Analysis SQL Fundamentals

dooleyz3525

Methods to Improve Data Mindset (Data Literacy) for Immediate Use in Work

kpcre

We introduce methods for planners and marketers without data analysis experience or technical skills to try data analysis at the most basic level, complete with various examples. Built on years of hands-on practice with over 2,000 participants in more than 100 companies and public institutions, the content is designed around the most practical analysis methods for data non-experts.

Beginner

Data literacy, Big Data, Machine Learning(ML)

Methods to Improve Data Mindset (Data Literacy) for Immediate Use in Work

kpcre

AI-Driven Practical Implementation Strategies for Manufacturing Industry (Electronics/Semiconductor Sector)

88888

The electronics and semiconductor industry is a field where data-driven management and innovation are particularly crucial due to ultra-precision processes and complex supply chains. This course covers practical strategies that can be directly applied in electronics and semiconductor manufacturing, including defect detection, process optimization, predictive maintenance, and supply chain management using AI technology. Along with real-world cases from global companies, the course also presents low-cost, high-efficiency AI implementation methods that small and medium-sized enterprises can realistically utilize. Through this, students will be able to understand and apply AI-based manufacturing strategies that not only improve productivity and reduce costs, but also build future competitiveness.

Beginner

AI, Big Data, Machine Learning(ML)

AI-Driven Practical Implementation Strategies for Manufacturing Industry (Electronics/Semiconductor Sector)

88888

Complete Preparation for the AWS AI Practitioner (AIF-C01) Certification Practice Exam

careerd

See exam types and trends at a glance! Strengthen your practical skills with approximately 100 questions. The perfect preparation for the AWS AI Practitioner exam, allowing for systematic study even when preparing on your own.

Beginner

Machine Learning(ML), Deep Learning(DL), AWS

Complete Preparation for the AWS AI Practitioner (AIF-C01) Certification Practice Exam

careerd

[Free] Basic Text Mining: App Review Analysis with Python (40-minute completion)

HappyAI

This course teaches Python-based text mining: basic theory and practice. It covers fundamental text mining data analysis for practical or thesis writing.

Basic

Text Mining, Big Data, NLP

[Free] Basic Text Mining: App Review Analysis with Python (40-minute completion)

HappyAI

mongoDB from basics to practice (feat. Node.js)

sihoon

mongoDB, NoSQL You hear a lot these days, but is it still an unfamiliar database? Aren't you using it like a relational database (RDS/SQL)? No matter how good the technology is, if it's not used correctly, it will have the opposite effect. That's why we often see cases of failure in using MongoDB. This lecture will teach you everything from basic concepts to practical know-how so that you can use MongoDB as MongoDB-like as possible.

Basic

MongoDB, REST API, Node.js

mongoDB from basics to practice (feat. Node.js)

sihoon

Complete Mastery of Azure Data Fundamentals for Data Beginners

daniellee

This is a special lecture that can lay the theoretical foundation to simultaneously prepare for the Microsoft AZ-900 certification, and as the latest content reflecting the scope of questions as of May 2025, by providing content related to core data concepts, Azure's relational data, Azure's non-relational data, and Azure's analytical workloads in a form combining theory and practice, it can be utilized as a meaningful educational opportunity to not only obtain the certification but also take the first step towards becoming a data professional.

Beginner

SQL, Big Data, Data Engineering

Complete Mastery of Azure Data Fundamentals for Data Beginners

daniellee

[7-Day Complete] Pass MS AI-900 Certification in One Go

jobgreegi

Essential certification for the AI era, MS AI-900! 🚀 Did you know? ChatGPT operates exclusively on Microsoft Azure! If you want to properly utilize Microsoft Azure OpenAI, which exclusively provides the latest GPT models, now is the perfect opportunity. Grow into an Azure AI services expert through the AI-900 course and master enterprise-grade AI solutions that companies trust. Secure global official certification and enhance your competitiveness as a true AI professional!

Beginner

Machine Learning(ML), Deep Learning(DL), AI

[7-Day Complete] Pass MS AI-900 Certification in One Go

jobgreegi

(v501) The Heart of AI: AI Foundation Models and the Mechanics of Intelligence

khjyhy100

[Understanding AI Foundation Models and Their Operating Principles: Engineering Control and System Architecture, Practical Methodologies for Resolving AI Uncertainty and Engineering Assetization] 1. Introduction: The Necessity of Engineering Control of Intelligence (Engineering Control vs. Systemic Chaos) A core conclusion derived from long-term practical insights in industrial fields is that power that is not properly controlled acts as a potential liability rather than an asset. Even a high-performance engine is nothing more than an unstable physical mass if it lacks sophisticated combustion logic and microsecond-level control systems. The organizational chaos currently appearing in the process of adopting Generative AI is judged to stem from a lack of understanding of these control principles and a blind faith in technical "black boxes." This masterclass redefines Artificial Intelligence not as a mysterious stochastic phenomenon, but from the perspective of Model-Based Engineering (MBE). By transforming the uncertain domain of intelligence into a predictable and reliable engineering framework, we aim to present a strategic methodology that allows organizations to secure strong leadership across the entire system without being dependent on technical trends. 2. The 4 Pillars for Solving Core Challenges ① Epistemological Paradigm Shift: Visualizing the Black Box and Assetizing Technical Debt Many companies are facing "technical debt"—characterized by exposure to security vulnerabilities and exponential increases in maintenance costs—by adopting AI models without a clear understanding of their internal structures. This course assetizes this through the following approaches: Deconstruction of Mechanisms: We engineeringly deconstruct the Self-Attention mechanism, the core of the Transformer architecture, from the perspective of numerical weight analysis. By understanding the numerical mechanisms that determine information priority, we visualize the basis for the model's judgment. Analysis of ID Formation: We transparently track the process by which a series of pipelines—leading from Pre-training to Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF)—forms the model's technical identity and ethical guidelines. This converts invisible threats into controllable system parameters. ② Securing Deterministic Reliability: Hallucination Control Strategies to Overcome Probabilistic Limits Large Language Models (LLMs) are not systems that reason for truth, but systems that generate the next most probable token. The "hallucination" phenomenon resulting from this inherent characteristic becomes a fatal flaw in engineering fields where reliability is vital. Constraints of Retrieval-Augmented Generation (RAG): We move away from closed structures that rely solely on the model's internal fixed memory (Internal Weights). We establish an "open-book strategy" that provides clear grounding for generated results by allowing real-time reference to trusted external knowledge bases. Hybrid Model Architecture: We design a redundancy strategy that achieves both accuracy and operational efficiency by deploying large models for areas requiring enterprise-wide knowledge and optimized Small Language Models (SLMs) for specific domains where security and real-time response are essential. ③ Computing Architecture Optimization: Overcoming Physical Bottlenecks (Memory Wall) While intelligence is implemented in software, its performance and economic sustainability are defined by the physical limits of hardware. Physical Constraint Analysis: We diagnose the "Memory Wall" problem, where data transfer speeds cannot keep up with the processing speeds of computing units, and heat generation issues resulting from high-density computation from an engineering perspective. Infrastructure Design Capability: We precisely analyze the physical impact of High Bandwidth Memory (HBM) stacking structures and 2.5D/3D advanced packaging technologies on inference efficiency. We cultivate design capabilities to optimize Total Cost of Ownership (TCO) through full-stack integrated insights that complement hardware limitations with software architecture. ④ Acceleration of Functional Expansion: Transitioning from Passive Tools to Autonomous Agent Systems Current AI remains at the level of simple Q&A, failing to create added value for practical business automation. This course evolves AI into an active subject that judges and executes on its own. Decomposition: We learn techniques for decomposing complex goals into achievable sub-tasks and logically organizing the execution sequence upon receiving them. Digital Workforce Deployment: We define the process of applying "Active Agent" systems to the field, which autonomously call internal ERPs, browsers, and external APIs to complete actual business logic and accept feedback on results. 3. Core Architecture: Closed-loop Control System The way an AI agent manifests intelligence and performs complex tasks is theoretically identical in logical structure to the closed-loop control system performed by an ECU (Electronic Control Unit), the core brain of a car. This course analyzes this in detail from the perspective of the ReAct (Reasoning and Acting) framework. First, the system begins at the Input stage, receiving ambiguous and complex requests from the user. This plays the same role as a sensor in control engineering collecting physical data from the external environment and delivering it to the system, serving as the standard for defining the initial state of the task at hand. Second, based on the received data, the Thought stage proceeds, where plans are established through logical reasoning within the LLM architecture. This is in line with the process where control algorithms in an ECU calculate optimal control values by processing input sensor data. At this stage, the agent sets the optimal path to achieve the goal and secures the logical rigor of the system. Third, the Action stage follows, where tasks are completed by calling external tools or APIs according to the established plan. This logically matches the mechanism where the calculation results of a control system are converted into physical power through an actuator to execute commands. Through this, intelligence exerts actual physical and digital influence beyond abstraction. Finally, the Observation stage is performed, analyzing the execution results and correcting errors relative to the initial goal. This is identical to the core principle of control engineering, which reduces system deviation through a feedback loop. The agent self-verifies whether the execution results meet the goal and continuously upgrades performance by reflecting occurred errors into the next action plan. AI equipped with such a closed-loop structure is no longer an incomplete system dependent on probability. By securing engineering rigor that self-verifies execution results and corrects errors, it functions as a trust-based partner capable of performing business-critical tasks. 4. Practical Application and Expansion: Software-Defined Vehicles (SDV) and Physical AI The final destination of AI architecture lies in the cross-industry proliferation of Software-Defined Vehicles (SDV) and Physical AI, which overcome and evolve physical constraints through software intelligence. This is the standard model for future System Integration (SI) across manufacturing and service industries. Securing Edge Intelligence and Data Sovereignty: Small models (SLMs) mounted on-device in vehicles or facilities learn real-time field data immediately. This minimizes cloud dependency, perfectly protecting data sovereignty—a core asset of the company—and enables precision services based on ultra-low latency. Hardware Optimization and Lightweight Engineering: To implement the best intelligence within limited power and computational resources, we actively introduce model compression technologies such as Quantization, Pruning, and Knowledge Distillation. Model deployment considering hardware bandwidth becomes a core competency that determines system response speed and user experience. Hybrid Orchestration: We design an integrated architecture that organically connects "Cloud LLMs" possessing broad general knowledge with "Edge SLMs" specialized for specific physical control and security. Integration from a full-stack perspective, penetrating from silicon chipsets to software stacks, provides a powerful competitive advantage that evolves the entire system through software updates alone. 5. Conclusion: The Role and Vision of the AI Architect The ultimate goal of this masterclass is to elevate students from the position of a "User" who passively relies on technology and hopes for luck, to a professional "AI Architect" who perfectly controls and tunes everything from the physical limits of the system to the depths of the software architecture. While the phenomenon of intelligence manifests from software logic, it is silicon (hardware) that defines the physical limits of that intelligence, and only sophisticated engineering can overcome those limits to complete actual business value. "Intelligence may reside in the realm of probability, but the vessel that contains that intelligence and makes it operate according to purpose must belong solely to the realm of rigorous and sophisticated engineering."

Intermediate

Data Engineering, AI, Data literacy

(v501) The Heart of AI: AI Foundation Models and the Mechanics of Intelligence

khjyhy100

Artificial Intelligence (AI) - Learning Runway AI that Creates New Videos and Moving Images

usefulit

This course covers everything from the basics to hands-on practice of AI platforms, and is a program where you can learn AI image generation technology. Learners can understand the fundamental concepts of AI and develop the ability to generate AI images using various tools and technologies. Through this course, you can enhance your understanding of the AI field and learn how to directly generate AI images in real-world applications.

Beginner

AI, Deep Learning(DL), Machine Learning(ML)

Artificial Intelligence (AI) - Learning Runway AI that Creates New Videos and Moving Images

usefulit

GA4 Google Analytics Ecommerce Setup Practice for Marketers (2025)

GA4 Guide

You can learn the core essentials of GA4 (Google Analytics 4) e-commerce setup—which is often too difficult to learn alone just by searching the internet—through hands-on practice. This course is designed so that performance marketers and beginner developers setting up GA4 e-commerce for the first time can easily understand it by practicing step-by-step on a Cafe24 demo shopping mall. Since we delegate the developer's role to ChatGPT during the practice sessions, even marketers, PMs, or planners without professional development knowledge can learn how to set up GA4 e-commerce.

Basic

Google Analytics, Data Engineering, Data literacy

GA4 Google Analytics Ecommerce Setup Practice for Marketers (2025)

GA4 Guide

Fundamentals of Machine Learning and Deep Learning with Python, Keras

pnuswedu

Let's understand machine learning and deep learning model algorithms by using Keras!

Basic

Machine Learning(ML), Deep Learning(DL), Keras

Fundamentals of Machine Learning and Deep Learning with Python, Keras

pnuswedu

Complete Guide to Unity Machine Learning Agents (Basics)

kyushik

Through this course, students will learn various reinforcement learning theories and implement them themselves, as well as create a reinforcement learning environment to test the reinforcement learning algorithm implemented using Unity Machine Learning Agents.

Basic

Reinforcement Learning(RL), Machine Learning(ML), Unity

Complete Guide to Unity Machine Learning Agents (Basics)

kyushik

Spark Machine Learning Complete Guide - Part 1

4.9

What you will gain after the course

[Notice] Databricks Community Edition, which was provided for free as the practice environment for this course, is no longer accepting new sign-ups. Therefore, please be advised that the practice environment will be changed to a local Spark and Jupyter environment as of December 5, 2025.

The encounter between Apache Spark and Machine Learning.

We will help you grow into a machine learning expert who is also proficient in data processing and analysis.

We will solve the problems you will face.

The first half of the 'Spark Machine Learning Perfect Guide - Part 1' course is

The latter half of the 'Spark Machine Learning Guide - Part 1' course is

Please check the practice environment.

Prior knowledge is required for this course.

Please check the prerequisite courses!

Python Machine Learning Guide

Recommended for these people

HelloThis is dooleyz3525

Curriculum

Reviews

dooleyz3525's other courses

Similar courses

The encounter between Apache Spark and
Machine Learning.

We will help you grow into a machine learning expert
who is also proficient in
data processing and analysis.

We will solve the problems
you will face.

Please check the
practice environment.

Prior knowledge is
required for this course.

Recommended for
these people

Hello
This is dooleyz3525