Evaluation methods for stable AI agent service operation

Name: Evaluation methods for stable AI agent service operation
Price: 69300 KRW
Rating: 5 (6 reviews)

Are you anxious every time you deploy an AI agent? Based on experience with major domestic corporations and global big tech companies, we will show you how to systematically measure and improve agent quality using LangSmith.

(5.0) 6 reviews

153 learners

Level Intermediate

Course period Unlimited

jasonkang

Python

LangChain

LangGraph

Python

LangChain

LangGraph

What you will gain after the course

AI Agent-Specific Evaluation Methodologies and Practical Know-How
Establishing a "data"-driven decision-making system rather than one based on "intuition"
Dramatic reduction in development and testing costs
Error resolution and debugging techniques for real-world service operations

Notes before taking the course

Hands-on Environment

Python 3.13 or higher must be installed.

Prerequisite Knowledge and Important Notes

You must be familiar with the basic syntax of Python programming.
This is suitable for those who have experience in developing agents using LangChain and LangGraph.
- If you are not familiar with LangChain syntax, please take Mastering LangChain Basics in One Hour↗️ first.
- If you are not familiar with LangGraph syntax, please take AI Agent Development using LangGraph↗️ first.

Learning Materials

Lecture materials are provided via the Notion page↗️
Practice code and example datasets are provided via GitHub↗️

Recommended for
these people

Who is this course right for?

A developer who feels anxious that every time they fix a single line of a prompt, another feature might break.
A planner who wants to make decisions based on data and metrics rather than 'feelings' when communicating with the development team
Developers who want to go beyond the basics and develop AI agents at a professional, practical level

Need to know before starting?

Python required
LangGraph Required

Hello
This is jasonkang

Inflearn Verified

Career Verified

19,040

Learners

1,499

Reviews

528

Answers

4.9

Rating

Courses

FAANG Senior Software Engineer
(Former) GS Group AI Agent platform development/operations
(Former) GS Group DX BootCamp Mentor/Coach
(Former) FAANG Senior Software Engineer (Former) GS Group AI Agent Platform Development/Operations (Former) GS Group DX BootCamp Mentor/Coach
(Former) Tech Lead at a Series C AI Startup
Stanford University Code in Place Python Instructor
Naver Boostcamp Web/Mobile Mentor
Naver Cloud YouTube Channel presenter
Author of Building Autonomous AI Agents with LangChain & LangGraph
Wanted Pre-onboarding Frontend/Backend Challenge Instructor (6,000+ cumulative participants)
Hanghae AI Plus Course 1st Generation Coach

Curriculum

All

18 lectures ∙ (3hr 16min)

Section 1. Intro: Introduction to Course Content (Please watch before enrolling!)

1 lectures ∙ (9min)

1. What this course covers and what it does not cover
09:49

Section 2. Reasons why AI agent evaluation is necessary

2 lectures ∙ (19min)

Section 3. Golden Dataset Creation

6 lectures ∙ (1hr 4min)

4. What is a Golden Dataset?
07:16
5. Generating a Golden Dataset using the RAGAS library
15:16
6. Set up projects and APIs in your LangSmith account
02:40
7. Generating Golden Datasets using Custom Agents + FAQ
11:59
8. Generating a Golden Dataset using Custom Agents + General Documents
17:28
9. Generating Golden Datasets Using Claude Code Agent Skills
09:25

Section 4. [Basic] Designing AI Agent Evaluation: What and How to Measure

5 lectures ∙ (1hr 15min)

Section 5. [Advanced] How to Read Agent Performance Metrics

3 lectures ∙ (22min)

Section 6. Outro: Review of overall content + Evaluation strategies tailored to service characteristics

1 lectures ∙ (5min)

Published: 04/13/2026

Last updated: 05/13/2026

Reviews

All

6 reviews

5.0

6 reviews

qkenr1321559
Reviews 8
∙
Average Rating 5.0
04/20/2026
Edited
5
33% enrolled
Jason's courses are ones I always trust and sign up for. I have taken all of the instructor's LangChain-related courses, and thanks to them, I am currently working as a junior AI Engineer. I had been worrying a lot about evaluation in my actual work, and since this course was released at the perfect time, I am planning to learn and apply it quickly. Thank you for always providing high-quality lectures. Additionally, this is a separate question, but I just found out that you recently published a book. I haven't purchased it yet, but I'd like to ask if it's worth studying with the book even though I've already taken all the courses. Your lectures feel like having a great mentor because you always explain and share things from the student's perspective. Once again, thank you for the great lectures as always. :)
- jasonkang
  Instructor
  04/20/2026
  Hello Seonggyu! Thank you for the great feedback. I'm so proud to hear that taking this course helped you in your career as an AI engineer, as it feels like the effectiveness of the course has been proven. Thank you for sharing. The book does cover a slightly wider variety of evaluation strategies and methods than the course. However, since the course covers evaluation theory sufficiently, I don't think you necessarily need to purchase the book if you've already completed the lectures (I probably shouldn't be saying this as someone selling the book 😅). I look forward to seeing you again with another great course!
- qkenr1321559
  04/20/2026
  Ah. Honestly, I'm so grateful and it makes me trust you even more because you were so straightforward..!! :) I'll continue to sign up for the early bird courses first thing in the future. I look forward to working with you!
ysj
Reviews 4
∙
Average Rating 5.0
04/22/2026
5
61% enrolled
nopainnogame6243
Reviews 5
∙
Average Rating 4.8
04/19/2026
5
100% enrolled
yangroro
Reviews 1
∙
Average Rating 5.0
05/13/2026
5
33% enrolled
hong3
Reviews 8
∙
Average Rating 5.0
04/29/2026
5
33% enrolled

jasonkang's other courses

Check out other courses by the instructor!

Developing LLM Applications Using RAG (feat. LangChain)

jasonkang

RAG. Learn from Silicon Valley GenAI Hackathon Winner. Packed with real-world know-how.

Basic

LLM, RAG, LangChain

Developing LLM Applications Using RAG (feat. LangChain)

jasonkang

LangChain Fundamentals in One Hour

jasonkang

A basic LangChain course that extracts only the essentials from the official documentation, all for the price of a cup of coffee(?). Get familiar with LangChain syntax before diving into full-scale development!

Basic

prompt engineering, LLM, LangChain

LangChain Fundamentals in One Hour

jasonkang

AI Agent Development Using LangGraph (feat. MCP)

jasonkang

LangGraph, packed with a major corporation's AI Agent lead's know-how. We deliver knowledge gained from real-world challenges.

Basic

prompt engineering, LLM, AI Agent

AI Agent Development Using LangGraph (feat. MCP)

jasonkang

Work Automation AI Agent Ready for Immediate Use in Companies (w. n8n, LangGraph)

jasonkang

Artificial Intelligence, AI, agents... They might seem grand, but once you try them, they're not as difficult as you think. That's why it's important to implement simple features yourself. Through practical projects that can actually be used at a company, you'll directly experience various use cases and learn how to utilize and apply AI agents.

Basic

n8n, AI, LangChain

Work Automation AI Agent Ready for Immediate Use in Companies (w. n8n, LangGraph)

jasonkang

Building an LLM Chatbot with Flutter (feat. Gemini)

jasonkang

New to Flutter? A Flutter Contributor will guide you step-by-step! Start Flutter dev with an AI project using Gemini.

Basic

Flutter, Chatbot, gemini

Building an LLM Chatbot with Flutter (feat. Gemini)

jasonkang

AWS deployments that can be applied directly to practice

jasonkang

This is for those who want to deploy/operate services with AWS. From domain settings to Docker and serverless!

Basic

AWS, Docker, aws-ecs

AWS deployments that can be applied directly to practice

jasonkang

Storybooks and UI tests that can be applied directly to practice

jasonkang

How to Use Storybooks A to Z. We show you everything about storybooks.

Basic

storybook, ui-testing, React

Storybooks and UI tests that can be applied directly to practice

jasonkang

Front-end testing basics in 2 hours

jasonkang

Test code! For those who are at a loss as to where to start, here it is. From writing tests to deploying through automation, all in one place!

Basic

React, Cypress, Jest

Front-end testing basics in 2 hours

jasonkang

Similar courses

Explore other courses in the same field!

[Practical AIoT] Perfect Preparation for Smart Mirror Makerthon: LLM, CV, and Hardware Design

kodekorea

Solve the point where 80% get stuck at makeathons. Complete Raspberry Pi · Computer Vision · LLM · 3D Design in 4 weeks! Achieve top rankings at makeathons with a demonstrable smart mirror PoC!

Basic

Python, Raspberry Pi, Arduino

[Practical AIoT] Perfect Preparation for Smart Mirror Makerthon: LLM, CV, and Hardware Design

kodekorea

Silicon Valley Engineer's Guide to LangChain, LangGraph, and MCP

altoformula

Become a pioneer in the latest large language model (LLM) technology through Langchain online lectures! This course provides practical skills and innovative knowledge that will upgrade your career. #LangChain #LangGraph #LangSmith #MCP

Basic

LLM, LangChain, prompt engineering

Silicon Valley Engineer's Guide to LangChain, LangGraph, and MCP

altoformula

Building OCR that actually works in real-world scenarios, here's how to do it.

nexthumans

If you want to properly learn OCR technology that's truly used in practice, this one course is all you need! Aiming for over 98% accuracy even with unstructured documents and complex layouts, based on the latest SOTA models and real-world know-how, we build enterprise-level OCR projects together.

Basic

Python, AI, openai

Building OCR that actually works in real-world scenarios, here's how to do it.

nexthumans

Getting Started with AI Agents Right Away – From Essential Basics to Practical Knowledge That Everyone Needs to Use Immediately!

kyoungsh7152

This is a course that beginners can follow along with in a fun and easy way. Beyond simple chatbots, AI Agents that automate industry-specific business workflows. This course quickly covers the basic structure and core technologies of AI Agents (LangChain, LangGraph, RAG) in 1.5 hours, and is an introductory course where you build practical skills by directly creating mini agents that work with actual code. After completing the course, you'll understand the necessity of industry-specific data preprocessing and scalable design, and be prepared for advanced concepts covered in deeper courses.

Beginner

Python, RAG, AI Agent

Getting Started with AI Agents Right Away – From Essential Basics to Practical Knowledge That Everyone Needs to Use Immediately!

kyoungsh7152

Building an AI Recommendation System by a Working Engineer | Recommendation Algorithm | Recommender | Recsys

Jay

This course covers everything from core recommendation system algorithms to practical implementation. - Content-based filtering - Collaborative filtering and deep learning-based recommendation model implementation - Two-step recommender systems implementation - Hands-on practice using PyTorch/RecBole - Industry know-how and recommendation result visualization

Basic

Python, Recommendation System, AI

Building an AI Recommendation System by a Working Engineer | Recommendation Algorithm | Recommender | Recsys

Jay

Autonomous Driving with Python

hjk1000

Why this course is special: Key Advantages • Intuitive Visualization: Directly observe algorithm operations in real-time with Pygame 2D simulations • Practical Implementation Experience: Go beyond theory and internalize autonomous driving algorithms by coding directly • Master Core Algorithms: Focused learning of essential algorithms such as Dijkstra, Pure Pursuit, ICP, etc. • Step-by-step Advanced Learning: Systematic difficulty progression from basics to SLAM • Lidar-based SLAM: Practical map building and localization in unknown environments

Basic

Python, Autonomous Driving, slam

Autonomous Driving with Python

hjk1000

DDPM to DDIM, Complete Mastery of Diffusion Through Implementation I

Sotaaz

This course is a hands-on masterclass that completely conquers the evolution of Diffusion Models through papers and code implementation. You'll learn the core models of generative AI, including DDPM (Denoising Diffusion Probabilistic Model) and DDIM, by studying the paper principles and implementing them directly. We analyze step-by-step the background of each model's emergence, mathematical formulations, network architectures (U-Net, VAE, Transformer), training processes (Noise Schedule, Denoising Step), and the ideas that led to performance improvements. Students will directly code all models using PyTorch, gaining not just paper comprehension but 'practical skills to reproduce and apply' them in real-world scenarios. Additionally, by comparing the differences between models and their developmental flow, you'll clearly understand how they expand and evolve. This course integrates theory, code, and practice into one comprehensive journey, providing researchers, developers, and creators alike with a systematic way to master the evolution of generative models. Beyond simply 'reading' papers, start your experience of 'understanding and recreating' through direct implementation now.

Basic

Python, Deep Learning(DL), AI

DDPM to DDIM, Complete Mastery of Diffusion Through Implementation I

Sotaaz

Hands-on! Building Intermediate AI Agent Services with LangChain and LangGraph: From RAG to Multi-Agents

YoungJea Oh

Simple tutorials alone make it difficult to apply in practice. I will clearly pass on my professional know-how, covering complex state management and multi-agent design methods.

Basic

AI, ChatGPT, prompt engineering

Hands-on! Building Intermediate AI Agent Services with LangChain and LangGraph: From RAG to Multi-Agents

YoungJea Oh

Codex with Silicon Valley Engineers

altoformula

From a developer who only used ChatGPT to a developer who handles AI agents. Learn practical ways to maximize coding productivity using Codex's Rules, Hooks, Skills, and MCP.

Beginner

AI, Python, codex

Codex with Silicon Valley Engineers

altoformula

Creating a YouTube Video Summarization AI Using the GPT API

Essential

The goal is to master the complex GPT API and Python in the easiest way possible through hands-on practice. You will develop a YouTube video summarization AI using the latest ChatGPT API and implement it as a web application using Streamlit.

Basic

Python, Big Data, AI

Creating a YouTube Video Summarization AI Using the GPT API

Essential

Artificial Intelligence with Python

hjk1000

Deep learning is a technology that learns data through neural networks composed of combinations of complex functions. In this lecture, we will mathematically understand the core concepts of deep learning and analyze them from the perspective of matrix operations. In particular, utilizing Python's NumPy library, we will visually examine how parameters are updated by directly implementing the forward and backward propagation processes of deep learning. Even the seemingly complex neural network structure becomes clear when analyzed with matrix operations. This lecture focuses more on understanding concepts than coding and is suitable for students who wish to intuitively grasp the principles of deep learning mathematically.

Basic

Python, Numpy, Tensorflow

Artificial Intelligence with Python

hjk1000

(Using Raspberry Pi) Building an AI Artificial Intelligence Autonomous Driving Car

usefulit

This is a hands-on course where you'll build an AI-based autonomous driving car using Raspberry Pi and various sensors.

Beginner

Python, Raspberry Pi

(Using Raspberry Pi) Building an AI Artificial Intelligence Autonomous Driving Car

usefulit

AI Comment Automation Program Development Lecture (Naver Blog)

lread90

Chatgpt is a program that reads and comments on posts written by my neighbors Lesson on developing marketing automation and neighbor management programs

Basic

Python, Naver Searching Keyword, Blog

AI Comment Automation Program Development Lecture (Naver Blog)

lread90

Getting Started with AI in 2026: How Should Students/Graduate Students/Developers Begin with Artificial Intelligence?

anjaeju

- I am a Research Engineer/AI PM running a 4-year-old AI startup. - This video is a lecture for those who want to start studying artificial intelligence "now" or in "2026". - Looking at college students, I see many who have no idea how to get started with artificial intelligence. - I hope that after watching this lecture, you'll be able to start studying artificial intelligence. - For reference, this lecture is not about "using AI in my work" or "getting started with monetization methods using GPT". - This is a video about how students, graduate students, or developers can get started when they want to study artificial intelligence.

Beginner

Python, AI, Machine Learning(ML)

Getting Started with AI in 2026: How Should Students/Graduate Students/Developers Begin with Artificial Intelligence?

anjaeju

Manuscript Generator Program Development Lecture (Chatgpt API)

lread90

During the Gold Rush, it was said that it was easier to become rich by selling gold mining tools than by mining gold yourself. How about selling programs in the ChatGPT Age of Exploration?

Basic

ChatGPT, AIPRM, REST API

Manuscript Generator Program Development Lecture (Chatgpt API)

lread90

Digital Transformation Using AI

pnuswedu

Learn machine learning techniques using Python and improve your ability to extract information from real-world data and develop predictive models!

Beginner

AI, RPA, Python

Digital Transformation Using AI

pnuswedu

Creating a YouTube AI Employee with ChatGPT and Python

SungYong Lee

Create a program using the GPT API, and even generate images and videos!

Basic

ChatGPT, gpt, Python

Creating a YouTube AI Employee with ChatGPT and Python

SungYong Lee

Practical Harness Engineering Completed in 2 Hours

knodark74

Building an MVP with AI is no longer difficult. However, most projects stop at the next stage. 👉 You’ve built the features 👉 But development doesn't continue Why does this happen? The problem isn't the code; 👉 It's because there is no structure that allows AI to work continuously. --- In this course, based on an existing project, 👉 We will cover the process of building a structure 👉 That allows AI to continue development on its own. --- Instead of simply using AI tools, we will: * Create a docs structure * Define the SSOT (Single Source of Truth) * Execute development on a per-ticket basis * Connect QA and iteration cycles 👉 To complete a single "AI Development System." --- Through this process, you will: 👉 Create a structure where development continues 👉 Even without a human manually coding. In other words, you can: 👉 Build and understand a development system 👉 That operates AI like a team. --- This course is designed for: 👉 Those who want to build a structure applicable to their projects immediately 👉 Those who started development with AI but found it difficult to sustain 👉 Those who want to move to the next level after "Vibe Coding" --- Beyond just learning, we provide an experience where you: 👉 Take action 👉 Build a structure that actually works 👉 And walk away with a framework you can apply to your own projects.

Basic

Python, cursor, ChatGPT

Practical Harness Engineering Completed in 2 Hours

knodark74

Automating AI-generated YouTube Shorts with one click (with n8n)

nightdaycoding

Automate YouTube Shorts creation with AI!! Learn how to automatically create YouTube Shorts using n8n from scratch. We will personally build a workflow that handles everything at once: Text → Image/Music → Video Creation → Uploading. Practice includes distinguishing between test/production modes and tips for saving costs. By the end of the lecture, you will complete 3 automation templates that can be used immediately. We will learn step-by-step, starting from node placement, so you can follow along even without coding knowledge. Join us in automating AI YouTube Shorts production!

Beginner

Python, youtube-api, n8n

Automating AI-generated YouTube Shorts with one click (with n8n)

nightdaycoding

Just 1 hour! Creating 'My Own AI Senior Developer' to install on my computer (Antigravity Vibe Coding) [Source code provided]

codebridge

[Source Code Provided] No coding knowledge required. Build it instantly in your browser using Google's latest tool (IDX) without any installation! Stop studying coding syntax! This is an ultra-fast, hands-on course where you'll build an RAG chatbot that perfectly understands internal company documents through AI conversation (Vibe Coding) in the Google IDX environment and deploy it live to the web.

Beginner

Python, AI, LLM

Just 1 hour! Creating 'My Own AI Senior Developer' to install on my computer (Antigravity Vibe Coding) [Source code provided]

codebridge

Evaluation methods for stable AI agent service operation

What you will gain after the course

The AI agent you've worked so hard onIs it okay to deploy?

🤯

😢

🤔

😳

What do you need when you need certainty?It is none other than 'AI Agent Evaluation'.

The start of a stable serviceAI Agent Evaluation

Characteristics of AI agents that differ from existing software

Indeterminacy of AI

Unstructured problems

Dynamic System

If you fail to properly monitor changes in your AI agent,your service could collapse at any time.

Immediately applicable in practiceAI Agent Evaluation Methods

01.

Cost- and time-savingGolden Dataset Construction

RAGAS

Custom Agent

Claude Code Skill

02.

Adopted by Big TechAgent Evaluation Methods

E2E + Component Evaluation

03.

Anthropic's Guide onHow to Quantify Agent Performance

pass@k

pass^k

📚

Introduction to the Learning Curriculum

Necessity of AI Agent Evaluation

Golden Dataset Construction Strategy

Designing AI Agent Evaluation Metrics

Advanced Quantitative Analysis of Agent Performance

We can solve the concerns of these people!

📌

AI Agent Developer

📌

AI Service Operations Manager

📌

LLM-based Service Planner

Notes before taking the course

Recommended for these people

HelloThis is jasonkang

Curriculum

Reviews

jasonkang's other courses

Similar courses

The AI agent you've worked so hard on
Is it okay to deploy?

What do you need when you need certainty?
It is none other than 'AI Agent Evaluation'.

The start of a stable service
AI Agent Evaluation

If you fail to properly monitor changes in your AI agent,
your service could collapse at any time.

Immediately applicable in practice
AI Agent Evaluation Methods

Cost- and time-saving
Golden Dataset Construction

Adopted by Big Tech
Agent Evaluation Methods

Anthropic's Guide on
How to Quantify Agent Performance

Recommended for
these people

Hello
This is jasonkang