Response time slows down
Blindly attaching search, reranking, summarization, and verification steps may improve accuracy, but it will cause service latency to increase drastically.
Learn the core principles and practical implementation of Classic RAG, Graph RAG, and Agentic RAG. You will design dynamic routing systems that optimize token efficiency and latency, and build long-term memory systems that combine GraphRAG's relational network reasoning with Agentic RAG's self-evaluation loops. This course is designed to complete your expertise in designing advanced RAG architectures that can be immediately applied in professional practice.
31 learners
Level Intermediate
Course period Unlimited
Reviews from Early Learners
5.0
tata
It is great to be able to understand the paradigm shift of RAG and the concept of hybrid systems, while also learning various ways to optimize RAG.
5.0
유진
This is an excellent lecture that provides a clear understanding of the new trends and overall architecture of RAG. At first, I thought I understood everything in my head, but I felt a bit lost and empty after each section, wondering, "Wait, how do I actually do this?" However, seeing how the instructor immediately incorporated the feedback and improvement requests I provided, I felt once again that this is a great instructor and a high-quality course. I am looking forward to the rest of the course even more as practice code and additional content are being added to each section. The content is fantastic, the explanations are clear, and the instructor is very responsive to feedback, so I highly recommend taking this course!
Architectural differences between Classic, Graph, and Agentic RAG and determining optimal use cases
Implementing a dynamic routing system considering token costs and latency
Deploying a long-term memory system combining GraphRAG and xMemory
This is a practical course where you will clearly understand the differences between Classic RAG, Graph RAG, and Agentic RAG. You will implement a dynamic routing system that selects the optimal pipeline based on query characteristics, as well as a long-term memory architecture based on GraphRAG and xMemory.
RAG systems are not just about putting documents into a vector DB and searching them. In an actual service, completely different designs are required depending on the nature of the question, document structure, cost, response speed, and depth of reasoning.
Blindly attaching search, reranking, summarization, and verification steps may improve accuracy, but it will cause service latency to increase drastically.
Simply including a lot of context for every question is straightforward, but it is difficult to avoid operational costs and context waste.
Simple similarity search alone makes it difficult to sufficiently handle relationships between multiple documents, cause and effect, and long-term context.
It is a system that determines the nature of a question and selects the most appropriate search, reasoning, and verification strategies. This course covers both the criteria for that judgment and the implementation structure.
Beyond simply typing along with basic implementation examples, this course focuses on developing the design intuition to independently determine which RAG structure to choose for any given situation.
Classic, Graph, and Agentic RAG's strengths, weaknesses, and suitable use cases can be clearly distinguished.
Design a hybrid routing structure that selects fast retrieval, deep reasoning, or self-evaluation loops based on the question type.
Learn advanced RAG systems featuring relational network reasoning and continuous memory structures by combining GraphRAG and xMemory.
This is the basic structure for quickly generating answers based on vector search and context injection. It is effective for simple questions, FAQs, and document-based responses.
It extracts entities and relationships from the text to create a knowledge graph and performs deeper reasoning by utilizing the connections between documents.
AI evaluates the search results, searches again if they are insufficient, and performs self-checks on the quality of the answer in an autonomous search loop.
Not all questions are processed in the same way. We design a structure that balances cost and quality by distinguishing between questions that require a quick answer, those that require relational reasoning, and those that require self-evaluation.
Determine the complexity of the question, the required context, and the depth of reasoning.
Dynamically selects the most appropriate path among Classic, Graph, and Agentic.
It executes document retrieval, relationship network exploration, and self-evaluation loops according to the situation.
Reduces token usage and latency while maintaining answer quality.
It is suitable for those who want to solve accuracy, token cost, and latency issues in existing RAG systems and design better retrieval and reasoning structures.
It is ideal for those who want to enhance their architecture design capabilities for complex knowledge management systems, document retrieval systems, and LLM-based services.
Recommended for those who want to understand the differences in RAG technologies and determine which architecture is necessary for production-level LLM products.
Summarize the RAG trilemma, structural differences between Classic/Graph/Agentic RAG, and the necessity of hybrid routers.
Expand basic RAG into a practical version through Advanced Classic RAG, token dieting, and GraphRAG adoption criteria.
Extract knowledge graphs from text and learn the structure of a Hybrid Searcher that utilizes relational networks.
It covers the core loop of Agentic RAG, which evaluates search results on its own and re-searches for missing information.
Build an advanced RAG structure capable of long-term memory and relational reasoning by combining GraphRAG and xMemory.
It expands the perspective on how to organize knowledge in a practical environment and how to design the paths for retrieval and reasoning.
Quickly search with Classic RAG, understand relationships with Graph RAG, evaluate autonomously with Agentic RAG, and design long-term memory with xMemory. This course will give you the insight to view RAG systems as practical architectures.
Who is this course right for?
AI engineers who need to optimize the performance of RAG systems
A backend developer designing a complex knowledge management system
AI Product Manager looking to build production-level LLM applications
Need to know before starting?
Understanding the Basic Concepts of Prompt Engineering and RAG
Experience in Python-based LLM API integration
Basic usage of vector databases (Pinecone, Weaviate, etc.)
Career Verified
1,165
Learners
107
Reviews
30
Answers
4.8
Rating
14
Courses
Based on my existing career and experience, I share know-how and tips while keeping up with global trends. I look forward to connecting with you!
Profile Website: https://codebridge-ai.pages.dev
Inquiries: codebridge747@gmail.com
🤖👾 US AI Master's Program
🏗 7th-year developer at a major IT company
📱 Currently developing and operating 14 Android apps, 7 iOS apps, and various websites
[Eng]
Based on my existing experience and knowledge, I am sharing the know-how and tips I want to provide while following global trends. Thank you for your support!
Profile : https://codebridge-ai.pages.dev
Contact: codebridge747@gmail.com
Developer at a major IT corporation in South Korea (6y +)
Bachelor's degree in Computer Engineering
All
30 lectures ∙ (2hr 54min)
Course Materials:
All
3 reviews
4.7
3 reviews
Reviews 9
∙
Average Rating 5.0
Reviews 10
∙
Average Rating 5.0
Edited
5
This is an excellent lecture that provides a clear understanding of the new trends and overall architecture of RAG. At first, I thought I understood everything in my head, but I felt a bit lost and empty after each section, wondering, "Wait, how do I actually do this?" However, seeing how the instructor immediately incorporated the feedback and improvement requests I provided, I felt once again that this is a great instructor and a high-quality course. I am looking forward to the rest of the course even more as practice code and additional content are being added to each section. The content is fantastic, the explanations are clear, and the instructor is very responsive to feedback, so I highly recommend taking this course!
Thank you so much for your kind words! It makes all the hard work I put into creating the lectures feel truly rewarding! 😂 I still have a lot to improve on, but I will continue to provide even better lectures and materials in the future. Thank you!
Reviews 3
∙
Average Rating 4.7
4
Overall, I think it was a great lecture. In particular, I have already developed a RAG service that retrieves answers using BM25+Vector, and I feel I can improve it further based on this course. However, one thing I wish there was more of is content on embedding techniques. I am currently chunking text or Markdown documents and storing them in a Qdrant vector database; I was hoping for some tips and know-how on how to efficiently store chunks and vector data, as well as how to retrieve that data effectively. Still, it was great to learn advanced technologies like GraphRAG and Agentic RAG!
Thank you! I'm glad it was helpful. Regarding the storage of chunks and vector data, I will update this lecture once the materials are ready. :)
Check out other courses by the instructor!
Explore other courses in the same field!
25% off for new members
$64.70
25%
$84.70