This hands-on course has you build generative AI applications that work with text, images, voice, and documents using the OpenAI API.
It begins with setting up Anaconda and Jupyter Notebook, then covers the development environment configuration needed for real-world work, including API key management and an understanding of tokens and costs.
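As a taste of the setup topics, here is a minimal sketch of API key handling and token-based cost estimation. It assumes the key is stored in the `OPENAI_API_KEY` environment variable; the helper names `load_api_key` and `estimate_cost` are hypothetical, and the prices are placeholder values for illustration only, not actual OpenAI rates.

```python
import os

# Hypothetical prices in USD per one million tokens (placeholders only).
# Real prices differ by model and change over time.
INPUT_PRICE_PER_MILLION = 1.00
OUTPUT_PRICE_PER_MILLION = 4.00


def load_api_key(env_var: str = "OPENAI_API_KEY") -> str:
    """Read the API key from the environment instead of hard-coding it."""
    key = os.environ.get(env_var)
    if not key:
        raise RuntimeError(f"Set the {env_var} environment variable first.")
    return key


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD from its token counts."""
    total = (
        input_tokens * INPUT_PRICE_PER_MILLION
        + output_tokens * OUTPUT_PRICE_PER_MILLION
    )
    return total / 1_000_000
```

Keeping the key in an environment variable keeps it out of notebooks and version control, and input and output tokens are priced separately because they are typically billed at different rates.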
Using the latest Responses API, you'll implement text generation, summarization, classification, Vision (image understanding), voice processing, and PDF input processing. Along the way, you'll practice step by step the core features used in production work, such as Function Calling, Structured Outputs (Pydantic), Embeddings, and RAG (File Search).
You'll also learn how to go beyond simple API calls and build 'intelligent AI services' with Web Search, Code Interpreter, Streaming, Background tasks, and Conversation State management.
Finally, you'll implement agent-based AI systems that autonomously select and execute tools using the Agents SDK and MCP (Model Context Protocol), and learn the structure and design considerations needed for actual service development.