Mastering data analysis and visualization with just two pages of documentation
When using Pandas for data analysis, preprocessing, machine learning, and deep learning, we often find ourselves relying on the same core functions.
There is a Pandas Cheat Sheet that compiles only these essential features. For those who have grown tired of trying to learn Pandas through thick books, this two-page document will teach you Python Pandas. Master the core functions of Pandas using the cheat sheet provided by the official Pandas tutorial.
I want to make your analysis method mine. Thank you.
5.0
세바스찬 주니어 3세
89% 수강 후 작성
As a beginner, I think this is a great lecture that will help you learn the overall concept of Pandas and learn thoroughly how to use it for actual analysis. I have taken several lectures on Pandas, but I think this is the only lecture that explains the details one by one with such quality. I will review it again and apply it to actual business data and analyze it. Thank you so much for making such a great lecture!!
5.0
pooh9431
37% 수강 후 작성
It's a good curriculum. Even a beginner like me could see the operating principles and results at the same time. What I like the most is the Q&A part. If there's something you don't know, you can ask the teacher questions and get answers right away. And the teacher's voice has such a good wavelength that it's easy to concentrate.
강의상세_배울수있는것_타이틀
Key features from the Pandas Cheat Sheet
Python Data Analysis
Data preprocessing for machine learning and deep learning
Data Visualization
Crawling, Preprocessing, Analyzing, and Visualizing the Seoul COVID-19 Site with Pandas
Python Pandas Data Analysis, all the essentials at once!
With just two pages of documents, can you really master Pandas data analysis?
Pandas is a library that allows you to use Excel-like functions in Python.
Large-scale data processing that Excel can't handle is also OK.
Pandas is a Python data analysis library for data analysis and preprocessing.
When performing data analysis, preprocessing, or visualization with Python Pandas, you tend to use certain core functions most of the time. And there is a cheat sheet that summarizes these essential functions in just two pages.
However, if you try to study a cheat sheet containing only these core concepts on your own, you might feel overwhelmed about what content to practice and how to do it. In this course, we will extract and explain only the core content of the cheat sheet and guide you on how to easily use the complex Matplotlib through Pandas. Shall we take on the challenge together?
Who is this for?
Those who want to perform preprocessing required for data analysis and ML/DL
Those who have been flustered when a large data file wouldn't open in Excel
Those who are tired of complex formulas while analyzing data using Excel
Those who have implemented various formulas in Excel but had to work overtime due to slow speeds
📣 Please check the prerequisite knowledge!
Python, Jupyter Notebook, Anaconda, and the concept of rows/columns in Excel are required as prerequisite knowledge.
Focusing on official documentation, easily, quickly, and accurately.
One, utilizing docstrings in Jupyter Notebook
You don't need to memorize many methods. The functions actually used are limited. In this course, I will show you how to practice by looking up official documentation within Jupyter Notebook so that you can learn on your own by referring to help menus and documents.
Two, how to do Pandas data visualization twice as well!
Do you know which graphs are appropriate for which types of data? In this course, we examine the differences and usage of bar charts, frequency distribution tables, histograms, and normal distributions. Furthermore, we explain various Python visualization methods and how to use options through the official Pandas documentation.
Bonus, including Series and DataFrame data visualization!
Additional Update! Real-world data analysis projects.
We will analyze the Seoul COVID-19 status website using Pandas, covering everything from data crawling to preprocessing, analysis, and visualization. You can apply what you've learned from just two pages of documentation to a project similar to real-world tasks. (Section 13)
First, we will directly analyze data that we frequently encounter in our daily lives through the news.
Which district had the highest number of confirmed cases?
Which hospital treated the most confirmed cases?
Are there specific hospitals that receive a high number of transfers from each district?
Which district had the highest number of confirmed cases from overseas?
How can we perform text preprocessing to group various countries into regions like Europe or South America?
What are the monthly differences in the number of confirmed cases from overseas inflows?
Second, understand and practice data preprocessing methods using Pandas.
How can we extract the year, month, day, day of the week, and week from a date in text format?
How can we calculate the cumulative number of confirmed cases using the status data?
What are the differences between groupby, crosstab, pivot, and pivot_table, and which function is most suitable to use?
Third, understand the data structures of DataFrames and Series, and process them into a format suitable for analysis.
What should the structure of a DataFrame look like to draw a graph using Pandas' plot?
If you want to represent values in different colors based on categorical values in a graph, how should you modify the dataframe?
What are the different ways to convert a Series into a DataFrame?
Python visualization & analysis examples, check them out directly in the lecture!
Curious about the knowledge sharer who created this course? 👩💻
Instructor Jo-eun Park X Inflearn Interview
강의소개.콘텐츠.추천문구
학습 대상은 누구일까요?
Those who want to perform preprocessing for data analysis, machine learning, and deep learning using Python.
Those who have been flustered when a file failed to load while trying to open a large amount of data in Excel.
Those who are tired of complex Excel formulas
Those who had to work overtime because Excel was slow despite implementing various formulas.
Instructor and Curriculum Designer for Data Science at Naver Connect Foundation Boostcourse
Seoul National University Big Data Innovation Sharing University, Seoul National University Continuing Education Center, Yonsei University DX Academy, Hanshin University ABC Camp, Hanyang University Graduate School, Chonnam National University,
Lectured at numerous educational institutions and corporations, including Korea Management Association (KMA), IGM Institute for Global Management, Samsung SDS Multicampus, LIKE LION, Fast Campus, Modulabs, and National Cancer Center.
Corporate data analysis across various domains (pharmaceuticals, telecommunications, automotive, commerce, education, government agencies, etc.)
Over 20 years of professional experience as a web backend developer and data analyst across various domains, including gaming, advertising, and education.
Overall, it was a good course. I studied Pandas on my own with a book and applied for this course for the purpose of repeated learning and application of Pandas.
The Pandas cheat sheet course was actually not that great. It won't be of much help to those who are new to Pandas, new to Python, or very beginners. It's okay as a YouTube video, but it's a bit disappointing as a paid lecture.
The Seoul City Corona Analysis course was good. It was a very useful part for me who studied the basics of Pandas because I was able to practice how to process data by applying Pandas to obtain the desired form and results.
And the tips given here and there were also very helpful. Like shift-tab, or using ? after a function to see the dot string... I think these are great tips...
Summary in one line: I don't recommend it to those who are just starting out with Pandas, but I highly recommend it to those who know a little about Pandas.
After graduating from the Department of Statistics and working as a data analyst, I quit my job and am studying data analysis using Python through Park Jo-eun's lecture. It was interesting to follow the lecture because it was easy to understand using timely data rather than obvious example data. I also watch YouTube a lot. Thank you!
It's a good curriculum. Even a beginner like me could see the operating principles and results at the same time. What I like the most is the Q&A part. If there's something you don't know, you can ask the teacher questions and get answers right away. And the teacher's voice has such a good wavelength that it's easy to concentrate.
As a beginner, I think this is a great lecture that will help you learn the overall concept of Pandas and learn thoroughly how to use it for actual analysis. I have taken several lectures on Pandas, but I think this is the only lecture that explains the details one by one with such quality. I will review it again and apply it to actual business data and analyze it. Thank you so much for making such a great lecture!!