Getting started with Python data analysis using public data
There was a rumor that Ediya would open a store near Starbucks. How different would the locations of Ediya and Starbucks be? Will the real estate price fluctuation trend from 2013 to 2019 be reflected in the apartment sales price? What kind of parks are there in my neighborhood? How can I utilize the data in the public data portal? The goal is to become familiar with Python and various data analysis libraries by handling various types of data through public data.
Part of the Seoul City COVID-19 outbreak status analysis course is being released in “Data Analysis and Visualization in Just Two Documents,” and you can view the entire analysis even if you do not take the course.
hello.
It's been over a year since I started running the course " Data Analysis and Visualization in Just Two Documents ".
Like the course “Getting Started with Python Data Analysis with Public Data,” this course will also be updated regularly.
The course " Data Analysis and Visualization in Just Two Documents " has added content on the analysis of the current status of COVID-19 in Seoul.
And to celebrate the update, we're offering a 20% discount until July 24th .
Even if you don't take the course, you can view some of the lectures in " Data Analysis and Visualization in Just Two Documents " and download the entire source code from the course .
We crawl, preprocess, analyze, and visualize the confirmed case status data released by Seoul Metropolitan Government as of June ( https://www.seoul.go.kr/coronaV/coronaStatus.do ) using Pandas.
Analyze what you have learned in just two documents into a project similar to your current job
We will analyze the Seoul City COVID-19 outbreak status site using Pandas, from crawling to preprocessing, analysis, and visualization.
We directly analyze data that we frequently encounter through news articles and daily life.
- Which district has the most confirmed cases?
- Which hospital treated the most confirmed cases?
- Are there any hospitals that people are frequently transferred to by district?
- Which district has the most confirmed cases from overseas?
- How can I preprocess texts from multiple countries, such as Europe, South America, etc?
- How much does the number of confirmed cases coming from overseas vary from month to month?
Understand and practice data preprocessing methods using Pandas.
- How do I get the year, month, day, day of the week, and week number from a text date?
- How can we calculate the cumulative number of confirmed cases using confirmed case status data?
- What is the difference between groupby, crosstab, pivot, and pivot_table, and which function is more appropriate to use?
Understand the data structure of data frames and series and process them into a form suitable for analysis.
- How should I create a data frame for drawing a graph using Pandas' plot?
- How can I change my dataframe if I want to display values in different colors based on their categorical values in the graph?
- Is there any way to convert a series to a dataframe?
Check it out for yourself in class!





Thanks to all the questions and great reviews you've left over the past year, I've been able to update the course.
We will continue to improve and update the course in the future.
If you leave a good review, it will be a great help in continuously improving the content.
I hope we can get rid of COVID-19 and get back to our normal lives!
thank you


