
모두의 한국어 텍스트 분석과 자연어처리 with 파이썬
박조은
파이썬 한국어 텍스트 분석과 자연어처리 워드클라우드 시각화, 형태소 분석, 토픽모델링, 군집화, 유사도 분석, 텍스트데이터 벡터화를 위한 단어 가방과 TF-IDF, 머신러닝과 딥러닝을 활용한 텍스트 분류, 허깅페이스 활용법
Basic
NLP, 텍스트마이닝, 머신러닝
The goal is to become familiar with Python and various data analysis libraries while handling various types of data through public data.
Let's visualize it using a visualization tool called plotnine, which can use ggplot syntax in Python.
I mainly use the Python standard library, Numpy, and Pandas.
Learn about and practice various preprocessing techniques required to obtain or process the desired data.
You will learn and be able to use the tools necessary to perform data analysis with Python.
By taking the lectures and solving assignments, you can master the parts you were curious about.
You can get a refund of your tuition just by completing the online study! :)
🌱 If you go alone, you can go fast, but if you go together, you can go far.
1. If you complete the course, we will provide a refund of 50,000 won .
- 100% progress in lecture videos
- Please submit three of the four assignments by the deadline. (Week 4 is mandatory!)
2. Study Period: June 24, 2019 - July 21, 2019
The study goes like this :)
- (Week 1) June 24th - June 30th: Opening remarks, Week 1 video and assignments
- (Week 2) July 1st - July 7th: Assignment code review and Week 2 video and assignment progress
- (Week 3) July 8th - July 14th: Assignment code review and Week 3 video and assignment progress
- (Week 4) July 15th - July 21st: Assignment code review and Week 4 video and assignment progress
* Please check the content of the video to be provided in “Curriculum”.
3. How to conduct the study
We will use Python, Pandas, ggplot(plotnine), Numpy, and Folium.
Let's analyze data from the public data portal using Pandas' reshape functions such as melt, concat, pivot, and transpose.
And we will summarize and analyze data using groupby, pivot_table, info, describe, value_counts, etc.
Visualize where each park is located in each region using Folium.
If you just want to listen to the lecture, please click below!
Q1. What is the Python version?
Python 3.6 or higher is available.
Q2. Does it matter if it's Windows or MAC OS?
While the OS doesn't matter, this tutorial was written for OSX. We'll provide a Colab link to help you learn regardless of your OS.
Q3. I'm not a computer major. Will I be able to follow along?
While this curriculum is already widely followed by non-programmers, individual differences may exist. The goal is to spark interest in programming through engaging case studies, even for those with no prior programming experience. I hope this study will provide an opportunity for you to discover and learn more about the topics you need.
Who is this course right for?
Anyone interested in becoming a data analyst
For those who want to learn data visualization with Python
Those who want to learn the basics of Python libraries such as Pandas and NumPy
Those who want to utilize public data
Anyone interested in data journalism
For those who want to find meaning through data
Anyone who wants to use data analysis in their work
18,782
Learners
787
Reviews
1,334
Answers
4.9
Rating
6
Courses
Microsoft MVP(Python Developer Technologies)
네이버 커넥트 재단 부스트코스 데이터사이언스 강의 설계 및 교수자
서울대 빅데이터혁신공유대학, 서울대 평생교육원, 연세대 DX Academy, 한신대 ABC Camp, 한양대 대학원, 전남대,
한국능률협회, 삼성SDS 멀티캠퍼스, 멋쟁이사자처럼, 패스트캠퍼스, 모두의연구소 등 다수의 교육기관 및 기업 강의
다양한 도메인(제약, 통신, 자동차, 커머스, 교육, 정부기관 등)의 기업 데이터 분석
20년이상 게임, 광고, 교육 등 다양한 도메인에서 웹 백엔드 개발자 및 데이터 분석가 현업 경험
All
41 lectures ∙ (5hr 33min)
Check out other courses by the instructor!