Introduction to Python Text Analysis - From Data Collection to Analysis
This course is for those who have learned the basic grammar of Python and will learn the core skills used in text data analysis. I hope that those who are interested in text analysis will gain more ideas through this course and use it for a long time.
Python text analysis that anyone can do, Useful from data collection to visualization and modeling!
Text miningData VisualizationTopic Modeling
# From collection to analysis
You've learned basic Python and want to try text mining, but you're feeling overwhelmed and unsure where to start? Even if you try studying from books, the content is often theoretical or extensive. However, you can achieve practical analysis without having to diligently use cutting-edge technologies like deep learning. This course focuses on fundamental and highly applicable topics, covering everything from data collection to analysis .
# Curriculum created by two field professionals
This course was developed in collaboration with two practitioners who use text analysis in their work. The lectures are also delivered in person. While there are pros and cons, this approach allowed us to develop more detailed course materials and prepare us to deliver the core of the techniques we've actually used in the field.
# Can non-majors take the course? 🙋🏽♀️
Of course. The material covered in this lecture isn't all that difficult once you understand it. If you're familiar with basic Python grammar, you can follow along, but you may need to bring in some math knowledge for additional explanation (e.g., logarithms, matrix multiplication).
Structure of this lecture 📚
Text data collection ✒️
Learn how to collect data from websites with Python.
Keyword Extraction and Word Cloud ✏️
Learn how to extract keywords from text and visualize them as word clouds.
Topic Modeling💡
Learn about topic modeling, a method for dividing documents into topics.
Provision of lecture materials
We provide you with a complete, well-organized set of materials. Open and view only the parts you need at any time.
This is a good lecture for easy introduction. However, I think you need to have basic knowledge of pandas and numpy to easily follow the practical lecture.