강의

멘토링

커뮤니티

Data Science

/

Data Analysis

Text Mining Practical Project - Analyzing News Data

You have learned the basics of programming, crawling, and text mining, but have you ever felt lost when it comes to actually doing a project? This is a lecture where you will go through a project with me from start to finish.

(3.2) 5 reviews

151 learners

  • coco
3시간 만에 완강할 수 있는 강의 ⏰
R
Big Data
Web Crawling

What you will gain after the course

  • News data analysis

  • Top keyword visualization

  • word2vec

  • Recommendation and search system

🙆🏻‍♀ This is a practical text mining project. This course covers everything from news data collection to extracting and visualizing monthly top keywords, and even creating a news recommendation system! 🙆🏻‍♂

🗒 Course Introduction

You've learned the basics of programming, crawling, and even taken a text mining course, but are you still feeling overwhelmed when it comes to actually working on a project? This course will walk you through a project from start to finish. This course will cover the following:

🌈 News data collection 

Nate News collects 400 articles per day across all categories in 2019.

🌈 News data preprocessing and top keyword extraction 

Nate News collects 400 articles per day across all categories in 2019.

🌈 Visualize Top Keywords with Excel

Let's visualize daily/monthly top keywords in Excel.

🌈 Visualize top keywords with charts

Nate News collects 400 articles per day across all categories in 2019.

🌈 Word2vec

A basic and widely used method of word representation is 'word2vec'. Let's learn about the concept and train it with news data.

🌈 Create search and recommendation models

We create a news search recommendation model by creating a sentence vec from the news title and using cosine similarity.

🙋🏻‍♂️ I'm curious!

Q. Can I listen without knowing R at all?
A. You should have a basic understanding of the R language, web crawling, and text mining to easily follow this course. 😭😭. I recommend taking the free R programming introductory course and the text mining course.

Recommended for
these people

Who is this course right for?

  • Anyone interested in trying out a text mining project

  • Anyone who wants to analyze news data

Need to know before starting?

  • R programming

  • Web crawling

  • Text Mining Basics

Hello
This is

8,335

Learners

505

Reviews

136

Answers

4.4

Rating

20

Courses

학부에서는 통계학을 전공하고 산업공학(인공지능) 박사를 받고 여전히 공부중인 백수입니다.

 

수상

ㆍ 제6회 빅콘테스트 게임유저이탈 알고리즘 개발 / 엔씨소프트상(2018)

ㆍ 제5회 빅콘테스트 대출 연체자 예측 알고리즘개발 / 한국정보통신진흥협회장상(2017)

ㆍ 2016 날씨 빅데이터 콘테스트/ 기상산업 진흥원장상(2016) 

ㆍ 제4회 빅콘테스트 보험사기 예측 알고리즘 개발 / 본선진출(2016)

ㆍ 제3회 빅콘테스트 야구 경기 예측 알고리즘 개발 / 미래창조과학부 장관상(2015)

* blog : https://bluediary8.tistory.com

주로 연구하는 분야는 데이터 사이언스, 강화학습, 딥러닝 입니다.

크롤링과 텍스트마이닝은 현재는 취미로 하고있습니다 :) 

크롤링을 이용해서 인기있는 커뮤니티 글만 수집해서 보여주는 마롱이라는 앱을 개발하였고

전국의 맛집리스트와 블로그를 수집해서 맛집 추천 앱도 만들었었죠 :) (시원하게 말아먹..)

지금은 인공지능을 연구하는 박사과정생입니다.

 

 

 

 

Curriculum

All

14 lectures ∙ (3hr 16min)

Course Materials:

Lecture resources
Published: 
Last updated: 

Reviews

All

5 reviews

3.2

5 reviews

  • 3812kim2408님의 프로필 이미지
    3812kim2408

    Reviews 4

    Average Rating 3.5

    3

    21% enrolled

    Just like that

    • gooddoctor8228님의 프로필 이미지
      gooddoctor8228

      Reviews 17

      Average Rating 4.6

      3

      36% enrolled

      Great lecture. It's a bit of a shame it's not in Python.

      • dhlim8093님의 프로필 이미지
        dhlim8093

        Reviews 1

        Average Rating 4.0

        4

        100% enrolled

        It was a good lecture.

        • gdkmh81211306님의 프로필 이미지
          gdkmh81211306

          Reviews 2

          Average Rating 5.0

          5

          36% enrolled

          I bought a lot because it helped me a lot with network analysis. I hope there will be more lectures.

          • indigo님의 프로필 이미지
            indigo

            Reviews 5

            Average Rating 3.8

            1

            100% enrolled

            The lecture content and the lecturer's diction are very poor. The Word2Vec part did not even provide the source code. Since the lecture was conducted spontaneously without any preparation, the lecture itself is disorganized. And if this person's lecture content were made into text and analyzed for frequency, the phrase 'Ja~' would probably be the most frequent. It's annoying to listen to.

            $26.40

            coco's other courses

            Check out other courses by the instructor!

            Similar courses

            Explore other courses in the same field!