LDAs
View the Project on GitHub loperntu/lads
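A data scientist's work can be seen as an interactive process of exploring, predicting, and interpreting the meaning of data. Linguistic analysis, in turn, is key to understanding the semantics and sentiment expressed in text data. This course combines current statistical programming and natural language processing techniques with a concise, beginner-friendly design and hands-on guidance, so that students with no prior programming background can reach the following learning goals: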
Week | Date | Topic | Lab |
---|---|---|---|
1 | 09/17 | Orientation | |
2 | 09/24 | Introduction to Data Science and Text Analytics | RStudio.Agilearning |
3 | 10/01 | Introduction to Data Science and Text Analytics | Linux.command-line |
4 | 10/08 | Preparing / Preprocessing text and linguistics ABC | R programming: chapter 1-4 |
5 | 10/15 | Preparing / Preprocessing text and linguistics ABC | R programming: chapter 5-7 |
6 | 10/22 | Exploratory data analysis and Infographics | Data Manipulation: chapter 1-2 |
7 | 10/29 | Exploratory data analysis and Infographics | Data Visualization: 1-3;7-10 |
8 | 11/05 | Corpus and natural language processing | Textual data manipulation: Regular Expression |
9 | 11/12 | Corpus and natural language processing | R and Statistics: course one (five chapters) |
10 | 11/19 | Text classification and clustering | |
11 | 11/26 | mini-Hackathon | Start planning the final showcase |
12 | 12/03 | Text classification and clustering | |
13 | 12/10 | Topic modeling | |
14 | 12/17 | Sentiment analysis | |
15 | 12/24 | Stylometrics and personality detection | |
16 | 12/31 | Discussion | |
17 | 01/07 | Term project presentation (un-conference) | |
18 | 01/14 | Final term project and report due | |
謝舒凱 (Aber) <shukaihsieh@ntu.edu.tw>
施孟賢 (Simon) <simon.xian@gmail.com>
張瑜芸 (Taco) <yuyun.unita@gmail.com>
The course slides cover the basic concepts. If you are interested in more advanced material, see the online resources below.