PLSC 21510/31510: Introduction to Text as Data for Social Science

Spring 2022

About

Social scientists increasingly use large quantities of text-based data to address problems in industry and academia. This course gives students an overview of popular techniques for collecting, processing, and analyzing text data from a social science perspective. We will first learn how to collect text data from a variety of sources, including application programming interfaces (APIs) and web scraping. The second portion of the class surveys popular methods for analyzing text data, including sentiment analysis, topic models, supervised classification, and word embeddings. The course is applied in nature: while many of the techniques we discuss originate in computer science or statistics, this is not a CS or statistics course. Ultimately, the goal is to introduce students to modern techniques for computational text analysis and help them apply these methods to their own research.

To use

Run the code below in R to download this repo onto your machine.

# Install tidyverse and usethis if you have not already done so.
# (usethis provides the use_course() function called below.)
# install.packages(c("tidyverse", "usethis"))

library("usethis")
use_course("https://github.com/rochelleterman/TAD-F22/archive/main.zip")

Warning!

These materials are still in development and will be changing.

Contact

Rochelle Terman, Assistant Professor in Political Science, University of Chicago [email protected]

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Folders and files

0_Intro
1_Strings
2_Collecting
3_Preprocessing
4_Describing
5_Dictionary
6_Unsupervised-1
7_Unsupervised-2
8_Supervised
9_Embeddings
.gitignore
A_Syllabus.md
B_Installation.md
LICENSE.md
README.md
TAD-S22.Rproj
