LSE DS202
Data Science for Social Scientists
📓 Syllabus
A list of what happens every week.
Click on each Week’s link for more information (slides, lab instructions, recommended resources, etc.).
Intro | |||
---|---|---|---|
🗓️ W01 | Lecture |
Introduction, Context & Key Concepts (James et al. 2021, chap. 2) |
|
Lab |
No class this week. (Use this time to revisit basic R programming) |
||
Supervised Learning | |||
🗓️ W02 | Lecture |
Simple and Multiple Linear Regression (James et al. 2021, chap. 3) |
|
Lab |
Revision of R : data structures, basic commands and some tidyverse
|
||
🗓️ W03 | Lecture |
Classifiers (Logistic Regression & Naive Bayes) (James et al. 2021, chap. 4) |
|
Lab | Linear Regression (James et al. 2021, chap. 3) | ||
Formative |
|
||
🗓️ W04 | Lecture |
Resampling methods (James et al. 2021, chap. 5) |
|
Lab | Classification Methods (James et al. 2021, chap. 4) | ||
🗓️ W05 | Lecture |
Non-linear algorithms (SVM & tree-based models) (James et al. 2021, chaps. 8–9) |
|
Lab | Cross-Validation and the Bootstrap (James et al. 2021, chap. 5) | ||
Summative |
|
||
Unsupervised Learning | |||
🗓️ W07 | Lecture |
Unsupervised Learning: Clustering (James et al. 2021, chap. 12) |
|
Lab | Tree-based models (James et al. 2021, chap. 8) | ||
🗓️ W08 | Lecture |
Unsupervised Learning: Dimensionality Reduction (James et al. 2021, chap. 12) |
|
Lab |
Support Vector Machine + tidymodels + recap of cross-validation (James et al. 2021, chaps. 5, 9)
|
||
Summative |
|
||
Applications | |||
🗓️ W09 | Lecture | Applications: Text as Data & Topic Modelling | |
Guest: | Prof. Ken Benoit | ||
Lab | Unsupervised Learning: Clustering (James et al. 2021, chap. 12) | ||
🗓️ W10 | Lecture | Applications: Predictive Modelling on Tabular Data | |
Lab | Unsupervised Learning: Principal Component Analysis (James et al. 2021, chap. 12) | ||
Summative |
|
||
🗓️ W11 | Lecture | Applications: Social Media Data | |
Guests: |
Sara Luxmoore, MSc in Applied Social Data Science (LSE) Anton Boychenko, MSc in Applied Social Data Science (LSE) |
||
Lab | We will explore unsupervised models using a couple of text datasets | ||
… | |||
🗓️ Jan/23 | Exam |
|
References
James, Gareth, Daniela Witten, Trevor Hastie, and Robert Tibshirani. 2021. An Introduction to Statistical Learning: With Applications in R. Second edition. Springer Texts in Statistics. New York NY: Springer. https://www.statlearning.com/.