β Week 10 - Checklist
DS202 - Data Science for Social Scientists
Comprehension Check
By the end of the week, you should be able to:
- Reflect on how to pre-process data using
tidyverse
so as to create a dataset of frequency counts - Use tidyverseβs
mutate
,group_by
andsummarise
to aggregate data when needed (Useful Refs: R for Data Science | 5 Data transformation & R for Data Science | 12 Tidy Data) - Reflect on the impact of missing data on a dataset
- Run PCA on a dataset
- Explain the rationale behind PCAβs new features (how are they correlated? how do you find out how the original features form them?)
Time Management Tips
Here is a suggestion of how to program your week in relation to this course:
If your lab is on Monday
If your lab is on Monday:
On Monday:
π₯ Download: Before or once you arrive at the classroom, download the DS202_2022MT_w10_lab_rmark.Rmd file that contains the lab roadmap (under ποΈ Week 10 section on Moodle). Or browse the webpage version here.
π» Participate: Actively engage with the material in the lab. Ask your class teacher for help if anything is unclear. Work with others whenever possible and take notes of theoretical concepts or practical coding skill you might want to revisit later in the week.
π€ Drop-in Session: If you have any final questions about the Summative Problem Set 02, attend the Drop-in Session at CBG 2.05 on Monday 28 November, 2pm-4pm.
Tuesday to Thursday
π Read: Find some time to read (James et al. 2021, sec. 12.2) and reinforce your theoretical understanding of Principal Component Analysis; it is a very short section.
π Practice the tutorials: Dedicate some time to learn about
recipes
(W10 bonus content) and to practice the tutorials linked on W10 lab.- Here is another content you might find useful: Dimensionality Reduction from the Tidy Modelling with R book.
- Practice
group_by
&summarise
: R for Data Science | 5 Data transformation - Practice reshaping data when needed: R for Data Science | 12 Tidy Data)
πΊ Watch short videos: During the week, I will release short videos further explaining Principal Component Analysis. Try to reserve some time to watch those.
Friday
π« Attend the lecture: This week, Dr. Stuart Bramwell will join us on the second half of the lecture to talk about his research.
βοΈ Get ready for the Summative 03: the final problem set, Summative Problem Set 03, will be released this Friday 2 December.
The deadline is Thursday, 15 December 2022, 11:59 PM but better not to leave it to the last minute!
How to practice: W09 & W10 labs + the tutorials linked in the π Practice the tutorials section.
Any time
π You know the drill. Share your questions on the
#week10
channel in our Slack group.πWant to talk to someone else about this course? Try reaching out to your course representatives,
@Zhang Ruishan (Yoyo)
or@Rachitha Raghuram
.
If your lab is on Friday
If your lab is on Friday:
Monday
- π€ Drop-in Session: If you have any final questions about the Summative Problem Set 02, attend the Drop-in Session at CBG 2.05 on Monday 28 November, 2pm-4pm.
Monday - Thursday:
π Read: Find some time to read (James et al. 2021, sec. 12.2) and reinforce your theoretical understanding of Principal Component Analysis; it is a very short section.
πΊ Watch short videos: During the week, I will release short videos further explaining Principal Component Analysis. Try to reserve some time to watch those.
Friday
π₯ Download: Before or once you arrive at the classroom, download the DS202_2022MT_w10_lab_rmark.Rmd file that contains the lab roadmap (under ποΈ Week 10 section on Moodle). Or browse the webpage version here.
π» Participate: Actively engage with the material in the lab. Ask your class teacher for help if anything is unclear. Work with others whenever possible and take notes of theoretical concepts or practical coding skill you might want to revisit later in the week.
π« Attend the lecture: This week, Dr. Stuart Bramwell will join us on the second half of the lecture to talk about his research.
βοΈ Get ready for the Summative 03: the final problem set, Summative Problem Set 03, will be released this Friday 2 December.
The deadline is Thursday, 15 December 2022, 11:59 PM but better not to leave it to the last minute!
How to practice: W09 & W10 labs + the tutorials linked in the π Practice the tutorials section.
Some time early next week
- π Practice the tutorials: Dedicate some time to learn about
recipes
(W10 bonus content) and to practice the tutorials linked on W10 lab.- Here is another content you might find useful: Dimensionality Reduction from the Tidy Modelling with R book.
- Practice
group_by
&summarise
: R for Data Science | 5 Data transformation - Practice reshaping data when needed: R for Data Science | 12 Tidy Data)
Any time
π You know the drill. Share your questions on the
#week10
channel in our Slack group.πWant to talk to someone else about this course? Try reaching out to your course representatives,
@Zhang Ruishan (Yoyo)
or@Rachitha Raghuram
.