βœ… Week 05 - Checklist

DS202 - Data Science for Social Scientists

Author
Published

24 October 2022

Important

Keep in mind that after the lecture at the end of this week, on Friday 28 October 2022, we will release the Summative Problem Set 01. This is the first summative assessment of this course and it is worth 20% of your final grade. You will have until 9 November 2022 β€” Wednesday of Week 07 β€” to submit your solutions via Moodle.

Here is a suggestion of how to program your week in relation to this course:

If your lab is on Monday

If your lab is on Monday:

  • πŸ“₯ Download: Before or once you arrive at the classroom, download the DS202_2022MT_w05_lab_rmark.Rmd file that contains the lab roadmap (under πŸ—“οΈ Week 05 - Non-linear algorithms/W04 Lab Files section on Moodle). Or browse the webpage version here.

  • πŸ’» Participate: Actively engage with the material in the lab. Ask your class teacher for help if anything is unclear. Work with others whenever possible and take notes of theoretical concepts or practical coding skill you might want to revisit later in the week.

  • πŸ“™ Read: Find some time to read (James et al. 2021, chap. 5) and reinforce your theoretical understanding of resampling methods; the textbook is available online for free.

    • In the lecture/workshop last week, we only explored training/test splits and the idea of cross-validation; in the lab, you will have a chance to explore another technique called the bootstrap.
    • As you go through the text, try to connect what you read to the things you heard about in the lecture/workshop or the examples you explored in the lab.
  • ✍️ Solve: There is still time to submit your solutions to the Formative Problem Set 01, the deadline is Tuesday 25 October 23:59 UK time.

  • 🏫 Attend the lecture: This week, you will learn of two new algorithms: Support Vector Machine and Decision Tree. You will need to use these algorithms in the first summative that will be released on Friday.

If your lab is on Friday

If your lab is on Friday:

  • ✍️ Solve: There is still time to submit your solutions to the Formative Problem Set 01, the deadline is Tuesday 25 October 23:59 UK time.

  • πŸ“™ Read: Find some time to read (James et al. 2021, chap. 5) and reinforce your theoretical understanding of resampling methods; the textbook is available online for free.

    • In the lecture/workshop last week, we only explored training/test splits and the idea of cross-validation; in the lab, you will have a chance to explore another technique called the bootstrap.
    • As you go through the text, try to connect what you read to the things you heard about in the lecture/workshop or the examples you explored in the lab.
  • πŸ“₯ Download: Before or once you arrive at the classroom, download the DS202_2022MT_w05_lab_rmark.Rmd file that contains the lab roadmap (under πŸ—“οΈ Week 05 - Non-linear algorithms/W04 Lab Files section on Moodle). Or browse the webpage version here.

  • πŸ’» Participate: Actively engage with the material in the lab. Ask your class teacher for help if anything is unclear. Work with others whenever possible and take notes of theoretical concepts or practical coding skill you might want to revisit later in the week.

  • 🏫 Attend the lecture: This week, you will learn of two new algorithms: Support Vector Machine and Decision Tree. You will need to use these algorithms in the first summative that will be released on Friday.

Extra:

References

James, Gareth, Daniela Witten, Trevor Hastie, and Robert Tibshirani. 2021. An Introduction to Statistical Learning: With Applications in R. Second edition. Springer Texts in Statistics. New York NY: Springer. https://www.statlearning.com/.