β Week 06 - Checklist
DS202 - Data Science for Social Scientists
Here is a suggestion of how to program your week in relation to this course:
Revisit the code in π» Week 04 - Lab Roadmap & βοΈ Week 04 - Lab Solutions to reinforce your understanding of logistic regression and that of training/test split.
- In there, you will also find code to run an alternative classifier: Naive Bayes
Read the text on the π¨βπ« Week 04 - Lecture page to understand more about different training/test splits and also of k-fold cross validation.
The RMarkdown used in this lecture/workshop can be found on Moodle, under π₯W04 Lecture Files
Bookmark the page with π¨βπ« Week 05 - Lecture slides or π₯ download the PDFs
- In there, you will find code to run alternative algorithms: Decision Tree and Support Vector Machine (SVM)
Try to solve the βοΈ Summmative Problem Set (01) | W05-W07 by yourself.
Read the useful links about tidyverse (below)
Questions? Attend one of the drop-in sessions (below) or ask questions on #week06 channel on Slack
Day | Time | Classroom | Teacher |
---|---|---|---|
Tue 1 Nov 2022 | 10 a.m. - 12 p.m. | FAW 3.02 | Jon (Focus on R programming) |
Wed 2 Nov 2022 | 10 a.m. - 12 p.m. | FAW 3.02 | Mustafa |
Fri 4 Nov 2022 | 10 a.m. - 12 p.m. | FAW 1.02 | Yijun |
Useful Links about tidyverse
tidyverse
is a set of R packages that have several functions and facilities for working with data. I find tidyverse
more intuitive than base R, and thereβs an entire book available for free online (R for Data Science) that contains a lot of helpful tutorials about tidyverse. Let me point to a few specific chapters:
You might need or want to transform data when working on your problem sets. Check out Chapter 5 of R for Data Science online book.
Data visualization is another helpful skill. You can learn a bit more about ggplot, the tidyverse way of making plots, in Chapter 3 of R for Data Science online book.
What should you be looking for when working with data? Check Chapter 7 to learn the basics of exploratory data analysis.
Do you get confused about R Markdown, the idea of βknittingβ a file? Then read Chapter 27.
(More advanced) Do you want to learn how to reshape data or deal with more complex data manipulation? Then have a look at Chapter 12 - Tidy data and Chapter 13 - Iteration.
Are you already familiar with tidyverse, but you constantly need to Google how to do things? Save these cheatsheets to your computer.