๐ป Week 09 - Class Roadmap (90 min)
2022/23 Lent Term
๐ Learning Objectives
You will learn how to:
Note: we will be talking about classification, a supervised learning approach. This means that we will be using a dataset with a response variable (also known as the target variable) to predict the response variable (a class) for new observations.
๐ฃ๏ธ Roadmap
โ๏ธ Setup (~ 10 min)
๐ฏ ACTION POINTS:
- Go to Moodle and download the files for this weekโs lab.
- Open RStudio and create a new project.
- Add the files you download to the folder of your project.
- Open
DS101L_2022_23_W09_lab.Rmd
in RStudio. - Run the code chunks in the โ๏ธ Setup section to load the libraries you need for this lab.
Ask your colleagues and tutor for help if you get stuck.
1๏ธโฃ Load Data (~ 15 min)
๐งโ๐ซ INSTRUCTOR NOTES:
- Your instructor will explain what is in the data.
- Since we will take a supervised learning approach, your instructor will briefly explain what the input and output variables are.
๐ฏ ACTION POINTS:
Follow the action points in the markdown file.
2๏ธโฃ Training vs Test (~35 min)
๐จโ๐ซ TEACHING POINT:
- Your instructor will tell you about a common strategy to validate Machine Learning models: training versus test splits.
๐ฏ ACTION POINTS:
Follow the action points in the markdown file. You will learn how to create a training and test split and what a confusion matrix looks like.
There are also several points of discussion in the markdown file. You will discuss these points with your colleagues and tutor.
3๏ธโฃ Metrics (~ 30 min)
๐จโ๐ซ TEACHING POINT:
Each cell in the confusion matrix can be identified with a name. Your instructor will tell you about:
- False Positives
- False Negatives
- True Positives
- True Negatives
Take a look at this table of metrics together.
๐ฅ WORKING TOGETHER IN PAIRS:
Try to work out the solution to the questions posed in the markdown file. You can discuss these questions with your colleagues and tutor.
๐จโ๐ซ TEACHING POINT:
- Your instructor will explain what is good and bad about the metrics of this classification model!
๐กTake-home exercise
There is a take-home exercise in the markdown file. Can you answer the questions posed in there?
If you canโt work out the answers to the questions, send us a message on Slack!