π§βπ« Week 04 Lecture
Exploratory Data Analysis with pandas
π Schedule
πLocation: Thursday 24 October 2024, 4 pm - 6 pm at CLM.5.02
4:00 pm - 4:30 pm: I will create a repo on GitHub for todayβs lecture and discuss the Git Rituals first introduced in the π W04 Formative - Task 2
4:30 pm - 5:00 pm: Exploring the capabilities of
pandas
, Pythonβs main data manipulation library, and comparing it to lists and dictionaries5.00 pm - 5.10pm: πββοΈββ‘οΈQuick breakπββοΈ
5:10 pm - 6:00 pm: Curiosity-driven exploratory data analysis: I will write code to answer your data questions.
π Preparation
- Try to complete the π W04 Formative Exercise before the lecture.
π Lecture Notes
π TAKE NOTE:
You wonβt find βslides for studyingβ in this course. I do use slides in my lectures, but they serve as a visual aid to help me organise my thoughts. I tend to post those slides after the lecture on Slack, along with other links and resources.
Let me know if you want me to add notes on any specific topic or expand on something you might want to revisit later.
I created the lse-ds105/ds105a-2024 repository live in the lecture and added an initial folder with data and two Jupyter Notebooks.
How to clone this repository
Go to the repository page
Fork it! Click on the
Fork
button in the top right corner of the page. This will create a copy of the repository but under your account (so you can make mistakes as you learn without breaking the original).
- Configure the forked repository
- You should see the repository under your account now.
Rename or delete the old repository
If you had already cloned the original repository, you need to rename or delete it before cloning the new one, to avoid conflicts.
Whether you are on Nuvolos or your own computer:
- If you took notes on the repository yesterday that you want to keep, rename the folder from
ds105a-2024
to something likeds105a-2024-old
. If you want to do this via the Terminal, run the commandmv ds105a-2024 ds105a-2024-old
on bash, zsh or the Powershell - Otherwise, itβs safe to delete the folder. If you want to do this via the Terminal, run the command
rm -rf ds105a-2024
on bash or zsh. If you are on Powershell, runrmdir ds105a-2024
(Press βAβ when asked)
- If you took notes on the repository yesterday that you want to keep, rename the folder from
Just give me the files
If you prefer, you can download the files as they were before we started the lecture:
π¨ NOTE: NB02 is empty here, as it was created live in the lecture. For the up-to-date version of the notebook, refer to the GitHub repository.