π§βπ« Week 01 Lecture
Welcome to DS205 + the ASCOR Dataset
Last Updated: 20 January 15:40 - Added TPI presentation slides and Nuvolos guides.
Welcome to DS205, your gateway to professional-grade data engineering capabilities.
π Session Details
- Date: Monday, 20 January 2025
- Time: 10:00 am - 12:00 pm
- Location: KSW.1.01
- Guests: Sylvan + Valentin from TPI
β οΈ IMPORTANT As this is a new course, we will be learning and adapting as we go. If you notice something that is not working super well or if you have any suggestions, please let me know!
π£οΈ Lecture Structure
In our first ever DS205 lecture, weβll cover the following:
Course Overview: Understand the core objectives and themes of DS205 and how TPI will serve as real stakeholders for us. I will also discuss the structure and timing of our assessments. Those who joined this course from DS105 will find that it will be fairly similar.
Introduction to TPI and ASCOR: Sylvan and Valentin from TPI will introduce their work and they will do a special βdeep diveβ into ASCOR. This dataset will be our primary focus throughout the course. We will discuss the datasetβs structure, its relevance to TPIβs mission, and how what we learn in this course will be directly applicable to their work.
Hands-on with the ASCOR data: We will then apply key pandas functions to real ASCOR data. This should serve as a brief recap of pandas and a way to bridge theory with practice.
π¬ Lecture Slides
Here you will find the slides used during the lecture, both mine and the ones presented by Sylvan and Valentin.
Jonβs Slides
Use keyboard arrows to navigate. Select the slides below or view fullscreen.
π TPI Centre Presentation
Click on the button below to grab the slides presented by Sylvan and Valentin.
Brief Practical Demonstration
During the final 30 minutes of the session, I covered:
- VS Code environment setup
- ASCOR website navigation
- Jupyter Notebook data exploration
Final Thoughts
After we heard from Sylvan and Valentin, I demonstrated how we use VS Code in this course, I showed you around the ASCOR website and I started to explore one of the files from this dataset on a Jupyter Notebook.
Youβll practice these skills in Tuesdayβs π» W01 Lab with Alex and I will post solutions to the lab afterwards, too.
π Essential Actions
- Join our Slack 1 - our primary communication hub.
- Access your dedicated cloud development environment:
- Review the π Syllabus and the βοΈ Assessment Structure.
- Prepare for Tuesdayβs π» W01 Lab
Development Environment
We will be writing a lot of code throughout this course and we will be using VS Code and Jupyter notebooks for most of our work. We have a dedicated Nuvolos workspace for you to use. However, if you prefer to work locally on your own machine, you will have to make sure you have a few tools installed.
Here are the two options:
Option 1: Nuvolos Platform
Access our Nuvolos - First Time workspace. This cloud environment comes pre-configured with all required tools.
Option 2: Local Setup
If you prefer working locally, ensure you have:
- Python 3.10+
- VS Code
- GitHub CLI
- Git (installed and configured)
π₯ Session Recording
Typically, the recordings are made available on Moodle in the afternoon. However, the lecture recording was nowhere to be found! I have asked Eden Digital for help. It might be that new Moodle courses need to be linked somewhere internally in some LSE systems and it wasnβt active yet.
Iβll keep you posted.
Jon 20 Jan 2025, 2.04pm
Footnotes
The Slack link is private and only available on Moodleβ©οΈ