DS105 2025-2026 Autumn Term Icon

πŸ“” Syllabus

DS105A (2025/26 Autumn Term)

Author

Dr Jon Cardoso-Silva

Modified

14 October 2025

Check this page every week for updates on course content and requirements.

πŸ–₯️ Week 01 29 Sep 2025
-
03 Oct 2025

Setup and First Data Exploration

Reserve 1-2 hours to set up and interact with the platforms of communication (Slack) and for coding (Nuvolos) we will use throughout the course as well as to experience the kind of data analysis you will be doing in the near future.

You’ll use a Jupyter notebook to explore yearly heatwave counts, then compare your step-by-step process with an AI-only approach to understand when each method works best.

Note

No need to install anything!

All assessed work in this course can be done on Nuvolos Cloud with VS Code preconfigured. You do not need to install anything on your machine unless you prefer to. If you do, we will provide an optional local setup note later in the course, but Nuvolos remains the default.

Click here to see how to get help this week

We love hearing from you! Truly! Don’t hesitate to contact us for help.

Here are the best ways to get help with your DS105A studies this week:

  • Slack: Post questions or comments to the #help channel at any time. Jon monitors regularly throughout the week, with dedicated support from Riya (Monday 6-8pm) and Tabby (Tuesday 5-7pm).

  • Office Hours: Need 1-to-1 support? Book a 15-minute slot via StudentHub by searching for our names:

    • Tabtim Duenger (Tabby): Tuesday & Wednesday 9-10am (online)
    • Jonathan Cardoso-Silva (Jon): Wednesday 2-5pm (in-person, COL.1.03)
    • Riya Chhikara: Friday 9-10am (in-person, DSI visualisation studio COL.1.06)
  • Drop-in Sessions: Pedro hosts drop-in sessions Wednesday 2-4pm at DSI visualisation studio (COL.1.06). No booking required, but please confirm attendance via e-mail to Kevin ().

  • Admin: For class changes, extensions, or other administrative queries, email Kevin ().

From Data to Insight: The Importance of Clean Data

Introduction to course goals and assessment. Reveal solution to the formative. Discuss the AI diagnostic: when AI helps versus when it fails. Notebooks as a professional tool.

Exploring Daily Temperatures with pandas

Build on Week 01’s yearly heatwave data by exploring raw daily temperature records. Learn to load CSV files into pandas, apply basic DataFrame operations like filtering and sorting, and connect daily observations to the summary data you worked with in practice.

πŸ–₯️ Week 02 06 Oct 2025
-
10 Oct 2025

Python Foundations & File Management

Complete foundational Python lessons through DataQuest, focusing on variables, data types, and collections. Learn to organise your workspace with proper folder structures and document your learning using Markdown formatting in Jupyter notebooks.

Tip

Actively post questions to Slack or attend support sessions if you get stuck or confused about any of the tasks.

Click here to see how to get help this week

There is a way to get help every single day of the week. Don’t hesitate to contact us for help. We love hearing from you!

Here are the best ways to get help with your DS105A studies this week:

  • Slack: Post questions or comments to the #help channel at any time. Jon monitors regularly throughout the week, with dedicated support from Riya (Monday 6-8pm) and Tabby (Tuesday 5-7pm).

  • Office Hours: Book a 15-minute slot via StudentHub by searching for our names:

    • Tabtim Duenger (Tabby): Tuesday & Wednesday 9-10am (online)
    • Jonathan Cardoso-Silva (Jon): Tuesday 2-5pm (in-person, COL.1.03)
    • Riya Chhikara: Friday 9-10am (in-person, DSI visualisation studio COL.1.06)
  • Drop-in Sessions: Pedro hosts drop-in sessions Wednesday 2-4pm at DSI visualisation studio (COL.1.06). No booking required, but please confirm attendance via e-mail to Kevin ().

  • Admin: For class changes, extensions, or other administrative queries, email Kevin ().

From Tables to Live Data: pandas and one API

Connect your DataQuest Python foundations to real data analysis. Explore how collections (lists and dictionaries) relate to DataFrames from Week 01. See a live API demonstration with weather data and preview next week’s JSON work.

From Collections to DataFrames

Practice connecting the Python collections you learned in DataQuest (lists and dictionaries) to pandas DataFrame operations. Apply bracket notation and indexing skills to real data analysis tasks.

πŸ–₯️ Week 03 13 Oct 2025
-
17 Oct 2025

Terminal Navigation and Python Control Flow

Play Shell-it game to learn file system navigation, practise Terminal commands for professional workflows, and complete DataQuest lessons on loops and conditionals. Create your first notebook using Terminal commands, explore different file formats, and document your learning journey.

Note

Warm-up: Shell-it (paths & terminals)

Play β€œShell-it: The London Episode” at least 10 times: https://shell-it.vercel.app/. Spend one minute reflecting on paths and directories before opening the Nuvolos terminal. This helps connect mental models of locations with real filesystem navigation.

Tip

Actively post questions to Slack or attend support sessions if you get stuck or confused about any of the tasks.

Click here to see how to get help this week

There is a way to get help every single day of the week. Don’t hesitate to contact us for help. We love hearing from you!

Here are the best ways to get help with your DS105A studies this week:

  • Slack: Post questions or comments to the #help channel at any time. Jon monitors regularly throughout the week, with dedicated support from Riya (Monday 6-8pm) and Tabby (Tuesday 5-7pm).

  • Office Hours: Book a 15-minute slot via StudentHub by searching for our names:

    • Tabtim Duenger (Tabby): Tuesday & Wednesday 9-10am (online)
    • Jonathan Cardoso-Silva (Jon): Wednesday 2-5pm (in-person, COL.1.03)
    • Riya Chhikara: Friday 9-10am (in-person, DSI visualisation studio COL.1.06)
  • Drop-in Sessions: Pedro hosts drop-in sessions Wednesday 2-4pm at DSI visualisation studio (COL.1.06) CANCELLED this week. Rescheduled to Monday 12:30-14:30 (Week 04) to provide additional support for the πŸ“ W04 Practice exercise. No booking required, but please confirm attendance by responding to the calendar invite sent by Kevin.

  • Admin: For extensions or other administrative queries, email Kevin ( )

Operating Systems, File I/O, and Git Setup

Understand how operating systems organise files and why paths matter for reproducible code. Learn about environment variables and their role in software. See a live demonstration of collecting API data and saving to JSON and CSV files. After the break, hands-on Git setup session where you create your personal repository and practise the basic Git workflow.

Control Flow Practice with Temperature Data

Apply loops and conditionals from DataQuest to real temperature analysis. Fetch July 2024 weather data from Open-Meteo API, save to JSON files, then use control flow to categorise hot days. Practise the complete workflow you’ll need for πŸ“ W04 Practice, connecting API data collection with Python analysis skills.

πŸ–₯️ Week 04 20 Oct 2025
-
24 Oct 2025

London Heatwave Analysis

Build your first complete data pipeline: fetch historical temperature data from Open-Meteo for London (1990-2025), transform the JSON data into a usable format, and create a heatwave summary table showing yearly counts.

Tip

This assignment will help you practice for the upcoming summative assessment. Actively post questions to Slack or attend support sessions if you get stuck.

Mini-Project 1 (20%) - Weather Data Analysis

Instructions will be released this week. Deadline: Week 06 Thursday 8 pm.

Click here to see how to get help this week

There is a way to get help every single day of the week. Don’t hesitate to contact us for help. We love hearing from you!

Here are the best ways to get help with your DS105A studies this week:

  • Slack: Post questions or comments to the #help channel at any time. Jon monitors regularly throughout the week, with dedicated support from Riya (Monday 6-8pm) and Tabby (Tuesday 5-7pm).

  • Office Hours: Book a 15-minute slot via StudentHub by searching for our names:

    • Tabtim Duenger (Tabby): Tuesday & Wednesday 9-10am (online)
    • Jonathan Cardoso-Silva (Jon): Wednesday 2-5pm (in-person, COL.1.03)
    • Riya Chhikara: Friday 9-10am (in-person, DSI visualisation studio COL.1.06)
  • Drop-in Sessions: Pedro hosts two drop-in sessions this week:

    • Monday, 20 October, 12:30-14:30 (rescheduled from Week 03) - additional support for the W04 Practice exercise
    • Wednesday, 22 October, 14:00-16:00 (regular session)

    Both sessions are at DSI visualisation studio (COL.1.06). No booking required, but please confirm attendance by responding to the calendar invite sent by Kevin.

  • Admin: For extensions or other administrative queries, email Kevin ( )

Working with DataFrames

Understand why tidy, reproducible steps matter for data analysis. Learn how Python collections (lists and dictionaries) connect to pandas DataFrames, and practice essential operations like selection, grouping, and creating simple plots.

Data Quality and Transformation Workshop

Work with real-world messy data to understand when to clean versus when to investigate inconsistencies. Practice applying transformations to datasets and learn to document your data quality decisions clearly.

πŸ–₯️ Week 05 27 Oct 2025
-
31 Oct 2025

Click here to see how to get help this week

There is a way to get help every single day of the week. Don’t hesitate to contact us for help. We love hearing from you!

Here are the best ways to get help with your DS105A studies this week:

  • Slack: Post questions or comments to the #help channel at any time. Jon monitors regularly throughout the week, with dedicated support from Riya (Monday 6-8pm) and Tabby (Tuesday 5-7pm).

  • Office Hours: Book a 15-minute slot via StudentHub by searching for our names:

    • Tabtim Duenger (Tabby): Tuesday & Wednesday 9-10am (online)
    • Jonathan Cardoso-Silva (Jon): Wednesday 2-5pm (in-person, COL.1.03)
    • Riya Chhikara: Friday 9-10am (in-person, DSI visualisation studio COL.1.06)
  • Drop-in Sessions: Pedro hosts drop-in sessions Wednesday 2-4pm at DSI visualisation studio (COL.1.06). No booking required, but please confirm attendance via e-mail to Kevin ().

  • Admin: For extensions or other administrative queries, email Kevin ( )

Data Transformation and Visualisation Design

Master pandas transformations including .apply() with custom functions in Python (the def operator) and .groupby() aggregations. Learn the principles of seaborn visualisation design and standards for communication of insights.

Visual Communication Workshop

Practice creating multiple chart types from the same data, learn to justify your choices for different audiences, and develop skills in writing compelling narrative titles.

πŸ–₯️ Week 06 03 Nov 2025
-
07 Nov 2025

Focus on Mini-Project 1. Support sessions available.

Submit your Mini-Project 1 (20%) via GitHub until Thursday 8 pm.

πŸ–₯️ Week 07 10 Nov 2025
-
14 Nov 2025

Click here to see how to get help this week

There is a way to get help every single day of the week. Don’t hesitate to contact us for help. We love hearing from you!

Here are the best ways to get help with your DS105A studies this week:

  • Slack: Post questions or comments to the #help channel at any time. Jon monitors regularly throughout the week, with dedicated support from Riya (Monday 6-8pm) and Tabby (Tuesday 5-7pm).

  • Office Hours: Book a 15-minute slot via StudentHub by searching for our names:

    • Tabtim Duenger (Tabby): Tuesday & Wednesday 9-10am (online)
    • Jonathan Cardoso-Silva (Jon): Wednesday 2-5pm (in-person, COL.1.03)
    • Riya Chhikara: Friday 9-10am (in-person, DSI visualisation studio COL.1.06)
  • Drop-in Sessions: Pedro hosts drop-in sessions Wednesday 2-4pm at DSI visualisation studio (COL.1.06). No booking required, but please confirm attendance via e-mail to Kevin ().

  • Admin: For extensions or other administrative queries, email Kevin ( )

Advanced Data Reshaping and Databases

melt, pivot, multi-index operations. SQLite introduction. When to use databases versus files. Basic SQL queries.

Reshaping Workshop and Database Setup

Tidy data principles and database design concepts. Transform wide and long datasets, set up SQLite, practise SELECT, WHERE, GROUP BY, and compare pandas versus SQL approaches.

Mini-Project 2 (30%) - London Travel Time Analysis

Instructions will be released this week. Deadline: Week 10 Wednesday 8 pm.

πŸ–₯️ Week 08 17 Nov 2025
-
21 Nov 2025

Click here to see how to get help this week

There is a way to get help every single day of the week. Don’t hesitate to contact us for help. We love hearing from you!

Here are the best ways to get help with your DS105A studies this week:

  • Slack: Post questions or comments to the #help channel at any time. Jon monitors regularly throughout the week, with dedicated support from Riya (Monday 6-8pm) and Tabby (Tuesday 5-7pm).

  • Office Hours: Book a 15-minute slot via StudentHub by searching for our names:

    • Tabtim Duenger (Tabby): Tuesday & Wednesday 9-10am (online)
    • Jonathan Cardoso-Silva (Jon): Wednesday 2-5pm (in-person, COL.1.03)
    • Riya Chhikara: Friday 9-10am (in-person, DSI visualisation studio COL.1.06)
  • Drop-in Sessions: Pedro hosts drop-in sessions Wednesday 2-4pm at DSI visualisation studio (COL.1.06). No booking required, but please confirm attendance via e-mail to Kevin ().

  • Admin: For extensions or other administrative queries, email Kevin ( )

Multi-Source Integration and Complex Joins

Joining multiple JSON-derived tables, strategies for stable keys, deduplication, and handling rate limits with caching to data/raw. Using lookup services when you genuinely need a key.

Integration Patterns and Error Handling

Joining at the right grain, avoiding accidental cartesian products, and retry and backoff for flaky endpoints. Integrate two or three sources into one tidy table, validate join keys, and implement basic caching.

πŸ–₯️ Week 09 24 Nov 2025
-
28 Nov 2025

Click here to see how to get help this week

There is a way to get help every single day of the week. Don’t hesitate to contact us for help. We love hearing from you!

Here are the best ways to get help with your DS105A studies this week:

  • Slack: Post questions or comments to the #help channel at any time. Jon monitors regularly throughout the week, with dedicated support from Riya (Monday 6-8pm) and Tabby (Tuesday 5-7pm).

  • Office Hours: Book a 15-minute slot via StudentHub by searching for our names:

    • Tabtim Duenger (Tabby): Tuesday & Wednesday 9-10am (online)
    • Jonathan Cardoso-Silva (Jon): Wednesday 2-5pm (in-person, COL.1.03)
    • Riya Chhikara: Friday 9-10am (in-person, DSI visualisation studio COL.1.06)
  • Drop-in Sessions: Pedro hosts drop-in sessions Wednesday 2-4pm at DSI visualisation studio (COL.1.06). No booking required, but please confirm attendance via e-mail to Kevin ().

  • Admin: For extensions or other administrative queries, email Kevin ( )

Designing a Good Database Schema and Visualisation Refinement

Relational design patterns, keys and constraints, and how schema choices impact analysis. Revisiting message-first visualisation with more complex tables.

From Tables to Communication

Aligning schema with analysis questions and refining visualisations for clarity and honesty. Refine plots and narratives to support your findings.

πŸ–₯️ Week 10 01 Dec 2025
-
05 Dec 2025

Click here to see how to get help this week

There is a way to get help every single day of the week. Don’t hesitate to contact us for help. We love hearing from you!

Here are the best ways to get help with your DS105A studies this week:

  • Slack: Post questions or comments to the #help channel at any time. Jon monitors regularly throughout the week, with dedicated support from Riya (Monday 6-8pm) and Tabby (Tuesday 5-7pm).

  • Office Hours: Book a 15-minute slot via StudentHub by searching for our names:

    • Tabtim Duenger (Tabby): Tuesday & Wednesday 9-10am (online)
    • Jonathan Cardoso-Silva (Jon): Wednesday 2-5pm (in-person, COL.1.03)
    • Riya Chhikara: Friday 9-10am (in-person, DSI visualisation studio COL.1.06)
  • Drop-in Sessions: Pedro hosts drop-in sessions Wednesday 2-4pm at DSI visualisation studio (COL.1.06). No booking required, but please confirm attendance via e-mail to Kevin ().

  • Admin: For extensions or other administrative queries, email Kevin ( )

Advanced SQL and Data Pipeline Design

Subqueries and window functions (illustrative). ETL concepts and basic performance considerations.

Pipeline Design Workshop

Production-like data workflows and scalability considerations. Design a multi-stage pipeline, optimise SQL queries, and practise advanced joins and aggregations.

Submit your Mini-Project 2 (30%) via GitHub until Wednesday 8 pm.

Group Project (50%)

Instructions will be released this week. Due next term (Winter Term Week 03 for Autumn cohorts). The exact date will be provided on the group project page.

Assessment breakdown: 10% group presentation (Week 11), 30% project quality, 10% individual contribution.

πŸ–₯️ Week 11 08 Dec 2025
-
12 Dec 2025

Click here to see how to get help this week

There is a way to get help every single day of the week. Don’t hesitate to contact us for help. We love hearing from you!

Here are the best ways to get help with your DS105A studies this week:

  • Slack: Post questions or comments to the #help channel at any time. Jon monitors regularly throughout the week, with dedicated support from Riya (Monday 6-8pm) and Tabby (Tuesday 5-7pm).

  • Office Hours: Book a 15-minute slot via StudentHub by searching for our names:

    • Tabtim Duenger (Tabby): Tuesday & Wednesday 9-10am (online)
    • Jonathan Cardoso-Silva (Jon): Wednesday 2-5pm (in-person, COL.1.03)
    • Riya Chhikara: Friday 9-10am (in-person, DSI visualisation studio COL.1.06)
  • Drop-in Sessions: Pedro hosts drop-in sessions Wednesday 2-4pm at DSI visualisation studio (COL.1.06). No booking required, but please confirm attendance via e-mail to Kevin ().

  • Admin: For extensions or other administrative queries, email Kevin ( )

Team Collaboration and Technical Communication

Branching workflows, merge conflicts, and technical documentation standards.

Group Project Presentations (10%)

5-minute group presentations on your project plans. Peer feedback session and establishing team workflows for the Winter Term project work.