LSE ME204

Data Engineering for the Social World

Page Update Notice:

We are currently updating this page with the latest course information. Expect a full version by the start of Summer School - Session 2 (14 July 2025).

🎯 Course Overview

ME204 teaches you to collect, clean, and analyse real-world data using Python. Over three weeks, you’ll learn to work with messy data and transform it into insights.

The course focuses on practical data engineering skills: collecting data through APIs, cleaning datasets with pandas, creating visualizations, and using professional workflows including Git and GitHub. You’ll work on projects using real data and build a website to showcase your findings.

What you can expect to learn:

By the end of this course, you will be able to:

🌐 Collect data from real websites and APIs using Python

🧹 Clean and preprocess messy real-world data into analysis-ready formats

🗄️ Apply SQL fundamentals and best practices for data storage

📊 Create effective visualizations for exploratory data analysis

⚙️ Use professional workflows including Git, GitHub, and Generative AI tools

🚀 Build and present a complete data project with web-based reporting


👥 Meet Your Teaching Team

Name: Dr Jon Cardoso-Silva
Links: LSE, GitHub, LinkedIn, 📧
Role at LSE: Assistant Professor (Education)
LSE Data Science Institute
At LSE since 2021
Background:
  • PhD in Computer Science (King’s College London)
  • Former roles: Tech Lead, Data Scientist, Software Engineer
Likes to think about: How Generative AI is influencing the way we learn [1].

COURSE LEADER | LECTURER

Dr Stuart Bramwell
DPhil in Politics (Oxford University)
Teaches: afternoon classes
📧

CLASS TEACHER

❓ Questions?

📧 Email us using the contact details above
🏢 Office hours are available during the course

Last updated: 10 July 2025

Footnotes

  1. Read about the GENIAL project