LSE ME204
Data Engineering for the Social World
Page Update Notice:
We are currently updating this page with the latest course information. Expect a full version by the start of Summer School - Session 2 (14 July 2025).
🎯 Course Overview
ME204 teaches you to collect, clean, and analyse real-world data using Python. Over three weeks, you’ll learn to work with messy data and transform it into insights.
The course focuses on practical data engineering skills: collecting data through APIs, cleaning datasets with pandas, creating visualizations, and using professional workflows including Git and GitHub. You’ll work on projects using real data and build a website to showcase your findings.
What you can expect to learn:
By the end of this course, you will be able to:
🌐 Collect data from real websites and APIs using Python
🧹 Clean and preprocess messy real-world data into analysis-ready formats
🗄️ Apply SQL fundamentals and best practices for data storage
📊 Create effective visualizations for exploratory data analysis
⚙️ Use professional workflows including Git, GitHub, and Generative AI tools
🚀 Build and present a complete data project with web-based reporting
👥 Meet Your Teaching Team

Name: Dr Jon Cardoso-Silva
Role at LSE: Assistant Professor (Education), LSE Data Science Institute (at LSE since 2021)
Likes to think about: How Generative AI is influencing the way we learn [1]
COURSE LEADER | LECTURER
Dr Stuart Bramwell
DPhil in Politics (Oxford University)
Teaches: afternoon classes
CLASS TEACHER
❓ Questions?
📧 Email us using the contact details above
🏢 Office Hours available during the course
Last updated: 10 July 2025
Footnotes
1. Read about the GENIAL project