LSE DS205 - Advanced Data Manipulation

Author
Image created with the AI embedded in MS Designer using the prompt 'abstract green and blue icon depicting the advanced stages of data wrangling, API design, and scalable pipelines for sustainability-focused data engineering.'

📑 Course Brief

Focus: master advanced data manipulation techniques for real-world data.

How: a blend of live coding, hands-on exercises, and a collaborative group project in partnership with the Transition Pathway Initiative (TPI) Centre.

🎯 Learning Objectives

  • Use pandas and similar libraries to manipulate and analyse complex datasets effectively.
  • Implement REST APIs to enable structured and user-friendly access to data.
  • Develop expertise in web scraping tools such as Scrapy and Selenium to collect data from the web.
  • Learn and apply best practices for GitHub workflows, including version control, pull requests, and collaboration.
  • Understand and work with databases and integration techniques for seamless data querying and manipulation.
  • Apply NLP techniques and modern AI tools to process unstructured data.
  • Build retrieval-augmented generation (RAG) pipelines for data applications.
  • Deliver production-ready data pipelines that meet real-world stakeholder needs.

Select Academic Year/Term: