πŸ—“οΈ Week 09 - Unstructured Data (Text, Audio, Video)

2023/24 Autumn Term

So far, we have been exploring tidy, rectangular data. Now it is time to look at the challenges associated with unstructured data: text, audio and video.

The lecture will be heavily demo-based.

πŸ‘¨β€πŸ« Lecture Slides

Either click on the slide area below or click here to view the slides in fullscreen. Use your keyboard to navigate the slides. You can also find a PDF version on Moodle.

Today, we’ll be using a couple of demos throughout the lecture. You can download the Jupyter notebooks associated with each demo here.

  • Use the link below to download the Project Gutenberg case study materials:

  • Use the link below to download the newspapers case study materials (.zip archive containing a Jupyter notebook (.ipynb) and a serialized dataframe (.pkl)):
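
If you have not worked with a .zip archive or a .pkl file before, the sketch below shows one possible way to unpack the materials and load the serialized dataframe. It is only an illustration: the file and folder names in the usage comments are hypothetical placeholders (use whatever names the downloaded files actually have), and it assumes the .pkl file was written with pandas. Only unpickle files that come from a source you trust, since loading a pickle can execute arbitrary code.

```python
import zipfile
from pathlib import Path


def extract_materials(archive_path, target_dir):
    """Unpack a .zip archive and return the extracted file paths grouped by extension."""
    target = Path(target_dir)
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(target)  # creates target_dir if it does not exist
        names = archive.namelist()

    found = {}
    for name in names:
        if name.endswith("/"):  # skip directory entries
            continue
        suffix = Path(name).suffix.lower()
        found.setdefault(suffix, []).append(target / name)
    return found


def load_dataframe(pickle_path):
    """Load a pickled dataframe (assumes the file was saved with pandas)."""
    import pandas as pd  # imported here so the extraction step works without pandas

    return pd.read_pickle(pickle_path)


# Example usage (hypothetical file and folder names):
# files = extract_materials("newspapers.zip", "newspapers")
# df = load_dataframe(files[".pkl"][0])
# df.head()
```

The notebook itself (.ipynb) can then be opened from the extracted folder with Jupyter as usual.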

🎥 Looking for lecture recordings? You can only find those on Moodle.

🆘 Drop-in session

Don’t forget there is a drop-in session scheduled for tomorrow (i.e. Tuesday 21st November) between 4.30pm and 6pm. It is there to answer any questions you might have about the formative essay, as well as any other topic you’re struggling with (e.g. Quarto, programming, etc.). So, see you tomorrow at COL 1.06!

⌛ Deadline Approaching

Head to πŸ“ Formative 03 page to find out more about next week’s formative essay. The essay is due on November 30th.

📟 Communication

  • Post your reflections, questions, and links on Slack.
  • Book office hours if you want to discuss your coursework with either me or Garima.