๐Ÿ—“๏ธ Week 10 - Unstructured Data (Text, Audio, Video)

2023/24 Winter Term

Author

We have been exploring tidy, rectangular data. But now it is time to explore the challenges associated with unstructured data: text, audio and video.

The lecture will be heavily demo-based.

๐Ÿ‘จโ€๐Ÿซ Lecture Slides

Either click on the slide area below or click here to view it in fullscreen. Use your keypad to navigate the slides. You can also find a PDF version on Moodle.

Today, weโ€™ll be using a couple of demos throughout the lecture. You can download the Jupyter notebooks associated with each demo here.

  • Use the link below to download the Project Gutenberg case study materials:

  • Use the link below to download the newspapers case study materials (.zip archive containing a Jupyter notebook (.ipynb) and a serialized dataframe (.pkl) ):

๐ŸŽฅ Looking for lecture recordings? You can only find those on Moodle.

Online and AI tools for modern literature review

I will show you how to use the following tools to conduct a literature review:

Original tweet: https://twitter.com/Artifexx/status/1632277025472888833

โœ๏ธ Coursework to prepare for next weekโ€™s lecture

Within your respective assigned groups, find 2-3 examples of AI ethical issues (you can look at newspapers or academic papers for sources of examples.)

โœ๏ธ Coursework to replace this weekโ€™s (original) missed lecture material

Read the following papers:

  1. Hofman, Jake M., Amit Sharma, and Duncan J. Watts. 2017. โ€œPrediction and Explanation in Social Systems.โ€ Science 355 (6324): 486โ€“88.

  2. Rettberg, Jill Walker. 2022. โ€œAlgorithmic Failure as a Humanities Methodology: Machine Learningโ€™s Mispredictions Identify Rich Cases for Qualitative Analysis.โ€ Big Data & Society 9 (2): 205395172211312.

  3. Hullman, Jessica, Sayash Kapoor, Priyanka Nanayakkara, Andrew Gelman, and Arvind Narayanan. 2022. โ€œThe Worst of Both Worlds: A Comparative Analysis of Errors in Learning from Data in Psychology and Machine Learning.โ€ In Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society, 335โ€“48. Oxford United Kingdom: ACM.

  4. Verhagen, Mark D. 2022. โ€œA Pragmatistโ€™s Guide to Using Prediction in the Social Sciences.โ€ Socius: Sociological Research for a Dynamic World 8 (January): 237802312210817.

And answer the following questions about each of the readings:

  • What is the main idea of the article?
  • What are the main takeaways?
  • What are the main implications of the article?

Feel free to work in pairs or groups of three on this homework.

๐Ÿ“Ÿ Communication

  • Post your reflections, questions, and links on Slack.
  • Book office hours if you want to discuss your coursework with either me or Riya.