DS105 Data for Data Science 🖥️ 🤹
9/30/22
“[…] a field of study and practice that involves the collection, storage, and processing of data in order to derive important 💡 insights into a problem or a phenomenon.
Such data may be generated by humans (surveys, logs, etc.) or machines (weather data, road vision, etc.),
and could be in different formats (text, audio, video, augmented or virtual reality, etc.).”
knows everything about statistics
is able to communicate insights perfectly
understands businesses like no one else
is a fluent computer programmer
We are all jugglers 🤹
It is often said that 80% of the time and effort spent on a data science project goes to the tasks highlighted above.
And this is what this course is about! You will learn some of the most common tools used during this process.
Python
GitHub!
Use GitHub for everything related to your project!
Important
Don’t share code via e-mail, Dropbox, Google Drive or anything like that!
It is bad practice, as things get messy very quickly.
Tip: Data is Plural
Data is Plural is a weekly newsletter run by BuzzFeed’s data editor 🧑 Jeremy Singer-Vine. People send him interesting, funny, or odd datasets, and he shares them in the newsletter. See the newsletter’s website and the Google Doc listing its datasets.