Week 03 - Day 04: Wrap up: managing your data pipeline
Get help with the course material
Managing your data pipeline
We will review the folder structure enforced in this course, and why it is important to keep your data pipeline organized.
- Why separate raw vs. processed (clean) data?
- Why separate data from code?
- How can you ensure your work is replicable? (version control, README.md, requirements.txt, neat Jupyter notebooks)
- How can you make the most of Git/GitHub?
- The importance of communicating your ideas as clearly as possible.
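To make the raw vs. processed separation concrete, here is a minimal sketch in Python. The folder names (`data/raw`, `data/processed`, `code`) and the helper functions `project_paths` and `clean_raw_file` are assumptions made for illustration. They are not necessarily the exact structure enforced in this course. The idea is that the raw file is only ever read, and the cleaned copy is written somewhere else.

```python
import csv
from pathlib import Path


def project_paths(root):
    """Return a hypothetical project layout (folder names are assumptions)."""
    root = Path(root)
    return {
        "raw": root / "data" / "raw",              # untouched source files
        "processed": root / "data" / "processed",  # cleaned outputs
        "code": root / "code",                     # scripts and notebooks
    }


def clean_raw_file(root, filename):
    """Read a CSV from data/raw and write a cleaned copy to data/processed.

    Cleaning here is deliberately simple: strip whitespace from every cell
    and drop rows that are completely empty. The raw file is never modified.
    """
    paths = project_paths(root)
    paths["processed"].mkdir(parents=True, exist_ok=True)

    src = paths["raw"] / filename
    dst = paths["processed"] / filename

    with src.open(newline="", encoding="utf-8") as f_in, \
            dst.open("w", newline="", encoding="utf-8") as f_out:
        writer = csv.writer(f_out)
        for row in csv.reader(f_in):
            cleaned = [cell.strip() for cell in row]
            if any(cleaned):  # skip rows where every cell is empty
                writer.writerow(cleaned)

    return dst
```

Because the cleaning step lives in code and the raw file stays as downloaded, anyone can delete `data/processed` and rebuild it by rerunning the script, which is one practical way to keep the pipeline replicable.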
🦸🏻 Super tech support
I will be around to help you with any questions you might have about the course material, and with any issues you might be facing while working on your final project.
Get help with:
- Your specific scraping needs
- Selenium
- Databases
- Merging data from multiple tables
- Data visualization
- Creating markdown websites
- Git/GitHub
HUGE THANKS TO ALL OF YOU for your hard work and dedication throughout this course!