-
cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
-
Kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
I found https://github.com/drivendata/cookiecutter-data-science as a guide, but I haven't found any repos that solve a problem end to end and actually use it. Are there any good repos or resources that exemplify how to solve a DS/ML case end to end, including any UI needed for delivery (a report, stream, dash, etc.), data handling, preprocessing, training, and local development?
We have tons of examples that follow a standard layout. Here’s one: https://github.com/ploomber/projects/tree/master/templates/ml-intermediate
For the lazy ones out there, here's the link to their GitHub repo.
Related posts
- Data science/analyst certificates for the job market?
- I have been working in this field for about 4 years now, and I have a question about how you organize your files and work. I find that no matter how organized I am, it all looks like chaos when I hand it over to someone else (for example, when transitioning to a new job). Any help is appreciated.
- Kedro
- Kendro
- Kedro – Creating reproducible, maintainable and modular data science code