What is the best structured ds project you have seen?

This page summarizes the projects mentioned and recommended in the original post on /r/datascience

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • dslp

    The Data Science Lifecycle Process is a process for taking data science teams from Idea to Value repeatedly and sustainably. The process is documented in this repo.

  • projects

    Sample projects using Ploomber. (by ploomber)

  • If you want to take a look at a full example of an organized ML project (training locally, training in Kubernetes, deploying as a microservice, packaging, unit tests). Check out this example.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • ploomber

    The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

  • It uses Ploomber which is a workflow orchestrator similar to Kedro.

  • Kedro

    Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.

  • Another one of my personal faves is Kedro. Great ETL framework made especially for data scientists.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts