New DS here. Where can I learn best practices for organizing a project, folder structure, BASH scripting/scheduling, etc?

This page summarizes the projects mentioned and recommended in the original post on reddit.com/r/datascience.

  • dslp

    The Data Science Lifecycle Process is a process for taking data science teams from Idea to Value repeatedly and sustainably. The process is documented in this repo.

    As an addendum to u/GryffinLoL, I’d add this resource if you’re using any kind of VCS tooling. It has some solid suggestions.

  • missing-semester

    The Missing Semester of Your CS Education 📚

    https://missing.csail.mit.edu This one will teach you general computer skills such as navigating the command line, bash scripting, cron scheduling, and Makefiles, all of which will be very useful to you.

  • govcookiecutter

    A cookiecutter template for data science projects within His Majesty's Government and wider public sector.

    For testing your statistical assumptions and performance, govcookiecutter is worth a look for its integration of agile practices and baked-in unit testing. See: https://github.com/best-practice-and-impact/govcookiecutter
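
To make the folder-structure question concrete, here is a minimal sketch of what a project template automates. The directory names in `DEFAULT_DIRS` and the `scaffold_project` helper are assumptions chosen for illustration; they are not the layout that govcookiecutter or any other project listed above actually generates.

```python
from pathlib import Path

# Hypothetical layout for illustration only; real templates define their own.
DEFAULT_DIRS = (
    "data/raw",        # untouched input data
    "data/processed",  # cleaned or derived data
    "notebooks",       # exploratory analysis
    "src",             # reusable project code
    "tests",           # unit tests
    "scripts",         # shell or scheduling scripts
)


def scaffold_project(root, dirs=DEFAULT_DIRS):
    """Create the directories under root and return them as a list of Paths."""
    root = Path(root)
    root.mkdir(parents=True, exist_ok=True)
    created = []
    for rel in dirs:
        path = root / rel
        # exist_ok=True makes the function safe to run more than once.
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    readme = root / "README.md"
    if not readme.exists():
        # Only write a stub README when none is present.
        readme.write_text(f"# {root.name}\n")
    return created
```

Calling `scaffold_project("my_ds_project")` would create the folders above plus a stub `README.md`. A dedicated template tool is likely the better choice for real work, since it can also set up testing and documentation.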

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.
