[P] Open data transformations in Python, no SQL required

This page summarizes the projects mentioned and recommended in the original post on reddit.com/r/MachineLearning

Our great sponsors
  • Scout APM - Less time debugging, more time building
  • JetBrains - Developer Ecosystem Survey 2022
  • SonarLint - Deliver Cleaner and Safer Code - Right in Your IDE of Choice!
  • RasgoQL

    Write python locally, execute SQL in your data warehouse

    You can check it out here: https://github.com/rasgointelligence/RasgoQL

  • fugue

    A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask without any rewrites.

    This looks similar to fugue, am I right? How do they compare?

  • Scout APM

    Less time debugging, more time building. Scout APM allows you to find and fix performance issues with no hassle. Now with error monitoring and external services monitoring, Scout is a developer's best friend when it comes to application development.

  • ploomber

    The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

    Yeah, I fully agree with you. SQL has many disadvantages (although many good things as well). In any case, I'm not advocating for SQL, it's just what I've seen recently. I'm a Python fan building tools for data analysis in Python so I hope this SQL trend doesn't go too far as in "let's only do SQL" :)

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts