Why is Python used so much more in data science than a more efficient language like Java or Rust?

This page summarizes the projects mentioned and recommended in the original post on /r/java

  • NumPy

    The fundamental package for scientific computing with Python.

    A lot of data science work in Python is built on NumPy. If you look at the GitHub repo, about 30% of the code is C: https://github.com/numpy/numpy

  • tensorflow

    An Open Source Machine Learning Framework for Everyone

    If you want to look at machine learning, about 60% of TensorFlow is written in C++: https://github.com/tensorflow/tensorflow

  • examples

    TensorFlow examples (by tensorflow)

    Numba plugs into the LLVM JIT API, TensorFlow is written in C++, the PyTorch core is C++, and NumPy integrates BLAS libraries written in C and Fortran.

  • PyTorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

  • Numba

    NumPy aware dynamic Python compiler using LLVM

  • FrameworkBenchmarks

    Source for the TechEmpower Framework Benchmarks project

  • bjoern

    A screamingly fast Python 2/3 WSGI server written in C.

    The component that handles the lower-level work is separate from the application itself, so you can pick a performant server such as bjoern to host a WSGI application such as Django. Bjoern is 60% C, similar to NumPy and TensorFlow, which I mentioned before.

  • fastapi

    FastAPI framework, high performance, easy to learn, fast to code, ready for production

    For an end-to-end solution, FastAPI has in my experience been more than quick enough for the APIs I write. If I need an actual website (rendered HTML), I'd typically just grab Django because it's easy.
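
The split described in the bjoern comment above, where the server and the application are separate pieces joined by the WSGI interface, can be sketched with the standard library alone. This is only an illustration under stated assumptions, not how bjoern or Django are implemented: `app`, `call_app`, and `serve_once` are hypothetical names, and `wsgiref` stands in for a faster server such as bjoern.

```python
from wsgiref.simple_server import make_server
from wsgiref.util import setup_testing_defaults


def app(environ, start_response):
    """A tiny WSGI application; any WSGI-compliant server can host it."""
    body = f"Hello from {environ['PATH_INFO']}".encode("utf-8")
    start_response("200 OK", [
        ("Content-Type", "text/plain; charset=utf-8"),
        ("Content-Length", str(len(body))),
    ])
    return [body]


def call_app(application, path="/"):
    """Hypothetical helper: play the server's role for a single request."""
    environ = {}
    setup_testing_defaults(environ)  # fill in a minimal valid WSGI environ
    environ["PATH_INFO"] = path
    captured = {}

    def start_response(status, headers, exc_info=None):
        # A real server would write these to the socket; here we record them.
        captured["status"] = status
        captured["headers"] = headers

    body = b"".join(application(environ, start_response))
    return captured["status"], body


def serve_once(host="127.0.0.1", port=8000):
    """Host `app` on the reference server and handle exactly one request.

    Replacing this function with a different WSGI server would not
    require any change to `app`.
    """
    with make_server(host, port, app) as httpd:
        httpd.handle_request()
```

Calling `call_app(app, "/ping")` returns `("200 OK", b"Hello from /ping")`. The application never touches sockets or HTTP parsing, which is why the server underneath it can be written in C without the Python code changing.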

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.
