Python Numpy

Open-source Python projects categorized as Numpy

Top 23 Python Numpy Projects

  1. Pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Project mention: Must-Know 2025 Developer’s Roadmap and Key Programming Trends | dev.to | 2025-02-05

    Python’s Growth in Data Work and AI: Python continues to lead because of its easy-to-read style and the huge number of libraries available for tasks from data work to artificial intelligence. Tools like TensorFlow and PyTorch make it a must-have. Whether you’re experienced or just starting, Python’s clear style makes it a good choice for diving into machine learning. Actionable Tip: If you’re new to Python, try projects that combine data with everyday problems. For example, build a simple recommendation system using Pandas and scikit-learn.

  2. Nutrient

    Nutrient – The #1 PDF SDK Library, trusted by 10K+ developers. Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrient’s SDK handles billion-page workloads - so you don’t have to debug PDFs. Used by ~1 billion end users in more than 150 different countries.

    Nutrient logo
  3. 30-Days-Of-Python

    30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw

    Project mention: 17 Best GitHub Repositories to Learn Python | dev.to | 2025-02-06

    30-Days-Of-Python

  4. NumPy

    The fundamental package for scientific computing with Python.

    Project mention: Building an AI-powered Financial Data Analyzer with NodeJS, Python, SvelteKit, and TailwindCSS - Part 0 | dev.to | 2025-01-31

    The AI Service will be built using aiohttp (asynchronous Python web server) and integrates PyTorch, Hugging Face Transformers, numpy, pandas, and scikit-learn for financial data analysis.

  5. data-science-ipython-notebooks

    Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  6. datasets

    🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

    Project mention: 20 Open Source Tools I Recommend to Build, Share, and Run AI Projects | dev.to | 2024-11-13

    Datasets library repository for accessing and sharing datasets with the community.

  7. ivy

    Convert Machine Learning Code Between Frameworks

  8. Dask

    Parallel computing with task scheduling

    Project mention: Ask HN: What's the right tool for this job? | news.ycombinator.com | 2024-07-20

    From what I've seen, there are sort of two paths. I'll provide a well known example from each.

    1. lang specific distributed task library

    For example, in Python, celery is a pretty popular task system. If you (the dev) are the one doing all the code and running the workflows, it might work well for you. You build the core code and functions, and it handles the processing and resource stuff with a little config.

    * https://github.com/celery/celery

    Or lower level:

    * https://github.com/dask/dask

    2. DAG Workflow systems

    There are also whole systems for what you're describing. They've gotten especially popular in the ML ops and data engineering world. A common one is AirFlow:

    * https://github.com/apache/airflow

  9. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  10. numpy-100

    100 numpy exercises (with solutions)

  11. scientific-visualization-book

    An open access book on scientific visualization using python and matplotlib

    Project mention: Scientific Visualization: Python and Matplotlib, by Nicolas Rougier | news.ycombinator.com | 2024-09-17
  12. Numba

    NumPy aware dynamic Python compiler using LLVM

    Project mention: CuPy: NumPy and SciPy for GPU | news.ycombinator.com | 2024-09-20

    I'm surprised to see pytorch and Jax mentioned as alternatives but not numba : https://github.com/numba/numba

    I've recently had to implement a few kernels to lower the memory footprint and runtime of some pytorch function : it's been really nice because numba kernels have type hints support (as opposed to raw cupy kernels).

  13. mlcourse.ai

    Open Machine Learning Course

  14. cupy

    NumPy & SciPy for GPU

    Project mention: CuPy: NumPy and SciPy for GPU | news.ycombinator.com | 2024-09-20
  15. einops

    Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

    Project mention: Einops Rocks | news.ycombinator.com | 2024-11-15
  16. trax

    Trax — Deep Learning with Clear Code and Speed

    Project mention: Maxtext: A simple, performant and scalable Jax LLM | news.ycombinator.com | 2024-04-23

    Is t5x an encoder/decoder architecture?

    Some more general options.

    The Flax ecosystem

    https://github.com/google/flax?tab=readme-ov-file

    or dm-haiku

    https://github.com/google-deepmind/dm-haiku

    were some of the best developed communities in the Jax AI field

    Perhaps the “trax” repo? https://github.com/google/trax

    Some HF examples https://github.com/huggingface/transformers/tree/main/exampl...

    Sadly it seems much of the work is proprietary these days, but one example could be Grok-1, if you customize the details. https://github.com/xai-org/grok-1/blob/main/run.py

  17. tensorboardX

    tensorboard for pytorch (and chainer, mxnet, numpy, ...)

  18. orjson

    Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

    Project mention: Web scraping of a dynamic website using Python with HTTP Client | dev.to | 2024-09-28

    The library already has support for an HTTP client that allows bypassing Cloudflare - CurlImpersonateHttpClient. Since we have to work with JSON responses we could use parsel_crawler added in version 0.3.0, but I think this is excessive for such tasks, besides I like the high speed of orjson.. Therefore, we'll need to implement our crawler rather than using one of the ready-made ones.

  19. chainer

    A flexible framework of neural networks for deep learning

  20. orange

    🍊 :bar_chart: :bulb: Orange: Interactive data analysis

    Project mention: Hierarchical Clustering | news.ycombinator.com | 2024-04-20

    I know I've tooted its horn before, but Orange3 is a pretty neat Python-based GUI platform that makes this and a metric buttload of other statistical/ML techniques available to non-programmer types.

    Just watch out for null character `x00` in the corpus. That always seems to kill it stone dead.

    https://orangedatamining.com/

    https://orange3.readthedocs.io/projects/orange-visual-progra...

  21. datasets

    TFDS is a collection of datasets ready to use with TensorFlow, Jax, ... (by tensorflow)

  22. PyQtGraph

    Fast data visualization and GUI tools for scientific / engineering applications

  23. xarray

    N-D labeled arrays and datasets in Python

    Project mention: Spectral Imaging Made Easy: A Powerful Python Library | news.ycombinator.com | 2024-12-23

    Interesting - I'm curious whether you feel that Xarray covers these use cases already?

    https://xarray.dev/

    Especially as I've said before that Hyperspy shares so many features in common with Xarray that Hyperspy should just use Xarray under the hood.

    https://github.com/hyperspy/hyperspy/discussions/3405

  24. mars

    Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.

  25. numpyro

    Probabilistic programming with NumPy powered by JAX for autograd and JIT compilation to GPU/TPU/CPU.

    Project mention: Numpyro: Probabilistic programming with NumPy powered by Jax | news.ycombinator.com | 2024-11-16
  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Numpy discussion

Log in or Post with

Python Numpy related posts

  • Building an AI-powered Financial Data Analyzer with NodeJS, Python, SvelteKit, and TailwindCSS - Part 0

    7 projects | dev.to | 31 Jan 2025
  • PyTorch 2.6.0 Release

    1 project | news.ycombinator.com | 29 Jan 2025
  • Responsible Innovation: Open Source Best Practices for Sustainable AI

    1 project | dev.to | 27 Jan 2025
  • Making VLLM work on WSL2

    1 project | dev.to | 17 Jan 2025
  • Experiments with Byte Matrix Multiplication

    2 projects | news.ycombinator.com | 10 Jan 2025
  • Back To Basics: git

    1 project | dev.to | 27 Dec 2024
  • Deep Dive: Meta's AI Infrastructure and Developer Tools - A Technical Analysis

    1 project | dev.to | 27 Dec 2024
  • A note from our sponsor - SaaSHub
    www.saashub.com | 12 Feb 2025
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source Numpy projects in Python? This list will help you:

# Project Stars
1 Pytorch 86,719
2 30-Days-Of-Python 44,314
3 NumPy 28,739
4 data-science-ipython-notebooks 27,837
5 datasets 19,560
6 ivy 14,008
7 Dask 12,909
8 numpy-100 12,406
9 scientific-visualization-book 10,774
10 Numba 10,190
11 mlcourse.ai 9,910
12 cupy 9,739
13 einops 8,706
14 trax 8,159
15 tensorboardX 7,910
16 orjson 6,545
17 chainer 5,894
18 orange 4,974
19 datasets 4,352
20 PyQtGraph 3,969
21 xarray 3,701
22 mars 2,712
23 numpyro 2,383

Sponsored
Nutrient – The #1 PDF SDK Library, trusted by 10K+ developers
Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrient’s SDK handles billion-page workloads - so you don’t have to debug PDFs. Used by ~1 billion end users in more than 150 different countries.
www.nutrient.io

Did you know that Python is
the 2nd most popular programming language
based on number of references?