SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python Numpy Projects
-
Project mention: Must-Know 2025 Developer’s Roadmap and Key Programming Trends | dev.to | 2025-02-05
Python’s Growth in Data Work and AI: Python continues to lead because of its easy-to-read style and the huge number of libraries available for tasks from data work to artificial intelligence. Tools like TensorFlow and PyTorch make it a must-have. Whether you’re experienced or just starting, Python’s clear style makes it a good choice for diving into machine learning. Actionable Tip: If you’re new to Python, try projects that combine data with everyday problems. For example, build a simple recommendation system using Pandas and scikit-learn.
-
Nutrient
Nutrient – The #1 PDF SDK Library, trusted by 10K+ developers. Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrient’s SDK handles billion-page workloads - so you don’t have to debug PDFs. Used by ~1 billion end users in more than 150 different countries.
-
30-Days-Of-Python
30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw
30-Days-Of-Python
-
Project mention: Building an AI-powered Financial Data Analyzer with NodeJS, Python, SvelteKit, and TailwindCSS - Part 0 | dev.to | 2025-01-31
The AI Service will be built using aiohttp (asynchronous Python web server) and integrates PyTorch, Hugging Face Transformers, numpy, pandas, and scikit-learn for financial data analysis.
-
data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
-
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Project mention: 20 Open Source Tools I Recommend to Build, Share, and Run AI Projects | dev.to | 2024-11-13Datasets library repository for accessing and sharing datasets with the community.
-
-
From what I've seen, there are sort of two paths. I'll provide a well known example from each.
1. lang specific distributed task library
For example, in Python, celery is a pretty popular task system. If you (the dev) are the one doing all the code and running the workflows, it might work well for you. You build the core code and functions, and it handles the processing and resource stuff with a little config.
* https://github.com/celery/celery
Or lower level:
* https://github.com/dask/dask
2. DAG Workflow systems
There are also whole systems for what you're describing. They've gotten especially popular in the ML ops and data engineering world. A common one is AirFlow:
* https://github.com/apache/airflow
-
CodeRabbit
CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
-
-
scientific-visualization-book
An open access book on scientific visualization using python and matplotlib
Project mention: Scientific Visualization: Python and Matplotlib, by Nicolas Rougier | news.ycombinator.com | 2024-09-17 -
I'm surprised to see pytorch and Jax mentioned as alternatives but not numba : https://github.com/numba/numba
I've recently had to implement a few kernels to lower the memory footprint and runtime of some pytorch function : it's been really nice because numba kernels have type hints support (as opposed to raw cupy kernels).
-
-
-
einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
-
Project mention: Maxtext: A simple, performant and scalable Jax LLM | news.ycombinator.com | 2024-04-23
Is t5x an encoder/decoder architecture?
Some more general options.
The Flax ecosystem
https://github.com/google/flax?tab=readme-ov-file
or dm-haiku
https://github.com/google-deepmind/dm-haiku
were some of the best developed communities in the Jax AI field
Perhaps the “trax” repo? https://github.com/google/trax
Some HF examples https://github.com/huggingface/transformers/tree/main/exampl...
Sadly it seems much of the work is proprietary these days, but one example could be Grok-1, if you customize the details. https://github.com/xai-org/grok-1/blob/main/run.py
-
-
Project mention: Web scraping of a dynamic website using Python with HTTP Client | dev.to | 2024-09-28
The library already has support for an HTTP client that allows bypassing Cloudflare - CurlImpersonateHttpClient. Since we have to work with JSON responses we could use parsel_crawler added in version 0.3.0, but I think this is excessive for such tasks, besides I like the high speed of orjson.. Therefore, we'll need to implement our crawler rather than using one of the ready-made ones.
-
-
I know I've tooted its horn before, but Orange3 is a pretty neat Python-based GUI platform that makes this and a metric buttload of other statistical/ML techniques available to non-programmer types.
Just watch out for null character `x00` in the corpus. That always seems to kill it stone dead.
https://orangedatamining.com/
https://orange3.readthedocs.io/projects/orange-visual-progra...
-
-
-
Project mention: Spectral Imaging Made Easy: A Powerful Python Library | news.ycombinator.com | 2024-12-23
Interesting - I'm curious whether you feel that Xarray covers these use cases already?
https://xarray.dev/
Especially as I've said before that Hyperspy shares so many features in common with Xarray that Hyperspy should just use Xarray under the hood.
https://github.com/hyperspy/hyperspy/discussions/3405
-
mars
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
-
numpyro
Probabilistic programming with NumPy powered by JAX for autograd and JIT compilation to GPU/TPU/CPU.
Project mention: Numpyro: Probabilistic programming with NumPy powered by Jax | news.ycombinator.com | 2024-11-16 -
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Numpy discussion
Python Numpy related posts
-
Building an AI-powered Financial Data Analyzer with NodeJS, Python, SvelteKit, and TailwindCSS - Part 0
-
PyTorch 2.6.0 Release
-
Responsible Innovation: Open Source Best Practices for Sustainable AI
-
Making VLLM work on WSL2
-
Experiments with Byte Matrix Multiplication
-
Back To Basics: git
-
Deep Dive: Meta's AI Infrastructure and Developer Tools - A Technical Analysis
-
A note from our sponsor - SaaSHub
www.saashub.com | 12 Feb 2025
Index
What are some of the best open-source Numpy projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | Pytorch | 86,719 |
2 | 30-Days-Of-Python | 44,314 |
3 | NumPy | 28,739 |
4 | data-science-ipython-notebooks | 27,837 |
5 | datasets | 19,560 |
6 | ivy | 14,008 |
7 | Dask | 12,909 |
8 | numpy-100 | 12,406 |
9 | scientific-visualization-book | 10,774 |
10 | Numba | 10,190 |
11 | mlcourse.ai | 9,910 |
12 | cupy | 9,739 |
13 | einops | 8,706 |
14 | trax | 8,159 |
15 | tensorboardX | 7,910 |
16 | orjson | 6,545 |
17 | chainer | 5,894 |
18 | orange | 4,974 |
19 | datasets | 4,352 |
20 | PyQtGraph | 3,969 |
21 | xarray | 3,701 |
22 | mars | 2,712 |
23 | numpyro | 2,383 |