SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python Numpy Projects
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
The fundamental package for scientific computing with Python.Project mention: What are the best Python libraries to learn for beginners? | reddit.com/r/learnpython | 2023-01-30
NumPy: Scientific computing library and I know this one is the most popular especially in Data Science.
Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation toolsProject mention: Need help with a data science project | reddit.com/r/learnmachinelearning | 2023-01-30
Parallel computing with task schedulingProject mention: A peek into Location Data Science at Ola | dev.to | 2022-09-26
Data scientists work on phenomenally large datasets, and Dask is a handy tool for exploration within the confines of a single cloud VM or their local PCs. Location data visualization is an essential part of deciding further algorithm development and roadmap for projects. This lays the foundation for data engineering and science to work at scale, with petabytes of data.
100 numpy exercises (with solutions)Project mention: Python numpy, pandas, matplotlib | reddit.com/r/PinoyProgrammer | 2023-01-27
haven't tried yet but you can check something like this https://github.com/rougier/numpy-100. this is for numpy, maybe there could be something for pandas or matploblib
The Unified Machine Learning FrameworkProject mention: CoreML Stable Diffusion | news.ycombinator.com | 2022-12-01
ROCm's great for data centers, but good luck finding anything about desktop GPUs on their site apart from this lone blog post: https://community.amd.com/t5/instinct-accelerators/exploring...
There's a good explanation of AMD's ROCm targets here: https://news.ycombinator.com/item?id=28200477
It's currently a PITA to get common Python libs like Numba to even talk to AMD cards (admittedly Numba won't talk to older Nvidia cards either and they deprecate ruthlessly; I had to downgrade 8 versions to get it working with a 5yo mobile workstation). YC-backed Ivy claims to be working on unifying ML frameworks in a hardware-agnostic way but I don't have enough experience to assess how well they're succeeding yet: https://lets-unify.ai
I was happy to see DiffusionBee does talk the GPU in my late-model intel Mac, though for some reason it only uses 50% of its power right now. I'm sure the situation will improve as Metal 3.0 and Vulkan get more established.
An open access book on scientific visualization using python and matplotlibProject mention: What's New in Matplotlib 3.6.0 | news.ycombinator.com | 2022-09-17
I had the same problem until I found this tutorial:
If you wan something deeper the same person has written a book:
Build time-series-based applications quickly and at scale.. InfluxDB is the Time Series Platform where developers build real-time applications for analytics, IoT and cloud-native services. Easy to start, it is available in the cloud or on-premises.
Open Machine Learning CourseProject mention: mlcourse.ai: NEW Courses - star count:8584.0 | reddit.com/r/algoprojects | 2023-02-04
NumPy aware dynamic Python compiler using LLVMProject mention: A backtester idea | reddit.com/r/algotrading | 2023-02-03
The heavy part in backtest is calculations. And they are done in pandas which is partially written in c. Also, we can use numba: https://numba.pydata.org/
Trax — Deep Learning with Clear Code and SpeedProject mention: The founder of Gmail claims that ChatGPT can “kill” Google in two years. | reddit.com/r/Futurology | 2023-01-31
But a couple years later they came out with open source implementations yeah: https://github.com/google/trax/tree/master/trax/models/reformer
NumPy & SciPy for GPUProject mention: ELI5: Why doesn't numpy work on GPUs? | reddit.com/r/learnpython | 2023-01-31
u/Spataner's answer is great. If you WANT GPU-enabled numpy functions, I would check out CuPy: https://cupy.dev/
Deep learning operations reinvented (for pytorch, tensorflow, jax and others)Project mention: Delimiter-First Code | news.ycombinator.com | 2022-12-09
A flexible framework of neural networks for deep learningProject mention: Protip: the upscaler matters a lot | reddit.com/r/StableDiffusion | 2023-01-13
Sorry maybe someone could chime in and help but I use chainer to upscale. https://github.com/chainer/chainer
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpyProject mention: Starlite development updates January ’23 | reddit.com/r/Python | 2023-01-10
In version 1.45.0, we introduced msgspec as our serialization backend, replacing orjson. This had some immediate performance benefits, but that's not the main reason we made the switch.
🍊 :bar_chart: :bulb: Orange: Interactive data analysisProject mention: Statistical Analysis software based on Python? | reddit.com/r/Python | 2023-01-28
Only thing I can think of is Orange, which has some statistics capability, but isn't its focus.
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ... (by tensorflow)Project mention: TensorFlow Datasets (TFDS): a collection of ready-to-use datasets | news.ycombinator.com | 2022-12-21
I tried Librispeech, a very common dataset for speech recognition, in both HF and TFDS.
TFDS performed extremely bad.
First it failed because the official hosting server only allows 5 simultaneous connections, and TFDS totally ignored that and makes up to 50 simultaneous downloads and that breaks. I wonder if anyone actually tested this?
Then you need to have some computer with 30GB to do the preparation, which might fail on your computer. This is where I stopped. https://github.com/tensorflow/datasets/issues/3887. It might be fixed now but it took them 8 months to respond to my issue.
On HF, it just worked. There was a smaller issue in how the dataset was split up but that is fixed now, and their response was very fast and great.
Fast data visualization and GUI tools for scientific / engineering applicationsProject mention: Use cases for PySide | reddit.com/r/Python | 2023-01-31
Image, 3D, or data visualization applications using OpenCV and the SciPy ecosystem. The Graphics View Framework can display an image and let the user interact with it, and the Python ecosystem is very rich for image processing, data analysis, and visualization. For example, LabelMe for image labeling, PyQtGraph for scientific graphics, or custom QWidget integration in Maya.
N-D labeled arrays and datasets in PythonProject mention: Request for Startups: Climate Tech | news.ycombinator.com | 2022-12-15
PyTorch and JAX are used heavily in climate science on the ML side. For more general analytics, not so much. Many of our users like to use Xarray as a high-level API. There has been some work to integrate Xarray with PyTorch (https://github.com/pydata/xarray/issues/3232) but we're not there yet.
The Python Array API standard should help align these different back-ends: https://data-apis.org/array-api/latest/
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
NLP made easyProject mention: How usable is Julia for Natural Language Processing Machine learning? | reddit.com/r/Julia | 2022-10-28
Probabilistic programming with NumPy powered by JAX for autograd and JIT compilation to GPU/TPU/CPU.
napari: a fast, interactive, multi-dimensional image viewer for pythonProject mention: Introspect type hints Pythonically in O(1) time. | reddit.com/r/Python | 2022-09-25
Much thanks to @tlambert03 – who also authors Napari, a fast multidimensional image viewer in Python you might also enjoy.
A python module for scientific analysis of 3D dataProject mention: Using Rust to speed up 3D rendering in the browser | reddit.com/r/rust | 2022-03-16
Even though it's not Rust nor browser, I'm leaving this Python library here, because I was made aware of it recently: https://github.com/marcomusy/vedo
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Numpy related posts
A backtester idea
2 projects | reddit.com/r/algotrading | 3 Feb 2023
The founder of Gmail claims that ChatGPT can “kill” Google in two years.
1 project | reddit.com/r/Futurology | 31 Jan 2023
What are the best Python libraries to learn for beginners?
7 projects | reddit.com/r/learnpython | 30 Jan 2023
#01 Benchmark of four JIT Backends
3 projects | dev.to | 31 Jan 2023
A new way to accelerate your data science workflow
1 project | reddit.com/r/Python | 27 Jan 2023
Python numpy, pandas, matplotlib
1 project | reddit.com/r/PinoyProgrammer | 27 Jan 2023
Joining the Open Source Development Course
4 projects | dev.to | 20 Jan 2023
A note from our sponsor - #<SponsorshipServiceOld:0x00007fea592a2e20>
www.saashub.com | 4 Feb 2023
What are some of the best open-source Numpy projects in Python? This list will help you: