SaaSHub helps you find the best software and product alternatives Learn more โ
Top 23 Python Machinelearning Projects
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second ๐
-
clearml
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
-
awesome-open-gpt
Collection of Open Source Projects Related to GPT๏ผGPT็ธๅ ณๅผๆบ้กน็ฎๅ้๐ใ็ฒพ้๐ฅ๐ฅ
-
igel
a delightful machine learning tool that allows you to train, test, and use models without writing code
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
retentioneering-tools
Retentioneering: product analytics, data-driven CJM optimization, marketing analytics, web analytics, transaction analytics, graph visualization, process mining, and behavioral segmentation in Python. Predictive analytics over clickstream, AB tests, machine learning, and Markov Chain simulations.
-
covalent
Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments. (by AgnostiqHQ)
-
CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
-
Machine-Learning-Guide
Machine learning Guide. Learn all about Machine Learning Tools, Libraries, Frameworks, Large Language Models (LLMs), and Training Models.
-
zoofs
zoofs is a python library for performing feature selection using a variety of nature-inspired wrapper algorithms. The algorithms range from swarm-intelligence to physics-based to Evolutionary. It's easy to use , flexible and powerful tool to reduce your feature size.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: Show HN: Toolkit for LLM Fine-Tuning, Ablating and Testing | news.ycombinator.com | 2024-04-07This is a great project, little bit similar to https://github.com/ludwig-ai/ludwig, but it includes testing capabilities and ablation.
questions regarding the LLM testing aspect: How extensive is the test coverage for LLM use cases, and what is the current state of this project area? Do you offer any guarantees, or is it considered an open-ended problem?
Would love to see more progress toward this area!
Extract from awesome-open-gpt
We (Marqo) are doing a lot on 1 and 2. There is a huge amount to be done on the ML side of vector search and we are investing heavily in it. I think it has not quite sunk in that vector search systems are ML systems and everything that comes with that. I would love to chat about 1 and 2 so feel free to email me (email is in my profile). What we have done so far is here -> https://github.com/marqo-ai/marqo
Project mention: Fast Llama 2 on CPUs with Sparse Fine-Tuning and DeepSparse | news.ycombinator.com | 2023-11-23Interesting company. Yannic Kilcher interviewed Nir Shavit last year and they went into some depth: https://www.youtube.com/watch?v=0PAiQ1jTN5k DeepSparse is on GitHub: https://github.com/neuralmagic/deepsparse
In order to try to solve this issue, NannyML was created. NannyML is an open-source Python library designed in order to make it easy to monitor drift in the distributions of our model input variables and estimate our model performance (even without labels!) thanks to the Confidence-Based Performance Estimation algorithm they developed. But first of all, why do models need to be monitored and why their performance might vary over time?
Project mention: Help Needed: Converting PlantNet-300k Pretrained Model Weights from Tar to h5 Format Help | /r/learnpython | 2023-06-09It's almost certainly a pickled pytorch model so you will first need to load it using pytorch and then write it out to h5 (legacy keras format) with https://github.com/gmalivenko/pytorch2keras.
Project mention: Show HN: Custom Action Recognition with ActionAI | news.ycombinator.com | 2023-09-23
Pretty interesting request, if SSH is not used, i would try using something like dask which uses tcp to connect and execute assuming your workers are in another machine.I also think something like covalent can be used to extend your own custom plugin in their ecosystem to connect how you want. We have a very custom private plugin written on top of covalent's to have a custom protocol to connect our central on-prem GPU machines to our local laptops that is rpc based, mostly for high performance as well as some mandate security from where the GPU machines are. Once done it is pretty much something like
Python Machinelearning related posts
-
Ask HN: Is there any good semantic search GUI for images or documents?
-
Fast Llama 2 on CPUs with Sparse Fine-Tuning and DeepSparse
-
It was not "Good First Issue"
-
Show HN: Marqo โ Vectorless Vector Search
-
Ask HN: Which Vector Database do you recommend for LLM applications?
-
๐ธ Anime app: I need your help
-
Help me improve The Prompt Index website
-
A note from our sponsor - SaaSHub
www.saashub.com | 10 May 2024
Index
What are some of the best open-source Machinelearning projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | horovod | 13,969 |
2 | ludwig | 10,845 |
3 | vaex | 8,178 |
4 | clearml | 5,279 |
5 | awesome-open-gpt | 5,092 |
6 | marqo | 4,152 |
7 | igel | 3,080 |
8 | deepsparse | 2,881 |
9 | tslearn | 2,786 |
10 | nannyml | 1,759 |
11 | nsfw_model | 1,614 |
12 | pytorch2keras | 846 |
13 | retentioneering-tools | 766 |
14 | ActionAI | 724 |
15 | covalent | 698 |
16 | LiuAlgoTrader | 672 |
17 | MetaSpore | 629 |
18 | CodeRL | 475 |
19 | Machine-Learning-Guide | 441 |
20 | deep-significance | 316 |
21 | hydra-zen | 284 |
22 | yolo-hand-detection | 259 |
23 | zoofs | 236 |
Sponsored