SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Machine Learning Open-Source Projects
An Open Source Machine Learning Framework for EveryoneProject mention: 🔥🚀 Top 10 Open-Source Must-Have Tools for Crafting Your Own Chatbot 🤖💬 | dev.to | 2023-11-06
To get up to speed with TensorFlow, check their quickstart Support TensorFlow on GitHub ⭐
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.Project mention: Fine-Tuned Llama2 Inserting Unnecessary Delimiters | /r/LocalLLaMA | 2023-11-04
While its tough to say something specifc since we dont know how exactly you trained it or the prompt format of your training input or how you are performing inference, one thing I found when I faced similar types of issues is that the model does not know when to stop. Some of it is because the fast llama tokenizer does not add the token when encoding your inputs. So you can either add that token explicitly in your input text for each sample or use the slow llama tokenizer. Check llama_recipes github repo for the exact issue https://github.com/huggingface/transformers/issues/22794. The other most probable thing you might want to check is if the model.generate output contains the exact input tokens too. That is the expected behavior of some models (like llama2 or mpt) for example when you use vanilla transformers for inference.
Collect and Analyze Billions of Data Points in Real Time. Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge.
Tensors and Dynamic neural networks in Python with strong GPU accelerationProject mention: Diving into the Deep: My Inaugural PyTorch Contribution Adventure! | dev.to | 2023-11-24
List of Computer Science courses with video lectures.Project mention: Need advice | /r/PAK | 2023-07-12
course Computer science is very wast field the fundamental remains same, learn basic fundamentals, data structures, concepts of object oriented programming.
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for allProject mention: FLaNK Stack Weekly for 20 Nov 2023 | dev.to | 2023-11-20
Deep Learning for humansProject mention: Keras 3.0 | news.ycombinator.com | 2023-11-28
All breaking changes are listed here: https://github.com/keras-team/keras/issues/18467
You can use this migration guide to identify and fix each of these issues (and further, making your code run on JAX or PyTorch): https://keras.io/guides/migrating_to_keras_3/
scikit-learn: machine learning in PythonProject mention: Contraction Clustering (RASTER): A fast clustering algorithm | news.ycombinator.com | 2023-11-27
Learn any GitHub repo in 59 seconds. Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code. Use it for free at www.getonboard.dev.
Tesseract Open Source OCR Engine (main repository)Project mention: Marker: Convert PDF to Markdown quickly with high accuracy | news.ycombinator.com | 2023-11-30
Last update was pretty recent, and the git mentions tesseract 5 as a dep. so it's likely moved on a bit from when you last tried it:
I suppose it depends on your use-case. For personal tasks like this it should be more than sufficient, and won't need user details/cc or whatever to use it.
The world's simplest facial recognition api for Python and the command lineProject mention: GitHub - ageitgey/face_recognition: The world's simplest facial recognition api for Python and the command line | /r/Python | 2023-11-05
The Patterns of Scalable, Reliable, and Performant Large-Scale SystemsProject mention: Ask HN: What are some of the best blog posts by software engineers? | news.ycombinator.com | 2023-04-10
Deepfakes Software For AllProject mention: A beginner guide into deepfakes | dev.to | 2023-06-01
Head over to deepfakes/faceswap and install all the stuff that it asks you to do and then open the terminal with in faceswap env from anaconda.
The Julia Programming LanguageProject mention: Rust std:fs slower than Python | news.ycombinator.com | 2023-11-29
So while this "fixes" the issue, it'll introduce a confusing time delay between you freeing the memory and you observing that in `htop`.
But according to https://jemalloc.net/jemalloc.3.html you can set `opt.muzzy_decay_ms = 0` to remove the delay.
Still, the musl author has some reservations against making `jemalloc` the default:
> It's got serious bloat problems, problems with undermining ASLR, and is optimized pretty much only for being as fast as possible without caring how much memory you use.
With the above-mentioned tunables, this should be mitigated to some extent, but the general "theme" (focusing on e.g. performance vs memory usage) will likely still mean "it's a tradeoff" or "it's no tradeoff, but only if you set tunables to what you need".
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLiteProject mention: How would i go about having YOLO v5 return me a list from left to right of all detected objects in an image? | /r/computervision | 2023-11-13
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
100 Days of ML Coding
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠Project mention: Can't remember name of website that has explanations side-by-side with code | /r/learnmachinelearning | 2023-03-28
Hey are you talking about https://nn.labml.ai/ ?
Learn how to design, develop, deploy and iterate on production-grade ML applications.Project mention: [D] How do you keep up to date on Machine Learning? | /r/learnmachinelearning | 2023-08-13
Made With ML
Caffe: a fast open framework for deep learning.Project mention: List of AI-Models | /r/GPT_do_dah | 2023-05-16
Click to Learn more...
A toolkit for developing and comparing reinforcement learning algorithms.Project mention: OpenAI Acquires Global Illumination | news.ycombinator.com | 2023-08-16
A co-founder announced they disbanded their robots team a couple years ago: https://venturebeat.com/business/openai-disbands-its-robotic...
That was the same time they depreciated OpenAI Gym: https://github.com/openai/gym
What do you have against tesseract.js?
Google ResearchProject mention: Translate to and from 400+ languages locally with MADLAD-400 | /r/LocalLLaMA | 2023-11-10
Google released T5X checkpoints for MADLAD-400 a couple of months ago, but nobody could figure out how to run them. Turns out the vocabulary was wrong, but they uploaded the correct one last week.
AI-Powered Photos App for the Decentralized Web 🌈💎✨Project mention: New Release 231128-f48ff16ef ⚙️🌈 | /r/photoprism | 2023-11-30
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.Project mention: DeepSpeed-FastGen: High-Throughput for LLMs via MII and DeepSpeed-Inference | news.ycombinator.com | 2023-11-04
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Machine Learning related posts
Large Language Model Course
1 project | news.ycombinator.com | 1 Dec 2023
Is anyone using self hosted LLM day to day and training it like a new employee
4 projects | news.ycombinator.com | 30 Nov 2023
We tried injecting hallucinogenics into vision models
1 project | news.ycombinator.com | 30 Nov 2023
2 projects | news.ycombinator.com | 30 Nov 2023
Show HN: Taipy – Turns Data and AI algorithms into full web applications
5 projects | news.ycombinator.com | 30 Nov 2023
Ask HN: Which are some of the best open source courses for ML and Deep Learning?
1 project | news.ycombinator.com | 30 Nov 2023
fast.ai Book in Rust - Chapter 2 - Part 1
2 projects | dev.to | 29 Nov 2023
A note from our sponsor - #<SponsorshipServiceOld:0x00007f0f9ba122a8>
www.saashub.com | 1 Dec 2023
What are some of the best open-source Machine Learning projects? This list will help you: