SaaSHub helps you find the best software and product alternatives
Top 23 Deep Learning Open-Source Projects
-
Project mention: Build, Innovate & Collaborate: Setting Up TensorFlow for Open Source Contribution! | dev.to | 2024-11-03
Fork & clone tensorflow repo. Set up development environment for contribution. Push and create pull request.
-
Project mention: Show HN: Voice-Pro - AI Voice Cloning Magic: Transform Any Voice in 15 Seconds | news.ycombinator.com | 2024-11-27
-
AI Music Generation: Love this new Space built by @enzostvs + @Xenova for Transformers.js: Generate your own AI music (in-browser generation) with AI Jukebox. https://huggingface.co/spaces/enzostvs/ai-jukebox
Kolmogorov Arnold Networks: Transformers are not all we need, that is being proven repeatedly now as more alternative frameworks emerge. Another such framework is Kolmogorov Arnold Network based Transformers. I break down exactly how these differ from Perceptron based Transformers and give you the link to my Colab where I create a model based on the research paper that absolutely destroys a standard Transformers based model. Check out the video here: https://www.youtube.com/watch?v=Sw0euxNZCc4
GitHub Issue 8771: This issue is just a treasure! A bit deprecated I guess, but things are in their historical context. (Personally, I still need more to understand better.) https://github.com/huggingface/transformers/issues/8771 🫡 to the man @stas
-
Project mention: Deprecating PyTorch's official Anaconda channel | news.ycombinator.com | 2024-12-06
-
Project mention: 20 Open Source Tools I Recommend to Build, Share, and Run AI Projects | dev.to | 2024-11-13
OpenCV is an open-source computer vision and machine learning software library that allows users to perform various ML tasks, from processing images and videos to identifying objects, faces, or handwriting. Besides object detection, this platform can also be used for complex computer vision tasks like Geometry-based monocular or stereo computer vision.
-
-
Star on GitHub ⭐ - Keras
-
nn
60+ Implementations/tutorials of deep learning papers with side-by-side notes; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), reinforcement learning (ppo, dqn), capsnet, distillation, ...
-
Project mention: ChatGPT unexpectedly began speaking in a user's cloned voice during testing | news.ycombinator.com | 2024-08-11
-
Project mention: faceswap VS facefusion - a user suggested alternative | libhunt.com/r/faceswap | 2024-01-30
-
There are several implementations of the YOLO algorithm available, but for ease-of-use, we will use the Ultralytics implementation in this guide. We will implement and test the code locally and then deploy to Koyeb's GPUs for higher inference speed.
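As a rough illustration of what local inference with the Ultralytics package might look like, here is a minimal sketch. The weights file name `yolov8n.pt`, the helper names `detect_objects` and `keep_confident`, and the 0.5 threshold are assumptions for illustration and are not taken from the guide quoted above.

```python
from typing import Dict, List


def detect_objects(image_path: str, weights: str = "yolov8n.pt") -> List[Dict]:
    """Run a pretrained YOLO model on one image and return plain dicts."""
    # Imported lazily so this module loads even where ultralytics is not installed.
    from ultralytics import YOLO  # pip install ultralytics

    model = YOLO(weights)          # assumed weights name; fetched on first use
    result = model(image_path)[0]  # one Results object per input image

    detections = []
    for box in result.boxes:
        detections.append({
            "label": result.names[int(box.cls[0])],
            "confidence": float(box.conf[0]),
            "xyxy": [float(v) for v in box.xyxy[0]],
        })
    return detections


def keep_confident(detections: List[Dict], min_confidence: float = 0.5) -> List[Dict]:
    """Drop detections below a confidence threshold (0.5 is an arbitrary default)."""
    return [d for d in detections if d["confidence"] >= min_confidence]
```

Calling `keep_confident(detect_objects("photo.jpg"))` would return only the higher-confidence boxes; `photo.jpg` is a placeholder path.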
-
Prompt-Engineering-Guide
Guides, papers, lectures, notebooks and resources for prompt engineering
• PromptBase: This platform offers a marketplace of ready-made prompts. Browsing here can give you a good sense of how prompts are constructed.
• "Prompt Engineering Guide": A free online resource with tips, techniques, and examples of successful prompts. Link: https://github.com/dair-ai/Prompt-Engineering-Guide
-
5. Avik-Jain/100-Days-Of-ML-Code - As the name implies, this repository offers a structured approach to learning machine learning with Python. It covers core ML principles and algorithms through real-world applications. https://github.com/Avik-Jain/100-Days-Of-ML-Code
-
Tensorflow examples
-
-
Deep-Learning-Papers-Reading-Roadmap
Deep Learning papers reading roadmap for anyone who is eager to learn this amazing tech!
-
-
Project mention: Deep Live Cam: Real-Time Face Swapping and One-Click Video Deepfake Tool | news.ycombinator.com | 2024-08-10
Interesting... This project is built upon "GFPGAN v1.4" (https://github.com/TencentARC/GFPGAN) and "FaceSwap Extension - Automatic 1111 - Proof of Concept" (https://github.com/revolverocelot1/-webui-faceswap-unlocked). The GFPGAN project is grounded on its own in the paper "GFP-GAN: Towards Real-World Blind Face Restoration with Generative Facial Prior" by Wang et al.
-
Project mention: Building a Voice Transcription and Translation App with OpenAI Whisper and Streamlit | dev.to | 2024-11-29
From the official website, Streamlit is a faster way to build and share data apps. It is an open-source Python library that helps you build web applications for sharing analytical results, building complex interactive experiences, and iterating on top of new machine-learning models.
-
Project mention: Show HN: Voice-Pro - AI Voice Cloning Magic: Transform Any Voice in 15 Seconds | news.ycombinator.com | 2024-11-27
It's really easy for a technical person to do as well.
I use Coqui TTS[0] as part of my home automation, I wrote a small python script that lets me upload a voice clip for it to clone after I got the idea from HeyWillow[1], and a small shim that lets me send the output to a Home Assistant media player instead of using their standard output device. I run the TTS container on a VM with a Tesla P4 (~£100 to buy) and get about 1x-2x (roughly the same time it'd take to say it, to process) using the large model.
Just for a giggle, I uploaded a few 3-5 second clips of myself speaking and cloned my voice, then executed a command to our living room media player to call my wife into the room; from another room, she was 100% convinced it was myself speaking words I'd never spoken.
I tried playing with a variety of sentences for a few hours and overall, it sounded almost exactly like me, to me, with the exception of some "attitude" and "intonation" I know I wouldn't use in my speech. I didn't notice much of an improvement using much longer clips; the short ones were "good enough".
Tangentially, it really bugs me that most phone providers in the UK insist you record a "personal greeting" now before they'll let you check your voice mail box, I just record silence, because the last thing I want/need is a voicemail greeting in my voice confirming to some randomer I didn't want calling me, who I am and that my number is active, even more so knowing how I can
[0] https://github.com/coqui-ai/TTS
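The "1x-2x" figure in the comment above reads like a real-time factor: processing time divided by the duration of the generated audio. Below is a minimal sketch of that arithmetic, assuming this interpretation is the intended one. The function name and the sample numbers are illustrative and are not measurements from the comment.

```python
def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Processing time per second of generated audio (1.0 means as long as saying it)."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


# Hypothetical example: a 4-second sentence that takes 6 seconds to synthesize.
rtf = real_time_factor(6.0, 4.0)
print(f"real-time factor: {rtf:.2f}x")  # prints "real-time factor: 1.50x"
```

Under this reading, a value between 1.0 and 2.0 means synthesis takes between one and two times as long as the spoken audio lasts.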
-
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Project mention: DeepSpeed-Domino: Communication-Free LLM Training Engine | news.ycombinator.com | 2024-11-26
-
tesseract.js - JS Version of OCR Library
-
MockingBird
AI voice cloning: Clone a voice in 5 seconds to generate arbitrary speech in real-time
Deep Learning discussion
Deep Learning related posts
-
Ultimate guide to prompt engineering
-
Deprecating PyTorch's official Anaconda channel
-
Genie 2: A large-scale foundation world model
-
Intel Launches Arc B-Series Graphics Cards
-
Albumentations: Fastest and most flexible image augmentation library
-
Albumentations: Fast Image Augmentations
-
Let Claude read your Gas Meter with this Amazing new Feature
-
A note from our sponsor - SaaSHub
www.saashub.com | 7 Dec 2024
Index
What are some of the best open-source Deep Learning projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | tensorflow | 186,712 |
2 | stable-diffusion-webui | 143,804 |
3 | transformers | 135,925 |
4 | Pytorch | 84,676 |
5 | OpenCV | 79,461 |
6 | cs-video-courses | 67,418 |
7 | Keras | 62,164 |
8 | nn | 56,886 |
9 | Real-Time-Voice-Cloning | 52,879 |
10 | faceswap | 52,579 |
11 | yolov5 | 51,282 |
12 | Prompt-Engineering-Guide | 50,664 |
13 | 100-Days-Of-ML-Code | 45,673 |
14 | TensorFlow-Examples | 43,445 |
15 | ColossalAI | 38,850 |
16 | Deep-Learning-Papers-Reading-Roadmap | 38,366 |
17 | Made-With-ML | 37,716 |
18 | GFPGAN | 35,968 |
19 | streamlit | 35,957 |
20 | TTS | 35,874 |
21 | DeepSpeed | 35,693 |
22 | Tesseract.js | 35,428 |
23 | MockingBird | 35,411 |