Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 23 AI Open-Source Projects
-
AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
-
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
-
Lobe Chat
LobeChat is a open-source, extensible (Function Calling), high-performance chatbot framework.It supports one-click free deployment of your private ChatGPT/LLM web application.
-
pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
-
upscayl
🆙 Upscayl - Free and Open Source AI Image Upscaler for Linux, MacOS and Windows built with Linux-First philosophy.
-
dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
-
LocalAI
:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.
-
chatbox
Chatbox is a desktop client for ChatGPT, Claude and other LLMs, available on Windows, Mac, Linux
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: Show HN: I made an app to use local AI as daily driver | news.ycombinator.com | 2024-02-27* LLaVA model: I'll add more documentation. You are right Llava could not generate images. For image generation I don't have immediate plans, but checkout these projects for local image generation.
- https://diffusionbee.com/
- https://github.com/comfyanonymous/ComfyUI
- https://github.com/AUTOMATIC1111/stable-diffusion-webui
Project mention: What AI assistants are already bundled for Linux? | news.ycombinator.com | 2024-03-01> I wouldn't hold my breath waiting for a native Linux AI-assisted assistant.
On Mac when I press Command + Space, it brings up Spotlight search
That can't easily be added to be the equivalent of some kind of LLM prompt on GNOME/KDE/XFCE?
I don't quite know what you'd ask it/do with it that would be of much value? Seems like a quicker way/a wrapper around either asking an LLM questions via CLI or basically Electron wrapping HTML (like this https://github.com/lencx/ChatGPT)?
Generative AI For Beginners: a collection of resources to learn about Generative AI, including tutorials, code samples, and more.
For open assistant, the code: https://github.com/LAION-AI/Open-Assistant/tree/main/inference
Project mention: Show HN: Next-token prediction in JavaScript – build fast LLMs from scratch | news.ycombinator.com | 2024-04-10People on here will be happy to say that I do a similar thing, however my sequence length is dynamic because I also use a 2nd data structure - I'll use pretentious academic speak: I use a simple bigram LM (2-gram) for single next-word likeliness and separately a trie that models all words and phrases (so, n-gram). Not sure how many total nodes because sentence lengths vary in training data, but there are about 200,000 entry points (keys) so probably about 2-10 million total nodes in the default setup.
"Constructing 7-gram LM": They likely started with bigrams (what I use) which only tells you the next word based on 1 word given, and thought to increase accuracy by modeling out more words in a sequence, and eventually let the user (developer) pass in any amount they want to model (https://github.com/google-research/google-research/blob/5c87...). I thought of this too at first, but I actually got more accuracy (and speed) out of just keeping them as bigrams and making a totally separate structure that models out an n-gram of all phrases (e.g. could be a 24-token long sequence or 100+ tokens etc. I model it all) and if that phrase is found, then I just get the bigram assumption of the last token of the phrase. This works better when the training data is more diverse (for a very generic model), but theirs would probably outperform mine on accuracy when the training data has a lot of nearly identical sentences that only change wildly toward the end - I don't find this pattern in typical data though, maybe for certain coding and other tasks there are those patterns though. But because it's not dynamic and they make you provide that number, even a low number (any phrase longer than 2 words) - theirs will always have to do more lookup work than with simple bigrams and they're also limited by that fixed number as far as accuracy. I wonder how scalable that is - if I need to train on occasional ~100-word long sentences but also (and mostly) just ~3-word long sentences, I guess I set this to 100 and have a mostly "undefined" trie.
I also thought of the name "LMJS", theirs is "jslm" :) but I went with simply "next-token-prediction" because that's what it ultimately does as a library. I don't know what theirs is really designed for other than proving a concept. Most of their code files are actually comments and hypothetical scenarios.
I recently added a browser example showing simple autocomplete using my library: https://github.com/bennyschmidt/next-token-prediction/tree/m... (video)
And next I'm implementing 8-dimensional embeddings that are converted to normalized vectors between 0-1 to see if doing math on them does anything useful beyond similarity, right now they look like this:
[nextFrequency, prevalence, specificity, length, firstLetter, lastLetter, firstVowel, lastVowel]
Project mention: Show HN: Memories, FOSS Google Photos alternative built for high performance | news.ycombinator.com | 2024-03-21I have been using https://www.photoprism.app for a couple of years, and it works better than expected, with the latest updates it's actually quite fast and the face tagging works reasonably well.
🔗 https://github.com/microsoft/AI-For-Beginners 🔗 https://microsoft.github.io/AI-For-Beginners/
Project mention: Step by step guide to create customized chatbot by using spaCy (Python NLP library) | dev.to | 2024-03-23Hi Community, In this article, I will demonstrate below steps to create your own chatbot by using spaCy (spaCy is an open-source software library for advanced natural language processing, written in the programming languages Python and Cython):
Project mention: The AI Revolution Is Crushing Thousands of Languages | news.ycombinator.com | 2024-04-25Get your OpenAI API key and then use it on one of the hundreds of open source frontends available, such as: https://github.com/lobehub/lobe-chat
**[I.am.ai AI Expert Roadmap](https://i.am.ai/roadmap)**: This roadmap focuses more on AI and includes various aspects of machine learning and deep learning. It's suitable for those who want to delve deeper into AI, particularly in cutting-edge research and applications.
Project mention: Lightning AI Studios – A persistent GPU cloud environment | news.ycombinator.com | 2023-12-14
Project mention: Why Does Windows Use Backslash as Path Separator? | news.ycombinator.com | 2024-04-24Windows has caused us a lot of issues with Upscayl (https://upscayl.org).
I personally do not use Windows but most of our errors are reported by Windows users where sometimes path parsing is a problem or the drivers mess up vulkan configuration.
Project mention: Visualizer for neural network, deep learning and machine learning models | news.ycombinator.com | 2023-12-26
Project mention: Dify, a visual workflow to build/test LLM applications | news.ycombinator.com | 2024-04-22> https://github.com/langgenius/dify/blob/main/LICENSE
everyone is apparently a license pioneer
Project mention: What’s the Difference Between Fine-tuning, Retraining, and RAG? | dev.to | 2024-04-08Check us out on GitHub.
Deci's YOLO-NAS Pose: Redefining Pose Estimation! Elevating healthcare, sports, tech, and robotics with precision and speed. Github link and blog link down below! Repo: https://github.com/spmallick/learnopencv/tree/master/YOLO-NAS-Pose
Project mention: Chatbox (latest versions) is not open source; AskHN: anything similar? | news.ycombinator.com | 2024-02-08
AI related posts
- OpenAI vs Gemini : Function Calling & Autonomous Agent
- PawanOsman/ChatGPT: Access GPT-3.5.turbo for free via an API
- Observations on MLOps–A Fragmented Mosaic of Mismatched Expectations
- Llama 3 with Function Calling and Code Interpreter
- What is a Plugin Ecosystem and Why Does It Matter?
- Show HN: Langtrace – OpenTelemetry-Based LLM App Observability
- Why Does Windows Use Backslash as Path Separator?
-
A note from our sponsor - InfluxDB
www.influxdata.com | 26 Apr 2024
Index
What are some of the best open-source AI projects? This list will help you:
Project | Stars | |
---|---|---|
1 | AutoGPT | 161,096 |
2 | stable-diffusion-webui | 129,299 |
3 | ChatGPT | 46,892 |
4 | generative-ai-for-beginners | 42,394 |
5 | ColossalAI | 37,836 |
6 | Kong | 37,482 |
7 | Open-Assistant | 36,622 |
8 | MockingBird | 33,796 |
9 | gold-miner | 33,382 |
10 | google-research | 32,804 |
11 | PhotoPrism | 32,590 |
12 | AI-For-Beginners | 31,046 |
13 | spaCy | 28,704 |
14 | Lobe Chat | 28,579 |
15 | AI-Expert-Roadmap | 28,388 |
16 | pytorch-lightning | 26,883 |
17 | upscayl | 26,216 |
18 | netron | 26,040 |
19 | dify | 23,073 |
20 | MindsDB | 21,223 |
21 | learnopencv | 20,363 |
22 | LocalAI | 19,593 |
23 | chatbox | 18,459 |
Sponsored