MiniGPT-4

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  • llama.cpp

    LLM inference in C/C++

  • For a general guide, I recommend: https://timdettmers.com/2023/01/30/which-gpu-for-deep-learni...

    There's a subreddit r/LocalLLaMA that seems like the most active community focused on self-hosting LLMs. Here's a recent discussion on hardware: https://www.reddit.com/r/LocalLLaMA/comments/12lynw8/is_anyo...

    If you're looking just for local inference, your best bet is probably a consumer GPU w/ 24GB of VRAM (a 3090 is fine; a 4090 has more performance potential), which can fit a 30B-parameter 4-bit quantized model that can probably be fine-tuned to ChatGPT (3.5) level quality (a rough loading sketch follows at the end of this comment). If that's not enough, you can add a second card later on.

    Alternatively, if you have an Apple Silicon Mac, llama.cpp performs surprisingly well and is easy to try for free: https://github.com/ggerganov/llama.cpp

    Current AMD consumer cards have terrible software support and IMO aren't really an option. On Windows you might be able to use SHARK or DirectML ports, but nothing will run out of the box. ROCm still has no RDNA3 support (supposedly coming w/ 5.5, but no release date has been announced), and it's unclear how well it'll work. Basically, unless you'd rather fight w/ hardware than play around w/ ML, it's probably best to avoid AMD for now (the older RDNA cards also lack tensor cores, so perf would be hobbled even if you could get things running, and lots of software has been written w/ CUDA-only in mind).
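
    As a rough illustration of the quantized-inference route mentioned above, here is a minimal sketch that loads a large causal LM in 4-bit via Hugging Face transformers and bitsandbytes. The model id is a placeholder and the exact memory footprint depends on the checkpoint and context length, so treat this as one possible setup rather than the recommended one.

        # Hedged sketch: load a ~30B causal LM in 4-bit on a single 24GB GPU.
        # Assumes transformers, accelerate, and bitsandbytes are installed;
        # the model id is a placeholder; substitute whichever checkpoint you actually use.
        import torch
        from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

        model_id = "huggyllama/llama-30b"  # placeholder repo id

        bnb_config = BitsAndBytesConfig(
            load_in_4bit=True,                     # quantize weights to 4-bit at load time
            bnb_4bit_compute_dtype=torch.float16,  # matmuls still run in fp16
        )

        tokenizer = AutoTokenizer.from_pretrained(model_id)
        model = AutoModelForCausalLM.from_pretrained(
            model_id,
            quantization_config=bnb_config,
            device_map="auto",  # let accelerate place layers on the available GPU(s)
        )

        prompt = "Briefly explain what quantization does to a language model:"
        inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
        output = model.generate(**inputs, max_new_tokens=64)
        print(tokenizer.decode(output[0], skip_special_tokens=True))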

  • MiniGPT-4

    Open-sourced code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

  • If I want to learn how to read this code and understand what it and its dependencies are doing, where do I start? Is reading their paper an effective strategy?

    https://github.com/Vision-CAIR/MiniGPT-4/blob/main/MiniGPT_4...

  • MiniGPT-4-discord-bot

    Discontinued: A true multimodal LLaMA derivative -- on Discord!

  • The approach is simple: take a frozen vision encoder and a frozen LLM (Vicuna), connect them with a single linear layer, and train just that tiny layer on some datasets of image-text pairs (a toy sketch of this wiring follows at the end of this comment).

    But the results are pretty amazing. It completely knocks OpenFlamingo and even the original BLIP-2 models out of the park. And best of all, it arrived before OpenAI's GPT-4 image modality did. A real win for open-source AI.

    The repo's default inference code is kind of bad -- Vicuna is loaded in fp16, so it can't fit on any consumer hardware. I created a PR on the repo to load it with int8, so hopefully by tomorrow it'll be runnable by 3090/4090 users.

    I also developed a toy Discord bot (https://github.com/152334H/MiniGPT-4-discord-bot) to show the model to some people, but inference is very slow, so I doubt I'll be hosting it publicly.
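
    To make the "single linear layer" idea concrete, here is a toy PyTorch sketch of the wiring described above. The dimensions and modules are stand-ins rather than MiniGPT-4's actual code; in the real repo the frozen pieces are a BLIP-2-style vision encoder and a Vicuna LLM.

        # Toy sketch of the setup described above (illustrative only, not the repo's code):
        # a frozen vision encoder and a frozen LLM are bridged by one trainable linear layer.
        import torch
        import torch.nn as nn

        VISION_DIM = 768   # width of the frozen vision features (assumed)
        LLM_DIM = 4096     # embedding width of the frozen LLM (assumed)

        # Stand-ins for the frozen pretrained components.
        vision_encoder = nn.Linear(1024, VISION_DIM)   # pretend ViT/Q-Former output head
        llm_embeddings = nn.Embedding(32000, LLM_DIM)  # pretend LLM token embedding table
        for module in (vision_encoder, llm_embeddings):
            for p in module.parameters():
                p.requires_grad = False                # everything pretrained stays frozen

        # The only trainable piece: a single projection from vision space into LLM space.
        projection = nn.Linear(VISION_DIM, LLM_DIM)
        optimizer = torch.optim.AdamW(projection.parameters(), lr=1e-4)

        def image_to_soft_tokens(raw_features: torch.Tensor) -> torch.Tensor:
            """Map frozen vision features to 'soft tokens' the frozen LLM can consume."""
            with torch.no_grad():
                feats = vision_encoder(raw_features)   # (batch, n_tokens, VISION_DIM)
            return projection(feats)                   # (batch, n_tokens, LLM_DIM); only these grads flow

        # During training, the soft tokens would be concatenated with embedded text tokens,
        # fed through the frozen LLM, and the loss would update `projection` alone.

    The int8 PR mentioned above applies the same memory-saving idea on the LLM side: Vicuna is loaded with 8-bit weights instead of fp16 so it fits on a 24GB card.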

  • llama.go

    llama.go is like llama.cpp in pure Golang!

  • I'm developing a framework [1] in Golang with this goal in mind :) It successfully runs relatively big LLMs right now, and diffusion models will be the next step.

    [1] https://github.com/gotzmann/llama.go/

  • AutoGPT

    AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

  • So...AutoGPT? Now with command-line access! Have fun :)

    https://github.com/Significant-Gravitas/Auto-GPT/

  • ROCm

    Discontinued: AMD ROCm™ Software - GitHub Home [Moved to: https://github.com/ROCm/ROCm]

  • 6800 is RDNA2, not RDNA3. The latter is still waiting for ROCm support 4 months post-launch: https://github.com/RadeonOpenCompute/ROCm/issues/1813
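
    For anyone who ends up on AMD anyway, a quick sanity check of whether a ROCm build of PyTorch actually sees the GPU looks roughly like the sketch below (this assumes torch was installed from the ROCm wheels; unsupported cards may simply report no device):

        # Minimal sanity check for ROCm-backed PyTorch (assumes a ROCm wheel of torch is installed).
        import torch

        print(torch.__version__)           # ROCm builds typically report something like "2.x.x+rocmX.Y"
        print(torch.version.hip)           # HIP version string on ROCm builds; None on CUDA-only builds
        print(torch.cuda.is_available())   # ROCm devices are exposed through the torch.cuda API
        if torch.cuda.is_available():
            print(torch.cuda.get_device_name(0))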

NOTE: The number of mentions on this list indicates mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.


Related posts

  • Meet Atom the GPT Assistant, an AI-powered Smart Home Assistant. It's like Google Assistant but with the endless possibilities of ChatGPT; it's like Siri but with the extensibility of open source.

    6 projects | /r/homeautomation | 27 Apr 2023
  • Tech pioneers call for six-month pause of "out-of-control" AI development

    7 projects | /r/technology | 29 Mar 2023
  • IBM Granite: A Family of Open Foundation Models for Code Intelligence

    3 projects | news.ycombinator.com | 7 May 2024
  • LocalAI: Self-hosted OpenAI alternative reaches 2.14.0

    1 project | news.ycombinator.com | 3 May 2024
  • More Agents Is All You Need: LLMs performance scales with the number of agents

    2 projects | news.ycombinator.com | 6 Apr 2024