ggml vs chat.petals.dev

| | ggml | chat.petals.dev |
|---|---|---|
| Mentions | 3 | 8 |
| Stars | 19 | 299 |
| Growth | - | 2.7% |
| Activity | 8.6 | 7.1 |
| Last commit | 7 months ago | about 1 month ago |
| Language | Python | - |
| License | MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ggml
-
Run LLMs at home, BitTorrent‑style
https://github.com/philpax/ggml/blob/gguf-spec/docs/gguf.md#...
It is (IMO) a necessary and good change.
I just specified GGUF because my 3090 cannot host a 70B model without offloading, outside of ExLlama's very new ~2-bit quantization.
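For context on that comment, here is a rough sketch of the arithmetic (weights only; real usage also needs room for the KV cache and activations, and quantized formats carry some per-block overhead):

```python
# Back-of-the-envelope VRAM needed just for the weights of a
# 70B-parameter model at various quantization bit widths.
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    return n_params * bits_per_weight / 8 / 1e9  # bits -> bytes -> GB

N = 70e9  # 70B parameters
for bits in (16, 8, 4, 2):
    print(f"{bits:>2}-bit: {weight_memory_gb(N, bits):6.1f} GB")
# 16-bit: 140.0 GB, 8-bit: 70.0 GB, 4-bit: 35.0 GB, 2-bit: 17.5 GB
```

Only the ~2-bit figure fits in a 3090's 24 GB, which is why anything wider forces offloading.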
- GGUF File Format Specification
-
Meta: Code Llama, an AI Tool for Coding
While we're at it, the GGML file format has been deprecated in favor of GGUF.
https://github.com/philpax/ggml/blob/gguf-spec/docs/gguf.md
https://github.com/ggerganov/llama.cpp/pull/2398
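The GGUF spec linked above defines a small fixed header: the 4-byte magic `GGUF`, a uint32 version, then uint64 tensor and metadata-KV counts, all little-endian, with the metadata key/value pairs following. A minimal sketch of parsing that header, using a synthetic blob (the counts here are made up for illustration, not from a real model file):

```python
import struct

# GGUF header per the spec: magic, version, tensor_count, metadata_kv_count.
HEADER = struct.Struct("<4sIQQ")  # little-endian: 4s, uint32, uint64, uint64

def parse_gguf_header(data: bytes) -> dict:
    magic, version, n_tensors, n_kv = HEADER.unpack_from(data, 0)
    if magic != b"GGUF":
        raise ValueError("not a GGUF file")
    return {"version": version, "tensors": n_tensors, "metadata_kv": n_kv}

# Synthetic header for illustration only.
blob = HEADER.pack(b"GGUF", 3, 291, 19)
print(parse_gguf_header(blob))
# {'version': 3, 'tensors': 291, 'metadata_kv': 19}
```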
chat.petals.dev
-
Make no mistake—AI is owned by Big Tech
ETA: https://chat.petals.dev
-
Run LLMs in BitTorrent style
Check it out at Petals.dev. Chatbot
-
Run LLMs at home, BitTorrent‑style
Hi, a dev here. The `</s>` token means "end of sequence" for LLMs. If a model generates it, it forgets everything and continues with unrelated random text. So I don't think that malicious actors are involved here.
Apparently, the Colab code snippet is just too simplified and does not handle `</s>` correctly. This is not the case with the full chatbot app at https://chat.petals.dev - you can use it instead or take a look at its code.
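A minimal sketch of the handling the dev is describing: decoding should stop the moment the model emits the end-of-sequence id, rather than sampling past it. The "model" below is a hypothetical stub returning a fixed token sequence, and the EOS id is assumed to be 2 (common in Llama-family vocabularies):

```python
EOS_ID = 2  # assumed EOS id for illustration

def fake_model(context):
    # Hypothetical stub standing in for a real LLM's next-token sampler.
    script = [15, 42, 7, EOS_ID, 99, 13]
    return script[len(context) % len(script)]

def generate(prompt_ids, max_new_tokens=10):
    out = list(prompt_ids)
    for _ in range(max_new_tokens):
        tok = fake_model(out)
        if tok == EOS_ID:  # stop instead of continuing past end-of-sequence
            break
        out.append(tok)
    return out

print(generate([]))  # -> [15, 42, 7]; tokens 99 and 13 are never emitted
```

Dropping the `if tok == EOS_ID` check is exactly the "too simplified" failure mode: the transcript keeps going with the unrelated tokens after EOS.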
-
Falcon180B: authors open source a new 180B version!
edit: this community of people is amazing. like 10 minutes after I posted this or so.... it is now up on chat.petals.dev !!!!
- Talk to Falcon 180B-Chat running over Petals
-
ChatGPT Is Down Again
good opportunity to try the free and totally open source Big Science Petals chat: https://chat.petals.dev/ ... Try out Stable Beluga 2 70B
I am currently running my 3090 GPU on there to help out, you can check out https://health.petals.dev/
If you have a spare GPU, consider contributing: https://github.com/bigscience-workshop/petals . I am not associated with them.
-
Sweating Bullets Test
So far, not a single one of the models tested (between 7B and 70B) could figure out the name of the main character (Nick Slaughter). I've tried all sorts of prompts, and the connection between "Tropical Heat" and "Sweating Bullets" is usually known to the model (e.g. "What's the show 'Tropical Heat' called in the US?"). But as soon as I ask about the main character, all the models I have tested so far hallucinate all sorts of names, though usually in the right direction (detectives).
- Petals: Run 100B+ models at home bit-torrent style
What are some alternatives?
ollama - Get up and running with Llama 3, Mistral, Gemma, and other large language models.
askai - Command Line Interface for OpenAI ChatGPT
smartcat
petals - 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
LoRA - Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
godot-dodo - Finetuning large language models for GDScript generation.
KoboldAI-Client
codellama - Inference code for CodeLlama models
ollama-ui - Simple HTML UI for Ollama
artbot-for-stable-diffusion - A front-end GUI for interacting with the AI Horde / Stable Diffusion distributed cluster