So I am looking at running this with llama.cpp with Metal shaders (Mac M1 Ultra, 128 GB), but I'm running into a conversion problem. I can get from tokenizer.json to tokenizer.model using this method, but I can't convert the model to the q4_0 .bin format that llama.cpp uses.
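For reference, the usual two-step flow in llama.cpp looks roughly like the sketch below. This is a hedged example: the exact script name (`convert.py` vs. `convert-pth-to-ggml.py`) and flags vary between llama.cpp versions, and `/path/to/model` is a placeholder for your checkpoint directory, so check the README in your checkout.

```shell
# Step 1 (assumed script name): convert the original checkpoint
# to an f16 ggml file that llama.cpp can read.
python3 convert.py /path/to/model --outtype f16 --outfile model-f16.bin

# Step 2: quantize the f16 file down to q4_0 with the bundled tool.
./quantize model-f16.bin model-q4_0.bin q4_0
```

Note that this flow only works for architectures llama.cpp's converter actually supports, which is the catch discussed below for BLOOM-family models.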
Possibly. There's a llama.cpp fork called bloomz.cpp but it's not been updated in 2 months. So it's not going to support any of the fancy new quantisation methods, performance improvements, GPU acceleration, etc.
Hey u/The-Bloke, appreciate the quants! What is the degradation on some benchmarks? Have you seen https://github.com/EleutherAI/lm-evaluation-harness? 3-bit and 2-bit quants will really be pushing it. I don't see many evaluation results on the quants, and it would be nice to see a before and after.
You need my ggml fork until #343 is merged into ggml to use it.