The Era of 1-bit LLMs: ternary parameters for cost-effective computing

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Scout Monitoring - Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
www.scoutapm.com
featured
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
  • unilm

    Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

  • +1 On this, the real proof would have been testing both models side-by-side.

    It seems that it may be published on GitHub [1] according to HuggingFace [2].

    [1] https://github.com/microsoft/unilm/tree/master/bitnet

    [2] https://huggingface.co/papers/2402.17764

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
  • llama.cpp

    LLM inference in C/C++

  • It does result in a significant degradation relative to unquantized model of the same size, but even with simple llama.cpp K-quantization, it's still worth it all the way down to 2-bit. The chart in this llama.cpp PR speaks for itself:

    https://github.com/ggerganov/llama.cpp/pull/1684#issue-17396...

  • quantized-nets

    Contains code for Binary, Ternary, N-bit Quantized and Hybrid CNNs for low precision experiments.

  • People have been doing this 6 years ago.

        https://github.com/yashkant/quantized-nets

  • trained-ternary-quantization

    Reducing the size of convolutional neural networks

  • https://github.com/TropComplique/trained-ternary-quantization

  • Ternary-Weights-Network

    Ternay-Weights-Network/Pytorch

  • https://github.com/buaabai/Ternary-Weights-Network

  • StableLM

    StableLM: Stability AI Language Models

  • https://github.com/Stability-AI/StableLM?tab=readme-ov-file#...

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts