The Era of 1-bit LLMs: ternary parameters for cost-effective computing

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  • unilm

    Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

  • +1 on this; the real proof would have been testing both models side by side.

    It seems that it may be published on GitHub [1] according to HuggingFace [2].

    [1] https://github.com/microsoft/unilm/tree/master/bitnet

    [2] https://huggingface.co/papers/2402.17764

  • llama.cpp

    LLM inference in C/C++

  • It does result in a significant degradation relative to an unquantized model of the same size, but even with simple llama.cpp K-quantization, it's still worth it all the way down to 2-bit. The chart in this llama.cpp PR speaks for itself:

    https://github.com/ggerganov/llama.cpp/pull/1684#issue-17396...

  • quantized-nets

    Contains code for Binary, Ternary, N-bit Quantized and Hybrid CNNs for low precision experiments.

  • People were already doing this 6 years ago.

    https://github.com/yashkant/quantized-nets

  • trained-ternary-quantization

    Reducing the size of convolutional neural networks

  • https://github.com/TropComplique/trained-ternary-quantization

  • Ternary-Weights-Network

    Ternary-Weights-Network/Pytorch

  • https://github.com/buaabai/Ternary-Weights-Network

  • StableLM

    StableLM: Stability AI Language Models

  • https://github.com/Stability-AI/StableLM?tab=readme-ov-file#...

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.
