ggllm.cpp

Falcon LLM ggml framework with CPU and GPU support (by cmp-nct)

ggllm.cpp reviews and mentions

Posts with mentions or reviews of ggllm.cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-08-25.
  • Is there a way to use a quantized Falcon 40B with SillyTavern (on Apple Silicon)
    1 project | /r/LocalLLaMA | 25 Aug 2023
    I'd like to try https://huggingface.co/TheBloke/WizardLM-Uncensored-Falcon-40B-GGML with SillyTavern (running on Apple Silicon). The only way I've found to run Falcon 40B quantized on Apple Silicon is with https://github.com/cmp-nct/ggllm.cpp but I haven't figured out any way to get SillyTavern to use that as a local model. Does anyone know of a way to get this working?
  • How Is LLaMa.cpp Possible?
    11 projects | news.ycombinator.com | 15 Aug 2023
    It doesn't support Falcon right now, but there's a fork that does (https://github.com/cmp-nct/ggllm.cpp/).
  • Alfred-40B, an OSS RLHF version of Falcon40B
    1 project | news.ycombinator.com | 1 Aug 2023
  • Falcon ggml/ggcc with langchain
    2 projects | /r/LocalLLaMA | 19 Jul 2023
    To load Falcon models in the new ggcc file format, which is similar to ggml, I'm using this tool: https://github.com/cmp-nct/ggllm.cpp, which is a fork of https://github.com/ggerganov/llama.cpp
  • Show HN: Danswer – open-source question answering across all your docs
    7 projects | news.ycombinator.com | 10 Jul 2023
    The GGLLM fork seems to be the leading option for Falcon for now [1]

    It comes with its own variant of the GGML sub-format, "ggcv1", but there are quants available on HF [2]

    Although if you have a GPU, I'd go with the newly released AWQ quantization instead [3]; the performance is better.

    (I may or may not have a mild local LLM addiction - and video cards cost more than drugs)

    [1] https://github.com/cmp-nct/ggllm.cpp

    [2] https://huggingface.co/TheBloke/falcon-7b-instruct-GGML

    [3] https://huggingface.co/abhinavkulkarni/tiiuae-falcon-7b-inst...

  • ChatGPT loses users for first time, shaking faith in AI revolution
    6 projects | news.ycombinator.com | 7 Jul 2023
    For base tooling, things like:

    https://huggingface.co/ (finding models and downloading them)

    https://github.com/ggerganov/llama.cpp (llama)

    https://github.com/cmp-nct/ggllm.cpp (falcon)

    For interactive work (art/chat/research/playing around), things like:

    https://github.com/oobabooga/text-generation-webui/blob/main... (llama) (Also, a decent chat server was just added to the llama.cpp project itself)

    https://github.com/invoke-ai/InvokeAI (stable-diffusion)

    Plus a bunch of hacked together scripts.

    Some example models (I'm linking to quantized versions that someone else has made, but the tooling is in the above repos to create them from the published fp16 models)

    https://huggingface.co/TheBloke/llama-65B-GGML

    https://huggingface.co/TheBloke/falcon-40b-instruct-GPTQ

    https://huggingface.co/TheBloke/Wizard-Vicuna-30B-Uncensored...

    etc. Hugging Face has quite a number, although some base models require filling out forms before you can use them for tuning/training.

  • Falcon LLM – A 40B Model
    6 projects | news.ycombinator.com | 17 Jun 2023
  • Run machine learning on 7900XT/7900XTX using ROCm 5.5.0 on Ubuntu 22.04
    4 projects | /r/Amd | 16 Jun 2023
    I did another test running an LLM (gpt4all-falcon) quantized to Q5_0 and Q5_1 on an AMD GPU (https://huggingface.co/nomic-ai/gpt4all-falcon). I used this awesome project (https://github.com/cmp-nct/ggllm.cpp, forked from https://github.com/ggerganov/llama.cpp). I hipified the CUDA file into HIP code and made some modifications to it (PR: https://github.com/cmp-nct/ggllm.cpp/pull/3).

Stats

Basic ggllm.cpp repo stats
Mentions: 8
Stars: 242
Activity: 9.5
Last commit: 4 months ago