twinny vs llama.cpp

twinny

The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but completely free and 100% private. (by rjmacarthy)

Source Code

rjmacarthy.github.io

Suggest alternative

Edit details

llama.cpp

LLM inference in C/C++ (by ggerganov)

llama llm

Source Code

Suggest alternative

Edit details

SurveyJS - Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App

With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

surveyjs.io

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

twinny		llama.cpp
	Project
7	Mentions	778
1,750	Stars	57,984
-	Growth	-
9.9	Activity	10.0
5 days ago	Latest Commit	3 days ago
TypeScript	Language	C++
MIT License	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

twinny

Posts with mentions or reviews of twinny. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-07.

Twinny: Locally hosted (or API hosted) AI code completion for Visual Studio Code
1 project | news.ycombinator.com | 10 Apr 2024
The lifecycle of a code AI completion
6 projects | news.ycombinator.com | 7 Apr 2024

For those who might not be aware of this, there is also an open source project on GitHub called "Twinny" which is an offline Visual Studio Code plugin equivalent to Copilot: https://github.com/rjmacarthy/twinny
It can be used with a number of local model services. Currently for my setup on a NVIDIA 4090, I'm running both the base and instruct model for deepseek-coder 6.7b using 5_K_M Quantization GGUF files (for performance) through llama.cpp "server" where the base model is for completions and the instruct model for chat interactions.
llama.cpp: https://github.com/ggerganov/llama.cpp/
deepseek-coder 6.7b base GGUF files: https://huggingface.co/TheBloke/deepseek-coder-6.7B-base-GGU...
deepseek-coder 6.7b instruct GGUF files: https://huggingface.co/TheBloke/deepseek-coder-6.7B-instruct...
Private Ollama GitHub Copilot Alternative with FIM and Chat
1 project | news.ycombinator.com | 16 Jan 2024
Ollama AI code completion plugin for VSCode, 100% free and 100% private
1 project | news.ycombinator.com | 3 Jan 2024
A new locally hosted AI code completion API and vscode extension. Like Copilot but totally free and best of all private.
1 project | /r/coding | 30 Aug 2023
Continue with LocalAI: An alternative to GitHub's Copilot that runs locally
6 projects | news.ycombinator.com | 28 Aug 2023
Locally hosted code completion API and vscode extension. 100% free and 100% private.
2 projects | /r/selfhosted | 24 Aug 2023

https://github.com/rjmacarthy/twinny - vscode extension https://github.com/rjmacarthy/twinny-api - python inference api

llama.cpp

Posts with mentions or reviews of llama.cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-05-07.

IBM Granite: A Family of Open Foundation Models for Code Intelligence
3 projects | news.ycombinator.com | 7 May 2024

if you can compile stuff, then looking at llama.cpp (what ollama uses) is also interesting: https://github.com/ggerganov/llama.cpp
the server is here: https://github.com/ggerganov/llama.cpp/tree/master/examples/...
And you can search for any GGUF on huggingface
Ask HN: Affordable hardware for running local large language models?
1 project | news.ycombinator.com | 5 May 2024

Yes, Metal seems to allow a maximum of 1/2 of the RAM for one process, and 3/4 of the RAM allocated to the GPU overall. There’s a kernel hack to fix it, but that comes with the usual system integrity caveats. https://github.com/ggerganov/llama.cpp/discussions/2182
Xmake: A modern C/C++ build tool
7 projects | news.ycombinator.com | 4 May 2024
Better and Faster Large Language Models via Multi-Token Prediction
1 project | news.ycombinator.com | 1 May 2024

For anyone interested in exploring this, llama.cpp has an example implementation here:
https://github.com/ggerganov/llama.cpp/tree/master/examples/...
Llama.cpp Bfloat16 Support
1 project | news.ycombinator.com | 30 Apr 2024
Fine-tune your first large language model (LLM) with LoRA, llama.cpp, and KitOps in 5 easy steps
1 project | dev.to | 30 Apr 2024

Getting started with LLMs can be intimidating. In this tutorial we will show you how to fine-tune a large language model using LoRA, facilitated by tools like llama.cpp and KitOps.
GGML Flash Attention support merged into llama.cpp
1 project | news.ycombinator.com | 30 Apr 2024
Phi-3 Weights Released
1 project | news.ycombinator.com | 23 Apr 2024

well https://github.com/ggerganov/llama.cpp/issues/6849
Lossless Acceleration of LLM via Adaptive N-Gram Parallel Decoding
3 projects | news.ycombinator.com | 21 Apr 2024
Llama.cpp Working on Support for Llama3
1 project | news.ycombinator.com | 18 Apr 2024

Compare twinny vs llama.cpp and see what are their differences.

twinny

llama.cpp

twinny

llama.cpp