marsha vs llama.cpp
| | marsha | llama.cpp |
|---|---|---|
| Mentions | 12 | 791 |
| Stars | 461 | 59,810 |
| Growth | 0.2% | - |
| Activity | 8.4 | 10.0 |
| Latest commit | 7 months ago | about 22 hours ago |
| Language | Python | C++ |
| License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
marsha
-
LLMs as compilers
There is already a lot of hay to mow with the current state of affairs in generative AI. LLMs as proper compilers, compiLLMers if you will, can produce correct code reliably enough today given enough guidance. Getting an LLM to generate correct code requires providing various examples and descriptive instructions. The UX of a chat interface to an LLM inherently leads people to write prompts that do not meet these criteria.

We need to make it easy for people to give LLMs precise descriptions and numerous examples as concisely as possible, via syntaxes close enough to English that they remain easy to learn and use. Coq is a great example of a functional programming syntax that is verbose and distant from English, but example-driven via assertions. David Ellis, Alejandro Guillen and I recently introduced Marsha as a proposal for what a syntax that meets these requirements can look like.

It is still early, but LLMs will increasingly give us the power to create more accessible representations of computer programs that read close to English. These representations will be distilled by LLMs into the complexities of today's high-level languages. Knowing Java or Python will become a rare skill, akin to specializing in low-level optimization with C or assembly today. Instead, the focus of developer experience will shift to the higher-level abstractions built on top of LLMs and to composing those abstractions for different tasks. CompiLLMers will make programming accessible enough in the near future that writing software becomes part of the resume of most knowledge workers.
-
Show HN: Marsha – An LLM-Based Programming Language
> You're a bit too black-and-white on this situation.
While I agree with your other points, I feel this argument doesn't really hold water.
The output of the c compiler is deterministic.
I struggle very hard to believe that floating point rounding errors when you compile C will cause it to occasionally emit a binary that is not byte-identical across multiple sequential runs.
What any program does at runtime is essentially non-deterministic, and that's 100% not what we're talking about here.
If you consider https://github.com/alantech/marsha/blob/main/examples/web/we... ...
The desired output of this file is a probability distribution with a sweet spot where the code does what you want; there are multiple code outputs that sit in the sweet spot, and you want one of these.
The actual output of this file is a probability distribution that includes the examples, but may or may not overlap the sweet spot of 'actually does the right thing'.
...and in fact, there's no specific reason to expect that, regardless of the number of examples you provide, the distribution that includes those examples also includes the sweet spot.
For common examples it will, but I'd argue it's actually provable that there are cases (e.g. where the output length of a valid solution would exceed the maximum output length of the model) where, regardless of the examples / tests, it's not actually possible to generate a valid solution. Just like how constraint solvers will sometimes tell you there's no solution that matches all the constraints.
So, that would be like a compiler error. "You've asked for something impossible".
...but I imagine it would be very, very difficult to tell the difference between inputs that overlap the sweet spot and those that don't; the ones that don't will have solutions that look right but actually only cover the examples, and there's literally no way of telling the difference between that and a correct solution without something like RLHF.
It seems like an intractable problem to me.
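To make the "looks right but only covers the examples" failure mode concrete, here is a minimal, hypothetical Python sketch (the function name and examples are invented for illustration): two candidate outputs both pass an example-only check, yet only one sits in the "sweet spot" of actually doing the right thing.

```python
# Hypothetical spec: is_even(n) -> bool, with three examples provided.
EXAMPLES = [(2, True), (3, False), (10, True)]

# Candidate A: a genuinely correct solution (inside the "sweet spot").
def candidate_a(n):
    return n % 2 == 0

# Candidate B: a plausible-looking output that only memorises the examples.
def candidate_b(n):
    return n in (2, 10)

def passes_examples(candidate):
    # Example-based validation: the only check the "compiler" can run.
    return all(candidate(arg) == expected for arg, expected in EXAMPLES)

print(passes_examples(candidate_a))   # True
print(passes_examples(candidate_b))   # True -- indistinguishable from A using examples alone
print(candidate_a(4), candidate_b(4)) # True False -- they diverge outside the examples
```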
> Different tools for different scenarios, so if that is a huge problem, don't use Marsha as it currently is.
As you say~
- Marsha, a ChatGPT-based programming language
- Marsha is a functional, higher-level, English-based programming language that gets reliably compiled into tested Python software by ChatGPT
-
Llama 2 – Meta AI
So this comment inspired me to write a Roman Numeral to Integer function in our LLM-based programming language, Marsha: https://github.com/alantech/marsha/blob/main/examples/genera...
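For context, a Marsha spec is example-driven; the sketch below is a paraphrase (not the actual file at the truncated link), with the spec shown as comments and a guess at what the compiled Python output could look like.

```python
# Paraphrased, example-driven spec in the spirit of a Marsha function
# (illustrative only; see the linked example for the real syntax):
#   func roman_to_int(roman numeral string): the integer value of the numeral
#   roman_to_int('III') = 3
#   roman_to_int('IX') = 9
#   roman_to_int('MCMXCIV') = 1994

# One plausible shape of the Python an LLM might emit for that spec:
VALUES = {'I': 1, 'V': 5, 'X': 10, 'L': 50, 'C': 100, 'D': 500, 'M': 1000}

def roman_to_int(roman: str) -> int:
    total = 0
    for i, ch in enumerate(roman):
        value = VALUES[ch]
        # A numeral smaller than its right neighbour is subtractive (e.g. IV, XC).
        if i + 1 < len(roman) and value < VALUES[roman[i + 1]]:
            total -= value
        else:
            total += value
    return total

assert roman_to_int('III') == 3
assert roman_to_int('IX') == 9
assert roman_to_int('MCMXCIV') == 1994
```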
llama.cpp
-
IBM Granite: A Family of Open Foundation Models for Code Intelligence
If you can compile stuff, then looking at llama.cpp (what ollama uses) is also interesting: https://github.com/ggerganov/llama.cpp
the server is here: https://github.com/ggerganov/llama.cpp/tree/master/examples/...
And you can search for any GGUF on Hugging Face
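For a rough idea of how that server is used once it is running, here is a minimal sketch assuming the default llama.cpp server listening on localhost:8080 and its /completion endpoint (host, port, prompt, and parameters here are assumptions for illustration, not taken from the comment above).

```python
# Minimal sketch: query a locally running llama.cpp server.
# Assumes the server was started with a GGUF model and is listening on port 8080.
import json
import urllib.request

payload = {
    "prompt": "Explain what a GGUF file is in one sentence.",
    "n_predict": 64,      # maximum number of tokens to generate
    "temperature": 0.7,
}

req = urllib.request.Request(
    "http://localhost:8080/completion",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)

with urllib.request.urlopen(req) as resp:
    result = json.loads(resp.read())

print(result["content"])  # the generated text
```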
-
Ask HN: Affordable hardware for running local large language models?
Yes, Metal seems to allow a maximum of 1/2 of the RAM for one process, and 3/4 of the RAM allocated to the GPU overall. There’s a kernel hack to fix it, but that comes with the usual system integrity caveats. https://github.com/ggerganov/llama.cpp/discussions/2182
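As a rough back-of-the-envelope check of what that limit means in practice (the 3/4 figure is the one mentioned above; the machine size is just an example):

```python
# Rough estimate of GPU-visible memory on Apple Silicon under the default Metal limit.
total_ram_gb = 64                      # example machine, not a recommendation
gpu_budget_gb = total_ram_gb * 3 / 4   # ~75% of unified memory available to the GPU overall
print(gpu_budget_gb)                   # 48.0 -> quantized model + KV cache must fit in this
```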
- Xmake: A modern C/C++ build tool
-
Better and Faster Large Language Models via Multi-Token Prediction
For anyone interested in exploring this, llama.cpp has an example implementation here:
https://github.com/ggerganov/llama.cpp/tree/master/examples/...
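For readers who want the intuition without digging into the example, here is a toy sketch of the draft-and-verify loop that multi-token prediction can feed at inference time (self-speculative decoding). This is an illustration only, not the paper's method or llama.cpp's example code; the stand-in "models" are fake and deterministic.

```python
# Toy sketch of draft-and-verify decoding: a cheap multi-token draft is checked
# against the full model, and only the verified prefix is kept.
import random

random.seed(0)
VOCAB = "abcdefgh"

def target_model_next(context):
    # Stand-in for the full model: deterministic "correct" next token.
    return VOCAB[sum(map(ord, context)) % len(VOCAB)]

def draft_k_tokens(context, k):
    # Stand-in for the cheap draft / multi-token head: usually right, sometimes wrong.
    out = []
    for _ in range(k):
        tok = target_model_next(context + "".join(out))
        if random.random() < 0.2:          # occasional draft mistake
            tok = random.choice(VOCAB)
        out.append(tok)
    return out

def generate(context, n_tokens, k=4):
    while n_tokens > 0:
        draft = draft_k_tokens(context, k)
        accepted = []
        for tok in draft:
            # Verify each drafted token against the full model.
            if tok == target_model_next(context + "".join(accepted)):
                accepted.append(tok)
            else:
                break                        # first mismatch: discard the rest of the draft
        if not accepted:                     # always make progress with the full model
            accepted = [target_model_next(context)]
        context += "".join(accepted)
        n_tokens -= len(accepted)
    return context

print(generate("seed", 16))
```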
- Llama.cpp Bfloat16 Support
-
Fine-tune your first large language model (LLM) with LoRA, llama.cpp, and KitOps in 5 easy steps
Getting started with LLMs can be intimidating. In this tutorial we will show you how to fine-tune a large language model using LoRA, facilitated by tools like llama.cpp and KitOps.
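The tutorial walks through the full workflow; purely as orientation, a LoRA setup with the Hugging Face peft library tends to look something like the sketch below (the model name and hyperparameters are placeholders, and this is not the tutorial's exact code).

```python
# Minimal LoRA setup sketch using Hugging Face transformers + peft
# (placeholder model and hyperparameters; not the tutorial's exact steps).
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base_model_name = "meta-llama/Llama-2-7b-hf"  # placeholder; any causal LM works
model = AutoModelForCausalLM.from_pretrained(base_model_name)
tokenizer = AutoTokenizer.from_pretrained(base_model_name)

lora_config = LoraConfig(
    r=8,                                  # rank of the low-rank update matrices
    lora_alpha=16,                        # scaling factor
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the small adapter weights are trainable

# ...train with your usual Trainer / dataset, then export the adapter; llama.cpp's
# conversion scripts can turn the result into GGUF for local inference.
```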
- GGML Flash Attention support merged into llama.cpp
-
Phi-3 Weights Released
well https://github.com/ggerganov/llama.cpp/issues/6849
- Lossless Acceleration of LLM via Adaptive N-Gram Parallel Decoding
- Llama.cpp Working on Support for Llama3
What are some alternatives?
maccarone - AI-managed code blocks in Python ⏪⏩
ollama - Get up and running with Llama 3, Mistral, Gemma, and other large language models.
llama2-chatbot - LLaMA v2 Chatbot
gpt4all - gpt4all: run open-source LLMs anywhere
OpenPipe - Turn expensive prompts into cheap fine-tuned models
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
llama - Inference code for LLaMA models on CPU and Mac M1/M2 GPU
GPTQ-for-LLaMa - 4 bits quantization of LLaMA using GPTQ
ggml - Tensor library for machine learning
cog-llama-template - LLaMA Cog template
alpaca.cpp - Locally run an Instruction-Tuned Chat-Style LLM