cria
-
Show HN: Speeding up LLM inference 2x times (possibly)
It originally started as a fork of Recmo's cria, a pure-numpy llama impl :)
https://github.com/recmo/cria
Took a whole night to compute just a few tokens.
-
Jsonformer: A bulletproof way to generate structured output from LLMs
Not op, but I can share my approach: I went line by line through Recmo's Cria: https://github.com/recmo/cria - an implementation of Llama in NumPy, so very low level. It took me, I think, 3-4 days of 10 hours each, plus 1-2 days of reading about Transformers, to understand what's going on. But from that you can see how models generate text, and you come away with a deep understanding of what's happening.
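The text-generation loop that such a line-by-line read reveals can be sketched in plain Python. A toy scoring function stands in for the real forward pass; the function names are illustrative, not taken from Cria:

```python
import math

def softmax(logits):
    # Subtract the max for numerical stability, then normalize to probabilities.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def toy_logits(tokens, vocab_size=5):
    # Stand-in for a real model forward pass: strongly favors
    # (last_token + 1) mod vocab_size.
    scores = [0.0] * vocab_size
    scores[(tokens[-1] + 1) % vocab_size] = 5.0
    return scores

def generate(prompt_tokens, steps=4):
    tokens = list(prompt_tokens)
    for _ in range(steps):
        probs = softmax(toy_logits(tokens))
        # Greedy decoding: append the most probable next token.
        tokens.append(max(range(len(probs)), key=probs.__getitem__))
    return tokens

print(generate([0]))  # → [0, 1, 2, 3, 4]
```

A real implementation replaces `toy_logits` with the full Transformer stack, but the outer loop - score, softmax, pick, append - is the same.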
- LLaMA for poor
effort
-
Some scientists can't stop using AI to write research papers
My experience is exactly the opposite. AI has way more domain-specific knowledge than any translator I could imagine.
I recently published effort ( http://kolinko.github.io/effort/ - it got to the front page of HN two weeks ago ), and literally everything on that page was rewritten by ChatGPT.
The flow is that I write it however I can, sometimes so badly that even a human professional would have difficulty understanding it, then I ask ChatGPT to go paragraph by paragraph and rewrite and smooth out the text.
You can see how the website was reworked here:
https://chat.openai.com/share/e/10d7ba3f-f7eb-48cd-9250-d864...
GPT has way more domain knowledge in my area than most engineer friends I know, not to mention translators, and it managed to help not just with grammar but with the overall explanations I wrote.
Of course it fails at higher-level concepts, and at plenty of other things, but still - I can get more quality output from it in an hour than by working for a week with a dedicated translator/editor.
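The paragraph-by-paragraph flow described above can be sketched as follows. The prompt wording and the commented-out `chat.completions.create` call are assumptions based on the standard OpenAI Python client, not the author's actual workflow:

```python
def build_rewrite_prompts(text):
    # One prompt per paragraph; blank lines separate paragraphs.
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    return [
        "Rewrite this paragraph in clear, natural English, "
        "keeping the technical meaning intact:\n\n" + p
        for p in paragraphs
    ]

# Hypothetical usage with the OpenAI client (requires an API key):
# from openai import OpenAI
# client = OpenAI()
# for prompt in build_rewrite_prompts(draft):
#     resp = client.chat.completions.create(
#         model="gpt-4",
#         messages=[{"role": "user", "content": prompt}],
#     )
#     print(resp.choices[0].message.content)

prompts = build_rewrite_prompts("First rough paragraph.\n\nSecond, rougher one.")
print(len(prompts))  # → 2
```

Working one paragraph at a time keeps each request small and makes it easy to review and accept or reject each rewrite individually.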
-
Show HN: Speeding up LLM inference 2x times (possibly)
I think it was somewhere around that tag:
https://github.com/kolinko/effort/releases/tag/5.0-last-mixt...
I can't easily rerun it any more, because the underlying model/weight names changed in the meantime. It doesn't help that Mixtral's published .safetensors files seem messed up, so I needed to hack together a conversion from the PyTorch weights, which added an extra layer of confusion to the project.
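The kind of weight-name drift mentioned above can be handled with a plain dict remap over the loaded state dict. The key names below are hypothetical, chosen only to illustrate the shape of the problem, not the actual Mixtral tensor names:

```python
def remap_weights(state_dict, key_map):
    # Rename tensors whose keys changed between model releases;
    # keys without a mapping are kept as-is.
    return {key_map.get(k, k): v for k, v in state_dict.items()}

# Hypothetical old -> new tensor names, for illustration only.
KEY_MAP = {
    "layers.0.attention.wq.weight": "model.layers.0.self_attn.q_proj.weight",
}

old = {"layers.0.attention.wq.weight": [0.1, 0.2], "norm.weight": [1.0]}
new = remap_weights(old, KEY_MAP)
print(sorted(new))  # → ['model.layers.0.self_attn.q_proj.weight', 'norm.weight']
```

When the published names change, only `KEY_MAP` needs updating; the tensors themselves pass through untouched.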
What are some alternatives?
transmogrifier - Unstructured data goes in, structured data comes out. Sometimes comedically.
clownfish - Constrained Decoding for LLMs against JSON Schema
magic - AI functions for TypeScript