tinygrad vs neural-engine

| | tinygrad | neural-engine |
|---|---|---|
| Mentions | 58 | 22 |
| Stars | 17,800 | 1,884 |
| Growth | - | - |
| Activity | 9.7 | 5.1 |
| Latest commit | 10 months ago | about 1 month ago |
| Language | Python | - |
| License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
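The exact formula behind the activity number isn't published, but a recency-weighted score like the one described can be computed by discounting each commit by its age, e.g. with exponential decay. The sketch below is purely hypothetical (the half-life and weighting are made-up assumptions, not the site's actual metric):

```python
def activity_score(commit_ages_days, half_life_days=90.0):
    """Hypothetical recency-weighted activity score: each commit
    contributes 0.5 ** (age / half_life), so a commit from last week
    counts far more than one from last year."""
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)

# Same number of commits, very different scores:
print(activity_score([1, 3, 7, 14, 30]))          # mostly recent -> ~4.6
print(activity_score([300, 400, 500, 600, 700]))  # all old -> ~0.2
```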
tinygrad
- tinygrad: extreme simplicity, easiest framework to add new accelerators to
- GGML – AI at the Edge
Might be a silly question but is GGML a similar/competing library to George Hotz's tinygrad [0]?
[0] https://github.com/geohot/tinygrad
- Render neural network into CUDA/HIP code
At first glance I thought it might be like tinygrad, but it looks like it has many more ops than tinygrad, and most map to underlying hardware-provided ops?
I wonder how well tinygrad's approach will work out. Op fusion sounds easy: just walk the graph, pattern-match it, and lower to hardware-provided ops?
Anyway, if anyone wants to understand the philosophy behind tinygrad, this file is a great start: https://github.com/geohot/tinygrad/blob/master/docs/abstract...
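To make the "walk a graph, pattern-match, lower" idea concrete, here is a minimal hypothetical sketch of one such rewrite. This is not tinygrad's actual implementation; the `Node` type, the `mul`/`add` pattern, and the fused `muladd` op are all made up for illustration:

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    op: str                      # e.g. "load", "mul", "add"
    inputs: list = field(default_factory=list)

def fuse_muladd(node: Node) -> Node:
    """Walk the graph bottom-up; where an 'add' consumes a 'mul',
    rewrite the pair into a single fused 'muladd' op, which a backend
    could then lower to a hardware FMA instruction."""
    node.inputs = [fuse_muladd(i) for i in node.inputs]
    if node.op == "add":
        for i, inp in enumerate(node.inputs):
            if inp.op == "mul":
                others = node.inputs[:i] + node.inputs[i + 1:]
                return Node("muladd", inp.inputs + others)
    return node

# (a * b) + c  ->  muladd(a, b, c)
a, b, c = Node("load"), Node("load"), Node("load")
g = Node("add", [Node("mul", [a, b]), c])
print(fuse_muladd(g).op)  # muladd
```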
- llama.cpp now officially supports GPU acceleration.
There are currently at least 3 ways to run llama on M1 with GPU acceleration:
- mlc-llm (pre-built, only 1 model has been ported)
- tinygrad (very memory efficient, not that easy to integrate into other projects)
- llama-mps (original llama codebase + llama adapter support)
- George Hotz building an AMD competitor to Nvidia.
- George Hotz ROCm adventures
Hopefully we will now see full support for AMD hardware in https://github.com/geohot/tinygrad. You can read more about it at https://tinygrad.org/
- The Coming of Local LLMs
tinygrad: https://github.com/geohot/tinygrad/tree/master/accel/ane
But I have not tested it on Linux since Asahi has not yet added support.
llama.cpp runs at 18ms per token (7B) and 200ms per token (65B) without quantization.
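For reference, those latencies convert to throughput like this (just unit arithmetic on the numbers quoted above, not new measurements):

```python
# llama.cpp figures quoted above: ms per token -> tokens per second
for model, ms_per_token in [("7B", 18), ("65B", 200)]:
    print(f"{model}: {1000 / ms_per_token:.1f} tokens/s")
# 7B: 55.6 tokens/s
# 65B: 5.0 tokens/s
```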
- Everything we know about Apple's Neural Engine
- Everything we know about the Apple Neural Engine (ANE)
- How 'Open' Is OpenAI, Really?
neural-engine
- Apple Introduces M4 Chip
~38 TOPS at fp16 is amazing, if the quoted number is fp16 (the ANE is fp16 according to this [1], but that honestly seems like a bad choice when people are going smaller and smaller even on the higher-end datacenter cards, so I'm not sure why Apple would use it instead of fp8 natively)
[1]: https://github.com/hollance/neural-engine/blob/master/docs/1...
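For a quick feel for what fp16 already gives up (and why fp8 would give up even more), numpy's float16 makes the rounding and range limits easy to see. This is a generic data-type demo, not anything ANE-specific:

```python
import numpy as np

# fp16 has a 10-bit mantissa, so above 2048 consecutive integers
# can no longer be represented exactly.
x = np.float16(2048)
print(x + np.float16(1))   # 2048.0 -- the +1 is lost to rounding

# Dynamic range is also narrow: the largest finite fp16 value is 65504.
print(np.float16(65504) * np.float16(2))  # inf
```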
- Optimize sgemm on RISC-V platform
Yep. They have a neural engine, separate from the CPU and GPU, that does really fast matmuls: https://github.com/hollance/neural-engine. It's basically completely undocumented.
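Since there is no public API for the ANE itself, the closest you can get is converting a model with coremltools and asking Core ML to schedule it on the Neural Engine. A minimal sketch (assuming coremltools and PyTorch are installed, on macOS; whether the matmul actually lands on the ANE rather than falling back to CPU is decided by Apple's opaque scheduler):

```python
import torch
import coremltools as ct

class MatMul(torch.nn.Module):
    def forward(self, a, b):
        return a @ b

a = torch.randn(256, 256)
b = torch.randn(256, 256)
traced = torch.jit.trace(MatMul().eval(), (a, b))

mlmodel = ct.convert(
    traced,
    inputs=[ct.TensorType(name="a", shape=a.shape),
            ct.TensorType(name="b", shape=b.shape)],
    # Request CPU + Neural Engine; Core ML may still fall back to CPU.
    compute_units=ct.ComputeUnit.CPU_AND_NE,
)
result = mlmodel.predict({"a": a.numpy(), "b": b.numpy()})
```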
- Apple is adding more and more neural engine cores to their products, is there any way to use them for local LLMs?
Looks like the ANE ("Apple Neural Engine") cores are powerful but not as flexible/programmable as the GPU cores. There is no sign that LLM inference is possible with them, or ever will be, unless Apple either opens up the closed ANE software framework for extensibility or extends it to support modern LLMs itself. I would not hold my breath.
- Anthropic’s $5B, 4-year plan to take on OpenAI
If Apple were to wake up to what's happening with llama.cpp etc., I don't see such a big role for paying for remote access to big models via an API.
Currently a MacBook has a Neural Engine that sits idle 99% of the time and is only suitable for running limited models (poorly documented, opaque rules about which ops can be accelerated, a black-box compiler [1], and an apparent 3GB model size limit [2]).
OTOH you can buy a MacBook with 64GB 'unified' memory and a Neural Engine today.
If you squint a bit and look into the near future, it's not hard to imagine a future Mx chip with a more capable Neural Engine and yet more RAM, able to run the largest GPT-3-class models locally. (Ideally with better developer tools, so other compilers can target the NE.)
And then imagine it does that while leaving the CPU+GPU mostly free to run apps/games... the whole experience of using a computer could change radically in that case.
I find it hard not to think this is coming within 5 years (although, equally, I can imagine this is not on Apple's roadmap at all currently).
[1] https://github.com/hollance/neural-engine
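Back-of-the-envelope numbers make that gap concrete, taking the 3GB limit claimed above, fp16 weights per [1], and GPT-3's well-known 175B parameter count:

```python
GB = 1024**3
bytes_per_param = 2  # fp16 weights

print(f"3GB ANE limit ~= {3 * GB / bytes_per_param / 1e9:.1f}B params")
print(f"GPT-3 (175B) at fp16 ~= {175e9 * bytes_per_param / GB:.0f} GB")
# 3GB ANE limit ~= 1.6B params
# GPT-3 (175B) at fp16 ~= 326 GB
```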
- Everything we actually know about the Apple Neural Engine (ANE)
- What we know about the Apple Neural Engine
- Everything we know about the Apple Neural Engine (ANE)
My question too. This semi-answer on the page seems to contradict itself (source: https://github.com/hollance/neural-engine/blob/master/docs/p... ):
"> Can I program the ANE directly?
Unfortunately not. You can only use the Neural Engine through Core ML at the moment.
There currently is no public framework for programming the ANE. There are several private, undocumented frameworks but obviously we cannot use them as Apple rejects apps that use private frameworks.
(Perhaps in the future Apple will provide a public version of AppleNeuralEngine.framework.)"
The last part links to this bunch of headers:
https://github.com/nst/iOS-Runtime-Headers/tree/master/Priva...
So might it be more accurate to say you can program it directly, but you won't end up with something that can be distributed on the App Store?
What are some alternatives?
- Pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration
- Dual-Edge-TPU-Adapter - Dual Edge TPU Adapter to use it on a system with single PCIe port on m.2 A/B/E/M slot
- llama.cpp - LLM inference in C/C++
- pyllms - Minimal Python library to connect to LLMs (OpenAI, Anthropic, AI21, Cohere, Aleph Alpha, HuggingfaceHub, Google PaLM2), with a built-in model performance benchmark
- openpilot - openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Control for 250+ supported car makes and models.
- ANECompat - A tool which checks compatibility of a Core ML model with the Apple Neural Engine
- llama - Inference code for Llama models
- pytorch-apple-silicon-benchmarks - Performance of PyTorch on Apple Silicon
- tensorflow_macos - TensorFlow for macOS 11.0+ accelerated using Apple's ML Compute framework
- tensorexperiments - Boilerplate for GPU-Accelerated TensorFlow and PyTorch code on M1 Macbook
- GPTQ-for-LLaMa - 4-bit quantization of LLaMA using GPTQ
- more-ane-transformers - Run transformers (incl. LLMs) on the Apple Neural Engine