-
ml-ane-transformers
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Anyhow, you'll need the latest release of llama.cpp (here is the version that supports CUDA 12.1) and you'll also need version 12.1 of CUDA toolkit (that can be found here. Note that it's over 3 GB).
Apple ML team released a paper and repo last year ( Ane_transformers )that shows how to optimize transformer architecture for ANE use prior to converting a PyTorch model to CoreML.
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.