Is llama.cpp any good on ARM (e.g. Ampere Altra) or only on x86-64?

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA

  • ml-ane-transformers

    Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)

  • I'm also looking into converting these models from PyTorch to CoreML format and seeing how well they run when given access to the GPU and Neural Engine. Apple even has a library optimized specifically for this type of model.

  • llama.cpp

    LLM inference in C/C++

  • Performance on the RK3588 (FriendlyElec NanoPi R6s): https://github.com/ggerganov/llama.cpp/issues/722

