Llamafile 0.7 Brings AVX-512 Support: 10x Faster Prompt Eval Times for AMD Zen 4

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  • llamafile

    Distribute and run LLMs with a single file.

  • I just had a look at their GitHub, and true enough:

    “External weights are particularly useful for Windows users because they enable you to work around Windows' 4GB executable file size limit.

    For Windows users, here's an example for the Mistral LLM:

    curl -L -o llamafile.exe https://github.com/Mozilla-Ocho/llamafile/releases/download/...

    curl -L -o mistral.gguf https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.1-GGU...

    ./llamafile.exe -m mistral.gguf -ngl 9999

  • highway

    Performance-portable, length-agnostic SIMD with runtime dispatch

  • The bf16 dot instruction replaces 6 instructions: https://github.com/google/highway/blob/master/hwy/ops/x86_12...
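To make the bf16 comment concrete, here is a minimal sketch in plain Python of what a pairwise bf16 dot-product-accumulate computes for each 32-bit lane: two bf16 products are added into one float accumulator. It does not reproduce the Highway code behind the truncated link above. The helper names are hypothetical, the float-to-bf16 conversion uses simple truncation, and hardware rounding and denormal handling are not modelled.

```python
import struct


def float_to_bf16_bits(x: float) -> int:
    """Return the upper 16 bits of x's float32 encoding (truncation, an assumption)."""
    return struct.unpack("<I", struct.pack("<f", x))[0] >> 16


def bf16_bits_to_float(bits: int) -> float:
    """Widen a bf16 bit pattern to a float by placing it in the upper 16 bits."""
    return struct.unpack("<f", struct.pack("<I", (bits & 0xFFFF) << 16))[0]


def bf16_dot_accumulate(acc, a_bits, b_bits):
    """For each lane i: acc[i] + a[2i]*b[2i] + a[2i+1]*b[2i+1].

    acc holds one float per lane; a_bits and b_bits hold two bf16
    bit patterns per lane. Python floats are doubles, so per-lane
    float32 rounding is not reproduced here.
    """
    if len(a_bits) != 2 * len(acc) or len(b_bits) != 2 * len(acc):
        raise ValueError("expected two bf16 values per accumulator lane")
    out = []
    for i, lane in enumerate(acc):
        lo = bf16_bits_to_float(a_bits[2 * i]) * bf16_bits_to_float(b_bits[2 * i])
        hi = bf16_bits_to_float(a_bits[2 * i + 1]) * bf16_bits_to_float(b_bits[2 * i + 1])
        out.append(lane + lo + hi)
    return out


if __name__ == "__main__":
    a = [float_to_bf16_bits(v) for v in (1.0, 2.0)]
    b = [float_to_bf16_bits(v) for v in (3.0, 0.5)]
    print(bf16_dot_accumulate([0.0], a, b))  # 1*3 + 2*0.5 -> [4.0]
```

Written out with separate operations, each lane needs widening steps, multiplies, and adds, which is the kind of sequence a fused dot instruction collapses into one.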

NOTE: The number of mentions on this list indicates mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.


Related posts

  • Highway: C++ library that provides portable SIMD/vector intrinsics

    1 project | news.ycombinator.com | 8 Jun 2024
  • Permuting Bits with GF2P8AFFINEQB

    1 project | news.ycombinator.com | 27 Sep 2023
  • AMD EPYC 97x4 “Bergamo” CPUs: 128 Zen 4c CPU Cores for Servers, Shipping Now

    1 project | news.ycombinator.com | 24 Jun 2023
  • 10~17x faster than what? A performance analysis of Intel's x86-SIMD-sort (AVX-512)

    3 projects | news.ycombinator.com | 10 Jun 2023
  • The Most Useful Numbers You've Never Heard Of (Veritasium video on p-adic numbers)

    1 project | /r/math | 6 Jun 2023