Show HN: Speeding up LLM inference 2x times (possibly)

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • effort

    An implementation of bucketMul LLM inference

  • I think it was somewhere around that tag:

    https://github.com/kolinko/effort/releases/tag/5.0-last-mixt...

    Cannot rerun easily any more, because the underlying model/weight names changed in the meantime. It doesn't help that Mixtral's published .safetensor files seem messed up, and I needed to hack a conversion from pytorch - it added an extra layer of confusion into the project.

  • cria

    Tiny inference-only implementation of LLaMA (by recmo)

  • It originally started as a fork to Recmo’s cria pure numpy llama impl :)

    https://github.com/recmo/cria

    Took a whole night to compute a few

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Show HN: Route your prompts to the best LLM

    6 projects | news.ycombinator.com | 22 May 2024
  • Search Engines Down: DDG, Qwant, Startpage, Bing Whats Going On?

    1 project | news.ycombinator.com | 23 May 2024
  • OmniGlue: Generalizable Feature Matching with Foundation Model Guidance

    1 project | news.ycombinator.com | 23 May 2024
  • RateMyReads API

    1 project | dev.to | 23 May 2024
  • Nvidia revenue up 262%, thanks to the AI boom

    1 project | news.ycombinator.com | 23 May 2024