Fast Llama 2 on CPUs with Sparse Fine-Tuning and DeepSparse

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • deepsparse

    Sparsity-aware deep learning inference runtime for CPUs

  • Interesting company. Yannic Kilcher interviewed Nir Shavit last year and they went into some depth: https://www.youtube.com/watch?v=0PAiQ1jTN5k DeepSparse is on GitHub: https://github.com/neuralmagic/deepsparse

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • The future of quantization techniques in deep learning.

    1 project | /r/deeplearning | 14 Jun 2023
  • [D] Most efficient open source language model ?

    1 project | /r/MachineLearning | 23 Oct 2022
  • Sparseserver.ui – test the performance of Sparse Transformers

    1 project | news.ycombinator.com | 19 Apr 2022
  • [P] SparseServer.UI : A UI to test performance of Sparse Transformers

    2 projects | /r/MachineLearning | 19 Apr 2022
  • DeepSparse Engine

    1 project | /r/LanguageTechnology | 4 Apr 2022