Reaching LLaMA2 Performance with 0.1M Dollars

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Scout Monitoring - Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
www.scoutapm.com
featured
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
  • llama

    Inference code for Llama models

  • > JetMoE-8B is trained with less than $ 0.1 million1 cost but outperforms LLaMA2-7B from Meta AI, who has multi-billion-dollar training resources. LLM training can be much cheaper than people generally thought.

    They want you to read this as "we spent $100k compared to Meta's spending billions", but that's not actually what this says. It says that they spent $100k and Meta has the resources to spend billions if they wanted to.

    We don't know what Facebook spent on training LLaMA 2, but they say that it took them 184320 A100-80GB GPU-hours to train the 7B model [0]. AWS charges $14.46/hour for an instance that has 8 of those [1], which amounts to $1.81/GPU/hr.

    At that rate and assuming they paid something resembling AWS's list price, LLaMA 2 7B cost ~$333k. That's more than $100k, but not by orders of magnitude, and it's likely that Facebook wasn't paying the full price AWS is charging today.

    [0] https://github.com/meta-llama/llama/blob/main/MODEL_CARD.md#...

    [1] https://aws.amazon.com/ec2/instance-types/p4/

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • La Criptografia en l'Era de la Computació Quàntica i de la IA

    2 projects | dev.to | 1 Jun 2024
  • Show HN: 5x faster depth map generation using tensorrt inside comfyui

    1 project | news.ycombinator.com | 1 Jun 2024
  • Meltdown

    1 project | news.ycombinator.com | 1 Jun 2024
  • Napster Sparked a File-Sharing Revolution 25 Years Ago

    1 project | news.ycombinator.com | 1 Jun 2024
  • HuggingFace hacked – Space secrets leak disclosure

    1 project | news.ycombinator.com | 1 Jun 2024