TinyLlama project aims to pretrain a 1.1B Llama model on 3T tokens

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • TinyLlama

    The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

  • llama.cpp

    LLM inference in C/C++

  • Could this be used as a source of speculative tokens for larger llama models?, as per https://github.com/ggerganov/llama.cpp/pull/2926

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • airoboros

    Customizable implementation of the self-instruct paper.

  • hydra-moe

  • Thanks. Yes, I've seen airoboros, it aims to use a mixture of fine-tunes of the base model if I recall correctly. Not a truly pre-trained MOE, but could be useful.

    Hydra, is this it? https://github.com/SkunkworksAI/hydra-moe

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • More Low-Bit LLMs

    2 projects | news.ycombinator.com | 2 May 2024
  • Kolmogorov-Arnold Network for Reinforcement Leaning, Initial Experiments

    1 project | news.ycombinator.com | 2 May 2024
  • Create an AI prototyping environment using Jupyter Lab IDE with Typescript, LangChain.js and Ollama for rapid AI prototyping

    4 projects | dev.to | 2 May 2024
  • Show HN: FileKitty – Combine and label text files for LLM prompt contexts

    4 projects | news.ycombinator.com | 1 May 2024
  • Effortlessly Create an AI Dungeon Master Bot Using Julep and Chainlit

    1 project | dev.to | 1 May 2024