long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
LONGLLAMA: extending LLaMA's context length with FoT

One of the promises of our work is that FoT can be used to fine-tune already existing large models to extend their context length. In this section, we show that this is indeed the case. We use OpenLLaMA-3B and OpenLLaMA-7B models trained for 1T tokens as starting points and fine-tune them with FoT. We show that the resulting models, which we call LONGLLAMAs, are capable of extrapolating beyond their training context length (even up to 256K) and retain their performance on short-context tasks. We release the inference code on GitHub: https://github.com/CStanKonrad/long_llama and the LONGLLAMA-3B checkpoint on Hugging Face: https://huggingface.co/syzymon/long_llama_3b. We note that our checkpoint is backward compatible, i.e., it can be used with any existing LLaMA inference code (both in Hugging Face and other implementations), albeit without long-context capabilities.
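Since the checkpoint is published on Hugging Face, loading it with the standard transformers API can be sketched as below. This is a minimal sketch, not the official model-card recipe: the `load_longllama` helper and the `long_context` flag are our own illustration, and we assume that `trust_remote_code=True` is what pulls in the FoT long-context attention, while loading without it falls back to the backward-compatible plain-LLaMA path mentioned above.

```python
def load_longllama(long_context: bool = True):
    """Load the LONGLLAMA-3B checkpoint from Hugging Face.

    With long_context=True, trust_remote_code is assumed to pull in the
    repository's FoT attention implementation; with False, the checkpoint
    is treated as a plain LLaMA model (backward compatible, but without
    long-context capabilities).
    """
    # Imports are local so this sketch can be inspected (and its
    # signature tested) without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("syzymon/long_llama_3b")
    model = AutoModelForCausalLM.from_pretrained(
        "syzymon/long_llama_3b",
        trust_remote_code=long_context,  # assumed gate for the FoT code path
    )
    return tokenizer, model
```

In other words, existing LLaMA pipelines can consume the checkpoint unchanged; only callers that want the extended context need the repository's custom modeling code.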