[Discussion] Fine tune model for long context

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

memory-efficient-attention-pytorch

2 227 6.1 Python

Discontinued Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"

Check these efficient attention mechanism which are almost a drop in replacement: efficient attention flash attention

flash-attention

26 10,773 9.4 Python

Fast and memory-efficient exact attention

Check these efficient attention mechanism which are almost a drop in replacement: efficient attention flash attention

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project