Recursions-Are-All-You-Need VS transformers

Compare Recursions-Are-All-You-Need vs transformers and see what are their differences.

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
Recursions-Are-All-You-Need transformers
1 181
3 126,516
- 2.6%
2.9 10.0
about 1 month ago about 14 hours ago
Python Python
GNU General Public License v3.0 only Apache License 2.0
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

Recursions-Are-All-You-Need

Posts with mentions or reviews of Recursions-Are-All-You-Need. We have used some of these posts to build our list of alternatives and similar projects.
  • Recursions Are All You Need: Towards Efficient Deep Unfolding Networks
    1 project | /r/BotNewsPreprints | 10 May 2023
    The use of deep unfolding networks in compressive sensing (CS) has seen wide success as they provide both simplicity and interpretability. However, since most deep unfolding networks are iterative, this incurs significant redundancies in the network. In this work, we propose a novel recursion-based framework to enhance the efficiency of deep unfolding models. First, recursions are used to effectively eliminate the redundancies in deep unfolding networks. Secondly, we randomize the number of recursions during training to decrease the overall training time. Finally, to effectively utilize the power of recursions, we introduce a learnable unit to modulate the features of the model based on both the total number of iterations and the current iteration index. To evaluate the proposed framework, we apply it to both ISTA-Net+ and COAST. Extensive testing shows that our proposed framework allows the network to cut down as much as 75% of its learnable parameters while mostly maintaining its performance, and at the same time, it cuts around 21% and 42% from the training time for ISTA-Net+ and COAST respectively. Moreover, when presented with a limited training dataset, the recursive models match or even outperform their respective non-recursive baseline. Codes and pretrained models are available at https://github.com/Rawwad-Alhejaili/Recursions-Are-All-You-Need .

transformers

Posts with mentions or reviews of transformers. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-05-08.

What are some alternatives?

When comparing Recursions-Are-All-You-Need and transformers you can also consider the following projects:

fairseq - Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

sentence-transformers - Multilingual Sentence & Image Embeddings with BERT

llama - Inference code for Llama models

transformer-pytorch - Transformer: PyTorch Implementation of "Attention Is All You Need"

text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

huggingface_hub - The official Python client for the Huggingface Hub.

OpenNMT-py - Open Source Neural Machine Translation and (Large) Language Models in PyTorch

sentencepiece - Unsupervised text tokenizer for Neural Network-based text generation.

Swin-Transformer-Tensorflow - Unofficial implementation of "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" (https://arxiv.org/abs/2103.14030)

faiss - A library for efficient similarity search and clustering of dense vectors.

KoboldAI-Client

gpt-neo - An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.