perceiver-ar
google-research
Our great sponsors
perceiver-ar | google-research | |
---|---|---|
3 | 98 | |
225 | 32,733 | |
1.3% | 1.3% | |
0.0 | 9.6 | |
7 days ago | 2 days ago | |
Python | Jupyter Notebook | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
perceiver-ar
-
[D] Most important AI Paper´s this year so far in my opinion + Proto AGI speculation at the end
General-purpose, long-context autoregressive modeling with Perceiver AR - Deepmind 2022 Paper: https://arxiv.org/abs/2202.07765 Deepmind: https://www.deepmind.com/publications/perceiver-ar-general-purpose-long-context-autoregressive-generation Code: https://github.com/google-research/perceiver-ar
- GitHub - google-research/perceiver-ar
- [R] General-purpose, long-context autoregressive modeling with Perceiver AR - Deepmind 2022
google-research
-
Show HN: Next-token prediction in JavaScript – build fast LLMs from scratch
found the JS one..
https://github.com/google-research/google-research/tree/mast...
- Google Research website is down
-
Jpegli: A New JPEG Coding Library
The change was literally just made: https://github.com/google-research/google-research/commit/4a...
It appears this was in response to Hacker News comments.
- Multi-bitrate JPEG compression perceptual evaluation dataset 2023
-
Vector Databases: A Technical Primer [pdf]
There are options such as Google's ScaNN that may let you go farther before needing to consider specialized databases.
https://github.com/google-research/google-research/blob/mast...
-
Labs.Google
I feel it was unnecesary to create this because https://research.google/ already exists? It just seems like they want to take another URL with a "pure" domain name instead of psubdirectories, etc parts.
- Smerf: Streamable Memory Efficient Radiance Fields
-
Shisa 7B: a new JA/EN bilingual model based on Mistral 7B
You could also try some dedicated translation models like https://huggingface.co/facebook/nllb-moe-54b (or https://github.com/google-research/google-research/tree/master/madlad_400 for something smaller) and see how they do.
-
Translate to and from 400+ languages locally with MADLAD-400
Google released T5X checkpoints for MADLAD-400 a couple of months ago, but nobody could figure out how to run them. Turns out the vocabulary was wrong, but they uploaded the correct one last week.
- Mastering ROUGE Matrix: Your Guide to Large Language Model Evaluation for Summarization with Examples
What are some alternatives?
flash-attention - Fast and memory-efficient exact attention
qdrant - Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
flash-attention-jax - Implementation of Flash Attention in Jax
fast-soft-sort - Fast Differentiable Sorting and Ranking
RHO-Loss
faiss - A library for efficient similarity search and clustering of dense vectors.
CodeRL - This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
ml-agents - The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
EfficientZero - Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
Milvus - A cloud-native vector database, storage for next generation AI applications
XMem - [ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
struct2depth - Models and examples built with TensorFlow