LIMoE-pytorch
CoCa-pytorch
LIMoE-pytorch | CoCa-pytorch | |
---|---|---|
1 | 1 | |
46 | 984 | |
- | - | |
2.5 | 6.2 | |
about 2 months ago | 5 months ago | |
Python | Python | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
LIMoE-pytorch
-
Pix2tex: Using a ViT to convert images of equations into LaTeX code
Makes me wonder what the SOTA is for open source efforts along these lines.
I have heard about "mixture of experts" as being a potentially important advance, and also of course about multimodality. So I found this: https://github.com/YeonwooSung/LIMoE-pytorch
CoCa-pytorch
What are some alternatives?
nougat - Implementation of Nougat Neural Optical Understanding for Academic Documents
DALLE-pytorch - Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
latex2sympy - Parse LaTeX math expressions
RETRO-pytorch - Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
LaTeX-OCR - pix2tex: Using a ViT to convert images of equations into LaTeX code.
TimeSformer-pytorch - Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
PaLM-pytorch - Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
performer-pytorch - An implementation of Performer, a linear attention-based transformer, in Pytorch
x-clip - A concise but complete implementation of CLIP with various experimental improvements from recent papers
InternVideo - Video Foundation Models & Data for Multimodal Understanding
vectorrvnn - Data Driven method for hierarchical grouping of paths in Vector Graphics.