XMem
Cream
DISCONTINUED
 | XMem | Cream
---|---|---
Mentions | 10 | 3
Stars | 1,187 | 261
Growth | - | -
Activity | 7.9 | 8.2
Last commit | 17 days ago | almost 2 years ago
Language | Python | Python
License | GNU General Public License v3.0 only | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
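The relationship between an activity score and a project's rank can be sketched as follows. This is only an illustration: the page gives a single data point (9.0 corresponds to the top 10%), so the linear 0 to 10 scale and the helper name `top_percent` are assumptions, not the site's published formula.

```python
def top_percent(activity: float) -> float:
    """Estimate which top share (in percent) of tracked projects a score falls in.

    Assumption: the score is linear on a 0-10 scale, so 9.0 maps to the top 10%.
    The real ranking formula is not given on the page.
    """
    if not 0.0 <= activity <= 10.0:
        raise ValueError("activity is expected to lie between 0 and 10")
    return round((10.0 - activity) * 10.0, 1)


# Scores taken from the comparison table above.
for name, score in {"XMem": 7.9, "Cream": 8.2}.items():
    print(f"{name}: activity {score} -> roughly top {top_percent(score)}%")
```

Under that assumption, both projects sit somewhat below the "top 10%" threshold used in the example, with Cream slightly ahead of XMem.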
XMem
- Track-Anything: a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything and XMem.
  Nvm just found the occlusion video on https://github.com/hkchengrex/XMem holy shit
- [D] Most important AI Paper´s this year so far in my opinion + Proto AGI speculation at the end
  XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model (Added because of the Atkinson-Shiffrin Memory Model) Paper: https://arxiv.org/abs/2207.07115 Github: https://github.com/hkchengrex/XMem
- [D] Most Popular AI Research July 2022 pt. 2 - Ranked Based On GitHub Stars
- Most Popular AI Research July 2022 pt. 2 - Ranked Based On GitHub Stars
- I trained a neural net to watch Super Smash Bros
  Yeah MiVOS would speed up your tagging a lot. I also was curious if you saw XMem which just came out. I found that worked really well too.
- [R] Unicorn: 🦄 : Towards Grand Unification of Object Tracking(Video Demo)
  Have you check XMem?
Cream
- [D] Most Popular AI Research July 2022 pt. 2 - Ranked Based On GitHub Stars
- Most Popular AI Research July 2022 pt. 2 - Ranked Based On GitHub Stars
- [R] Rethinking and Improving Relative Position Encoding for Vision Transformer
  Code for https://arxiv.org/abs/2107.14222 found: https://github.com/microsoft/Cream/tree/main/iRPE
What are some alternatives?
SwinIR - SwinIR: Image Restoration Using Swin Transformer (official repository)
yolov7 - Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
flash-attention - Fast and memory-efficient exact attention
deeplab2 - DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a unified and state-of-the-art TensorFlow codebase for dense pixel labeling tasks.
multiface - Hosts the Multiface dataset, which is a multi-view dataset of multiple identities performing a sequence of facial expressions.
AutoML - This is a collection of our NAS and Vision Transformer work. [Moved to: https://github.com/microsoft/Cream]
NAFNet - The state-of-the-art image restoration model without nonlinear activation functions.
NUWA - A unified 3D Transformer Pipeline for visual synthesis
latent-diffusion - High-Resolution Image Synthesis with Latent Diffusion Models