maxim-pytorch
maxim
Our great sponsors
maxim-pytorch | maxim | |
---|---|---|
1 | 1 | |
159 | 940 | |
- | 3.0% | |
1.6 | 0.0 | |
about 1 year ago | 11 months ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
maxim-pytorch
-
Need help to reduce vision transformer patch artifacts
Hi there, i'm trying to do image enhancement task. Basically you give it some image and it makes it more colorful, vibrant, changes up some exposure and other things. I'm implementing it with Google Maxim architecture in pytorch. Unfortunately they dont include training code (original one is in JAX) and pyotorch only has the network, not training code.
maxim
-
GOOGLE new computer vision multi-axis approach improves high level tasks, such as object detection, as well as motion deblurring, denoising, deraining
Today we present a new multi-axis approach that is simple and effective, improves on the original ViT and MLP models, can better adapt to high-resolution, dense prediction tasks, and can naturally adapt to different input sizes with high flexibility and low complexity. Based on this approach, we have built two backbone models for high-level and low-level vision tasks. We describe the first in “MaxViT: Multi-Axis Vision Transformer”, to be presented in ECCV 2022, and show it significantly improves the state of the art for high-level tasks, such as image classification, object detection, segmentation, quality assessment, and generation. The second, presented in “MAXIM: Multi-Axis MLP for Image Processing” at CVPR 2022, is based on a UNet-like architecture and achieves competitive performance on low-level imaging tasks including denoising, deblurring, dehazing, deraining, and low-light enhancement. To facilitate further research on efficient Transformer and MLP models, we have open-sourced the code and models for both MaxViT and MAXIM.
What are some alternatives?
Restormer - [CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.
maxvit - [ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...
GIMP-ML - AI for GNU Image Manipulation Program
swin2sr - Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration. Advances in Image Manipulation (AIM) workshop ECCV 2022. Try it out! over 3.3M runs https://replicate.com/mv-lab/swin2sr
SwinIR - SwinIR: Image Restoration Using Swin Transformer (official repository)
friendship-globe
VRT - VRT: A Video Restoration Transformer (official repository)
Windows-terminal-context-menu - 📃 This is a simple script to add right click context menu for your best Windows terminal ❤