GOOGLE new computer vision multi-axis approach improves high level tasks, such as object detection, as well as motion deblurring, denoising, deraining

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

maxvit

1 421 0.0 Jupyter Notebook

[ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...

Today we present a new multi-axis approach that is simple and effective, improves on the original ViT and MLP models, can better adapt to high-resolution, dense prediction tasks, and can naturally adapt to different input sizes with high flexibility and low complexity. Based on this approach, we have built two backbone models for high-level and low-level vision tasks. We describe the first in “MaxViT: Multi-Axis Vision Transformer”, to be presented in ECCV 2022, and show it significantly improves the state of the art for high-level tasks, such as image classification, object detection, segmentation, quality assessment, and generation. The second, presented in “MAXIM: Multi-Axis MLP for Image Processing” at CVPR 2022, is based on a UNet-like architecture and achieves competitive performance on low-level imaging tasks including denoising, deblurring, dehazing, deraining, and low-light enhancement. To facilitate further research on efficient Transformer and MLP models, we have open-sourced the code and models for both MaxViT and MAXIM.

maxim

1 943 0.0 Python

[CVPR 2022 Oral] Official repository for "MAXIM: Multi-Axis MLP for Image Processing". SOTA for denoising, deblurring, deraining, dehazing, and enhancement.

Today we present a new multi-axis approach that is simple and effective, improves on the original ViT and MLP models, can better adapt to high-resolution, dense prediction tasks, and can naturally adapt to different input sizes with high flexibility and low complexity. Based on this approach, we have built two backbone models for high-level and low-level vision tasks. We describe the first in “MaxViT: Multi-Axis Vision Transformer”, to be presented in ECCV 2022, and show it significantly improves the state of the art for high-level tasks, such as image classification, object detection, segmentation, quality assessment, and generation. The second, presented in “MAXIM: Multi-Axis MLP for Image Processing” at CVPR 2022, is based on a UNet-like architecture and achieves competitive performance on low-level imaging tasks including denoising, deblurring, dehazing, deraining, and low-light enhancement. To facilitate further research on efficient Transformer and MLP models, we have open-sourced the code and models for both MaxViT and MAXIM.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Vector Databases for generative AI applications

1 project | dev.to | 6 May 2024
Fingerpaint – Draw using your laptop's touchpad

1 project | news.ycombinator.com | 6 May 2024
I analyzed code review best practices for a year. This is what I learned.

1 project | dev.to | 6 May 2024
From English Teacher to AWS Architect

1 project | dev.to | 6 May 2024
Pure-PyTorch Implementation of Kolmogorov-Arnold Network (Kan)

1 project | news.ycombinator.com | 6 May 2024

GOOGLE new computer vision multi-axis approach improves high level tasks, such as object detection, as well as motion deblurring, denoising, deraining

This page summarizes the projects mentioned and recommended in the original post on /r/AR_MR_XR Post date: 20 Sep 2022

maxvit

maxim

InfluxDB

Related posts

Vector Databases for generative AI applications

Fingerpaint – Draw using your laptop's touchpad

I analyzed code review best practices for a year. This is what I learned.

From English Teacher to AWS Architect

Pure-PyTorch Implementation of Kolmogorov-Arnold Network (Kan)