-
TensorFlow Models (GitHub: https://github.com/tensorflow/models) is a collection of diverse TensorFlow-based ML and DL models for tasks like image classification, object detection, and text classification.
-
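PyTorch Image Models (timm) (GitHub: https://github.com/huggingface/pytorch-image-models, moved from https://github.com/rwightman/pytorch-image-models) is a library of PyTorch image models, scripts, and pretrained weights, including popular architectures such as ResNet, EfficientNet, and Vision Transformer.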
-
YOLOv5 (GitHub: https://github.com/ultralytics/yolov5) is a fast, accurate object detection model with code for training, testing, deployment, and pre-trained weights.
-
Detectron2 (GitHub: https://github.com/facebookresearch/detectron2) is a Facebook AI Research library with state-of-the-art object detection and segmentation algorithms in PyTorch.
-
OpenCV (GitHub: https://github.com/opencv/opencv) is a widely used computer vision library with real-time image/video processing, feature extraction, and ML algorithms, with interfaces for several programming languages, including C++, Python, and Java.
-
Neural Style Transfer (GitHub: https://github.com/leongatys/PytorchNeuralStyleTransfer) is a PyTorch implementation for applying the style of one image to another while preserving content.
-
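CycleGAN (GitHub: https://github.com/junyanz/CycleGAN) is a deep learning approach to image-to-image translation that trains without paired examples, for instance generating photos from paintings or turning horses into zebras. The linked repository is the original Torch (Lua) implementation; a PyTorch version is available at https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix.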
-
Hugging Face Transformers (GitHub: https://github.com/huggingface/transformers) is a library of advanced NLP models, also supporting multimodal tasks like image captioning and visual question answering.