prismer
Qwen-VL
prismer | Qwen-VL | |
---|---|---|
5 | 4 | |
1,285 | 4,033 | |
-0.2% | 9.1% | |
5.2 | 8.5 | |
5 months ago | 10 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
prismer
-
[D] Tracking Dancing People
tracking with An Ensemble of Experts similar to this https://github.com/NVlabs/Prismer
-
Meet Prismer: An Open Source Vision-Language Model with An Ensemble of Experts
Quick Read: https://www.marktechpost.com/2023/03/11/meet-prismer-an-open-source-vision-language-model-with-an-ensemble-of-experts/ Paper: https://arxiv.org/pdf/2303.02506.pdf Code: https://github.com/nvlabs/prismer
- Prismer: A Vision-Language Model with Multi-Modal Experts
-
[R] Prismer: An Open Source Vision-Language Model with An Ensemble of Experts.
Code and Models - https://github.com/NVlabs/prismer
Qwen-VL
What are some alternatives?
Oscar - Oscar and VinVL
CogVLM - a state-of-the-art-level open visual language model | 多模态预训练模型
InternGPT - InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)
FaceFusion - Next generation face swapper and enhancer
CLIP-Caption-Reward - PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
Chinese-LLaMA-Alpaca - 中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
VehicleFinder-CTIM
chatgpt_academic - 为GPT/GLM提供图形交互界面,特别优化论文阅读润色体验,模块化设计支持自定义快捷按钮&函数插件,支持代码块表格显示,Tex公式双显示,新增Python和C++项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持清华chatglm等本地模型 [Moved to: https://github.com/binary-husky/gpt_academic]
ComfyUI - The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
opencopilot - 🕊️ Build and embed open-source AI Copilots into your product with ease
FLaNK-OpenAi - Chat
VPGTrans - Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.