Metric | Otter | Sophia
---|---|---
Mentions | 4 | 3
Stars | 3,447 | 361
Growth | - | -
Activity | 9.1 | 7.0
Last commit | about 2 months ago | 6 months ago
Language | Python | Python
License | MIT License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
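The page does not publish the formulas behind Growth and Activity, so the following is only a minimal sketch of how such metrics could be computed. The exponential decay, the 30-day half-life, the percentile mapping, and all function names (`monthly_growth`, `raw_activity`, `activity_scores`) are assumptions made for illustration, not the site's actual method.

```python
from datetime import date


def monthly_growth(stars_now, stars_prev):
    """Month-over-month star growth in percent (None if there is no baseline)."""
    if stars_prev == 0:
        return None
    return 100.0 * (stars_now - stars_prev) / stars_prev


def raw_activity(commit_dates, today, half_life_days=30.0):
    """Recency-weighted commit count.

    Assumption: each commit's weight halves every `half_life_days`,
    so recent commits count more than older ones.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_scores(raw_by_project):
    """Map raw values to a 0-10 scale by percentile rank.

    Assumption: a score of 9.0 means 90% of tracked projects have a
    lower raw value, i.e. the project is in the top 10%.
    """
    n = len(raw_by_project)
    scores = {}
    for name, raw in raw_by_project.items():
        below = sum(1 for other in raw_by_project.values() if other < raw)
        scores[name] = round(10.0 * below / n, 1)
    return scores
```

Under these assumptions the score is purely relative: a project's number can change when other tracked projects become more or less active, even if its own commit history stays the same.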
Otter
- OpenAI vs Google, Detect ChatGPT Content with 99% accuracy, Navigating AI compute costs
  - 👀 Video-LLaMA - Empower large language models with video and audio understanding capability.
  - 🦦 Otter - Multi-modal model with improved instruction-following and in-context learning ability.
  - 🔗 Linkly.AI - AI-powered lead analytics and management platform that helps you track, analyze, and streamline your leads in one place.
  - 🎬 Jet Cut Ready - AI plugin for Adobe Premiere Pro that automatically removes silent parts in videos.
  - 💬 HeyGen's ChatGPT Plugin - Convert text into high-quality videos using AI text and video generation.
- Multimodal models and "active" learning
- Otter: A Multi-Modal Model with In-Context Instruction Tuning
- Otter is a multi-modal model built on OpenFlamingo (an open-source version of DeepMind's Flamingo) and trained on a dataset of multi-modal instruction-response pairs. Otter demonstrates remarkable proficiency in multi-modal perception, reasoning, and in-context learning.
  The GitHub repo includes HuggingFace links to the model: https://github.com/Luodian/Otter
Sophia
- [D] Potential scammer on github stealing work of other ML researchers?
- [R] Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training
  GitHub repo
- The Sophia optimizer, a faster alternative to AdamW
  Code: https://github.com/kyegomez/Sophia
  Looking forward to trying it out this week
What are some alternatives?
LLaMA-Adapter - [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Adan-pytorch - Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch
NExT-GPT - Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Adan - Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Video-LLaMA - [EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
LOGICGUIDE - Plug in and Play implementation of "Certified Reasoning with Language Models" that elevates model reasoning by 40%
Awesome-Multimodal-Large-Language-Models - Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
tree-of-thoughts - Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
LinkedInGPT - Skynet
Sophia - The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
squeezelite-esp32 - ESP32 Music streaming based on Squeezelite, with support for multi-room sync, AirPlay, Bluetooth, Hardware buttons, display and more
deep-daze - Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun