chatgpt-raycast
Wav2Lip
Our great sponsors
chatgpt-raycast | Wav2Lip | |
---|---|---|
265 | 34 | |
204 | 9,257 | |
- | - | |
10.0 | 4.8 | |
over 1 year ago | 8 days ago | |
TypeScript | Python | |
MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
chatgpt-raycast
-
ChatGTP tools you may need - Work always in progress)
chatgpt-raycast: ChatGPT raycast extension
- I remember that season slightly differently.
-
ChatGPT just wrote me a song.
I am not musical but I asked ChatGPT to write me a pop song about a beautiful girl named Diane (my sister's name) just for fun.
- AI Chat is a pretty cool tool for DMs to get inspirational ideas.
-
Can't register: "The email you provided is not supported."
I've tried to register (https://chat.openai.com/) and I get the error message: "Oops! The email you provided is not supported. Please contact us through our help center if this issue persists."
-
Why is Hilo airport “ITO”? Nobody knows.
Check out ChatGPT if you have a chance while it's still free. Unlike Siri or Alexa the conversation is much more human like and you can ask it complex questions. Here's a decent article on it.
-
[Release] Media Hoarder v1.1.0 - AI movie recommendations powered by ChatGPT
ChatGPT, OpenAI's artificial intelligence chatbot actually knows one or two things about movies. It can confidently provide answers to queries like:
-
Show HN: Media Hoarder X ChatGPT
- "Provide a list of action movies where the protagonist is female and wields a shotgun and their IMDB IDs"
and ChatGPT's answers are quite spot on!
The next task was: How to integrate ChatGPT into Media Hoarder?
Media Hoarder runs on Electron which allows to fully control a browser window. So you can:
- open up a browser window and launch https://chat.openai.com
- Alguien que sepa de unity ?
-
Discovered ChatGPT3 two deys ago, cannot stop asking questions
Create an account here: http://chat.openai.com
Wav2Lip
-
Show HN: Sync (YC W22) – an API for fast and affordable lip-sync at scale
Hey HN, we’re sync. (https://synclabs.so/). We’re building fast + lightweight audio-visual models to create, modify, and understand humans in video.
You can check our more about us and our company in this video here: https://bit.ly/3TV27rd
Our first api lets you lip-sync a person in a video to an audio in any language in zero-shot. You can check out some examples here (https://bit.ly/3IT3UXk)
Here’s a demo showing how it works and how to sync your first video / audio: https://bit.ly/4ablRwo
Our playground + api is live, you can play with our models here: https://app.synclabs.so/
Four years ago we open-sourced Wav2lip (https://github.com/Rudrabha/Wav2Lip), the first model to lipsync anyone to any audio w/o having to train for each speaker. Even now, it’s the most prolific lipsyncing model to date (almost 9k GitHub stars).
Human lip-sync enables interesting features for many products – you can use it to seamlessly translate videos from one language to another, create personalized ads / video messages to send to your customers, or clone yourself so you never have to record a piece of content again.
We’re excited about this area of research / the models we’re building because they can be impactful in many ways:
[1] we can dissolve language as a barrier
check out how we used it to dub the entire 2-hour Tucker Carlson interview with Putin speaking fluent English: https://vimeo.com/914605299
imagine millions gaining access to knowledge, entertainment, and connection — regardless of their native tongue.
realtime at the edge takes us further — live multilingual broadcasts + video calls, even walking around Tokyo w/ a Vision Pro 2 speaking English while everyone else Japanese.
[2] we can move the human-computer interface beyond text-based-chat
keyboard / mice are lossy + low bandwidth. human communication is rich and goes beyond just the words we say. what if we could compute w/ a face-to-face interaction?
Many people get carried away w/ the fact LLMs can generate, but forget they can also read. The same is true for these audio/visual models — generation unlocks a portion of the use-cases, but understanding humans from video unlocks huge potential.
Embedding context around expressions + body language in inputs / outputs would help us interact w/ computers in a more human way.
[3] and more
powerful models small enough to run at the edge could unlock a lot:
eg.
-
Ideas to recreate audio
If your technically inclined you can use https://github.com/Rudrabha/Wav2Lip to sync the lip movements to the new audio.
-
How to make deep fake lip sync using Wav2Lip
This is the Github link : https://github.com/Rudrabha/Wav2Lip
-
Dark Brandon going hard
Video mapping onto Audio: Now you have Audio with coherent back and forth dialogue. To get the looped video puppets, you find a relatively stable interview clip (in this channel and many of Athenes other ones, the clips of the people just stay in one place). Then feed the audio + video clip into a lipsync algorithm like this https://bhaasha.iiit.ac.in/lipsync/
- Is it possible to sync a lip and facial expression animation with audio in real time?
-
A little bedtime story by the AI nanny | Stable Diffusion + GPT = a match made in latent space
It's not animating really, just lip sync and face restoration, here I used: https://github.com/Rudrabha/Wav2Lip and https://github.com/TencentARC/GFPGAN respectively.
-
Elevenlabs voice clone and janky avatarify with wav2lip added.
I just used the web based wav2lip demo. https://bhaasha.iiit.ac.in/lipsync/ Haven’t used the plan in a while, however the colab gives much better results. This was just a quick dusty example done all in the phone.
- retromash - The Tide is High / Thinking Out Loud (Blondie, Ed Sheeran)
-
Who knows how to create long-form & cheap AI avatar content? The three main platforms (Synthesia, Movio, & D-ID) all charge over $20 a month for ~ 15 minutes of content, but this TikTok user streamed for 90 hours… how did he pull that off?
https://github.com/Rudrabha/Wav2Lip Demo: https://youtu.be/0fXaDCZNOJc
- Video editing with AI
What are some alternatives?
ChatGPT - 🔮 ChatGPT Desktop Application (Mac, Windows and Linux)
stylegan2 - StyleGAN2 - Official TensorFlow Implementation
chatgpt-google-extension - A browser extension that enhance search engines with ChatGPT
Thin-Plate-Spline-Motion-Model - [CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
chatgpt-api - Node.js client for the official ChatGPT API. 🔥
first-order-model - This repository contains the source code for the paper First Order Motion Model for Image Animation
ChatGPT - Lightweight package for interacting with ChatGPT's API by OpenAI. Uses reverse engineered official API.
DeepFaceLive - Real-time face swap for PC streaming or video calls
ChatGPT.nvim - ChatGPT Neovim Plugin: Effortless Natural Language Generation with OpenAI's ChatGPT API
GFPGAN - GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
wundergraph - WunderGraph is a Backend for Frontend Framework to optimize frontend, fullstack and backend developer workflows through API Composition.
Real-Time-Voice-Cloning - Clone a voice in 5 seconds to generate arbitrary speech in real-time