ChatGPT can now see, hear, and speak – openai.com

SurveyJS - Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App

With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

surveyjs.io

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

awesome-talking-head-generation

2 1,124 6.9

As soon as they release the API, we can build an AI "bartender". Combine the voice output and input with NeRF talking heads such as from Diarupt or https://github.com/harlanhong/awesome-talking-head-generatio....
You will now be able to feed it images and responses of the customers. Give it a function to call complementaryDrink(customerId)

awesome-talking-head-generatio

2 - -

As soon as they release the API, we can build an AI "bartender". Combine the voice output and input with NeRF talking heads such as from Diarupt or https://github.com/harlanhong/awesome-talking-head-generatio....
You will now be able to feed it images and responses of the customers. Give it a function to call complementaryDrink(customerId)

SurveyJS

surveyjs.io featured

Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.
chatcraft.org

64 121 9.5 TypeScript

Developer-oriented ChatGPT clone

openai chatgpt seems to be stuck in a "Look, cool demo" mode.
1. According to demo, they seem to pair voice input with TTS output. What if I wanna use voice to describe a program I want it to write?
2. Furthermore, if you gonna do a voice assistant, why not go the full way with wake-words and VAD?
3. Not releasing it to everyone is potentially a way to create a hype cycle prior to users discovering that the multimodality is rather meh.
4. The bike demo could actually use visual feedback to see what it's talking about ala segment anything. It's pretty confusing to get a paragraph explanation of what tool to pick.
In my https://chatcraft.org, we added voice incrementally. So i can swap typing and voice. We can also combine it with function-calling, etc. We also use openai apis. Except in our case there is no weird waitlist. You pop in your api key and get access to voice input immediately.

talk

3 555 8.1 TypeScript

Let's make sand talk (by yacineMTB)

Also curious to hear about your setup. Using whisper too? When I was experimenting with it there was still a lot of annoyance about hallucinations and I was hard coding some "if last phrase is 'thanks for watching', ignore last phrase"
I was just googling a bit to see what's out there now for whisper/llama combos and came across this: https://github.com/yacineMTB/talk
There's a demo linked on the github page that seems relatively fast at responding conversationally, but still maybe 1-2 seconds at times. Impressive it's entirely offline.

willow

37 2,365 9.6 C

Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
whisper-live-transcription

3 85 7.9 Python

Live-Transcription (STT) with Whisper PoC

Here's a link to a project that claims half second latency for the transcription part: https://github.com/gaborvecsei/whisper-live-transcription

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Improve Download Speeds with Concurrency

2 projects | dev.to | 20 Apr 2024
Concluding OSD700

4 projects | dev.to | 20 Apr 2024
ChatCraft v2.0.0

2 projects | dev.to | 20 Apr 2024
ChatCraft week 14: Releasing v2.0!

2 projects | dev.to | 19 Apr 2024
Contributing to Open Source Project ChatCraft

1 project | dev.to | 18 Apr 2024

ChatGPT can now see, hear, and speak – openai.com

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Alexa Chat face-reenactment Deep Learning gpt-4
Post date: 25 Sep 2023

awesome-talking-head-generation

awesome-talking-head-generatio

SurveyJS

chatcraft.org

talk

willow

whisper-live-transcription

Related posts

Improve Download Speeds with Concurrency

Concluding OSD700

ChatCraft v2.0.0

ChatCraft week 14: Releasing v2.0!

Contributing to Open Source Project ChatCraft

ChatGPT can now see, hear, and speak – openai.com

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Alexa Chat face-reenactment Deep Learning gpt-4 Post date: 25 Sep 2023

awesome-talking-head-generation

awesome-talking-head-generatio

SurveyJS

chatcraft.org

talk

willow

whisper-live-transcription

Related posts

Improve Download Speeds with Concurrency

Concluding OSD700

ChatCraft v2.0.0

ChatCraft week 14: Releasing v2.0!

Contributing to Open Source Project ChatCraft

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Alexa Chat face-reenactment Deep Learning gpt-4
Post date: 25 Sep 2023