AudioGPT
AudioGPT | CML_AMP_LLM_Chatbot_Augmented_with_Enterprise_Data | |
---|---|---|
4 | 5 | |
9,796 | 45 | |
0.8% | - | |
3.7 | 5.2 | |
about 2 months ago | 23 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
AudioGPT
- FLiPN-FLaNK Stack Weekly May 8 2023
-
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Large language models (LLMs) have exhibited remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. Despite the recent success, current LLMs are not capable of processing complex audio information or conducting spoken conversations (like Siri or Alexa). In this work, we propose a multi-modal AI system named AudioGPT, which complements LLMs (i.e., ChatGPT) with 1) foundation models to process complex audio information and solve numerous understanding and generation tasks; and 2) the input/output interface (ASR, TTS) to support spoken dialogue. With an increasing demand to evaluate multi-modal LLMs of human intention understanding and cooperation with foundation models, we outline the principles and processes and test AudioGPT in terms of consistency, capability, and robustness. Experimental results demonstrate the capabilities of AudioGPT in solving AI tasks with speech, music, sound, and talking head understanding and generation in multi-round dialogues, which empower humans to create rich and diverse audio content with unprecedented ease. Our system is publicly available at \url{https://github.com/AIGC-Audio/AudioGPT}.
CML_AMP_LLM_Chatbot_Augmented_with_Enterprise_Data
What are some alternatives?
AudioLDM - AudioLDM: Generate speech, sound effects, music and beyond, with text.
graphic-walker - An open source alternative to Tableau. Embeddable visual analytic
highstorm - Open Source Event Monitoring
FLiPStackWeekly - FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
thinkgpt - Agent techniques to augment your LLM and push it beyong its limits
123elf - A native port of Lotus 1-2-3 to Linux.
Discord-Chatbot-Gpt4Free - This is a Discord Chatbot with image detection, OCR, internet access and DALL-E image generation for free [Moved to: https://github.com/mishalhossin/Discord-AI-Chatbot]
vscode-openai-code-analyzer - Analyze code with OpenAI
sitemap2feed - Convert an online sitemap to Atom, RSS and JSON feeds
pranadb