Embedding cost and model choice makes this a very compelling choice. I'm working on leveraging embeddings in https://github.com/discourse/discourse-ai where it powers offering related topics, semantic search, tag and category recommendations among other things.
A cheap offering like this can make it a lot more reasonable for self-hosters.
Cross-Platform, GPU Accelerated Whisper 🏎️
Whisper large is only 1.5B params, why not run it client side with something like https://github.com/FL33TW00D/whisper-turbo
(Disclaimer: I am the author)
Collect and Analyze Billions of Data Points in Real Time. Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge.
This is the home of the Beam CLI binaries, as well as a collection of example apps built with Beam
Serverless only works if the cold boot is fast. For context, my company runs a serverless cloud GPU product called https://beam.cloud, which we've optimized for fast cold start. We see Whisper in production cold start in under 10s (across model sizes). A lot of our users are running semi-real time STT, and this seems to be working well for them.
ChatGPT refuses to create a poem admiring Donald Trump but creates a poem and admires Joe Biden. ChatGPT is built in with political biases.
1 project | /r/walkaway | 31 Jan 2023
MLflow VS VevestaX - a user suggested alternative
2 projects | 12 May 2022
Here's a repository where I try to keep up with the most interesting research papers of 2022. It is a curated list of the latest breakthroughs in AI and Data Science by release date with a clear video explanation, link to a more in-depth article, and code (if applicable).
2 projects | /r/artificial | 9 May 2022
[P] Rubrix: Open-source Python framework for NLP data annotation, exploration, and monitoring
2 projects | /r/MachineLearning | 13 Sep 2021
“AI” Demystified: A Decompiler
1 project | news.ycombinator.com | 3 Sep 2021