Our great sponsors
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
Related to the actual OpenAI announcement, I've been able to generate some preliminary code editing evaluations of the new GPT models. OpenAI is enforcing very low rate limits on the new GPT-4 model. I will update the results as quickly my rate limit allows.
https://news.ycombinator.com/item?id=38172621
Also, aider now supports these new models, including `gpt-4-1106-preview` with the massive 128k context window.
https://github.com/paul-gauthier/aider/releases/tag/v0.17.0
The documentation/READMEs in the GitHub repo was updated to play nice with the new v1.0.0 of the package: https://github.com/openai/openai-python/
Open researchers are trying to shrink and speed up 138K models e.g. YaRN https://github.com/jquesnelle/yarn
It's very compelling and opens up a lot of use cases, so I've been keeping an eye out for advancements.
>How do you detect speech starting and stopping?
https://github.com/snakers4/silero-vad
Related posts
- Whisper - A new free AI model from OpenAI that can transcribe Japanese (and many other languages) at up to "human level" accuracy
- [P] A more detailed post about Silero VAD on The Gradient
- Silero VAD: pre-trained enterprise-grade voice activity detector
- [P] Silero VAD: One voice detector to rule them all
- [Discussion] Video Translation Task