dograh
strawberry
| dograh | strawberry | |
|---|---|---|
| 7 | 2 | |
| 4,249 | 0 | |
| 90.4% | - | |
| 9.8 | - | |
| 5 days ago | about 1 month ago | |
| Python | Python | |
| BSD 2-clause "Simplified" License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
dograh
-
OpenAI delivers low-latency voice AI at scale
If you like Pipecatโs focus on speed, you might also try out our open source, which comes with all the batteries included (knowledge base, telephony/SIP, variables, BYOK any LLM STT TTS, Speech to Speech, etc )
And it's fully OSS- like n8n for voice AI, and you can use it with OpenClaw or Claude code - recently launched MCPs.Github- https://github.com/dograh-hq/dograh, Youtube -https://www.youtube.com/watch?v=sxiSp4JXqws&list=PLDqzGuN7B1...
-
4 open-source tools to build production-ready AI voice agents ๐๏ธ๐
I've built voice agents before, but when it came to shipping them for production, I couldn't find a platform that worked quickly in 2 minutes - until we started building Dograh. It's an open-source voice AI platform with a visual workflow builder, built-in telephony, and post-call analytics out of the box. Alternative to Vapi, Retell, and Bland, but self-hostable and BSD-2 licensed. You get a canvas where you connect nodes instead of writing Python, so prompt tweaks don't mean a redeploy. Voicemail detection, call transfer, variable extraction, knowledge base, and CRM connectors all come standard. Same feature set whether you self-host or use the managed cloud. It has native support for BYOK (bring your own key) across every layer. Deepgram or Whisper for STT, ElevenLabs or Kokoro for TTS, and any LLM for the brain. Want to run everything locally? Swap in self-hosted models through the UI, no code required. Check it. https://docs.dograh.com/getting-started Youtube link: https://www.youtube.com/watch?v=sxiSp4JXqws Star the Dograh repo โญ โ https://github.com/dograh-hq/dograh
- Show HN: Dograh โ voice agents that pick Recordings over TTS using LLM
-
We analyzed 10,000 voice AI calls. The LLM was rarely the problem.
We built Dograh OSS, an open-source voice AI platform. When we started, we assumed most failures would come from the LLM - bad answers, missed intent, prompt edge cases. So we spent a lot of early effort there.
- Show HN: We open sourced Vapi โ UI included
- Show HN: Dograh โ an OSS Vapi alternative to quickly build and test voice agents
- Is there open source alternative for VAPI or retellai?
strawberry
-
OpenAI delivers low-latency voice AI at scale
The whole setup works on my M2 MacBook Pro with 16 GB RAM. I use Gemma 4B via LiteRT-LM.
I've found that LiteRT-LM has a much lower DRAM footprint than Ollama. I've also made tons of optimizations in the code - for eg, you can do quite a bit with a 16k context window for a voice assistant while managing a good footprint, so I keep track of the token usage and then perform an auto-compaction after a while. I use sub-agents and only do deep-think calls with them, so the context window is separated out. In a multi-turn conversation, if Gemma 4 directly processes audio input, the KV cache fills up within a few turns, so I channel it all via Whisper.
I did not want to use openWakeWord or Picovoice because they had limitations on which wake word you could choose. Alternative was to train a model of my own. So I created my own wake word detection pipeline using Whisper Tiny - works surprisingly well: https://github.com/pncnmnp/strawberry/blob/main/main.py#L143...
Also, I have VAD going with smart turn v3 (like I mentioned above) + I use browser/websocket for AEC + Barge-in (https://github.com/pncnmnp/strawberry/blob/main/audio_ws.py).
I'm using the MacBook's built-in microphones for this, though, and I haven't fully tested it with other microphones. I've been ironing out the rough edges on a daily basis. I should write a quick blog on this too.
What are some alternatives?
pronghorn - Fast, low-latency voice assistant protocol. Wire-level UDP streaming replacement for Wyoming.
pipecat - Open Source framework for voice and multimodal conversational AI
webrtc-zero-downtime-restart - A playground to make WebRTC easier to deploy, safer and more robust