Whisper VS beaker

Compare Whisper vs beaker and see what are their differences.

Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model (by Const-me)

beaker

An experimental peer-to-peer Web browser (by beakerbrowser)
WorkOS - The modern identity platform for B2B SaaS
The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
workos.com
featured
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
Whisper beaker
32 36
7,182 6,703
- -
6.5 0.0
7 months ago over 1 year ago
C++ JavaScript
Mozilla Public License 2.0 MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

Whisper

Posts with mentions or reviews of Whisper. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-17.
  • Nvidia Speech and Translation AI Models Set Records for Speed and Accuracy
    1 project | news.ycombinator.com | 18 Apr 2024
    I've been using WhisperDesktop ( https://github.com/Const-me/Whisper ) with great success on a 3090 for fast & accurate transcription of often poor quality euro-english hours long multispeaker audio files. If there's an easy way to compare I'm certainly going to give this a try.
  • AMD's CDNA 3 Compute Architecture
    7 projects | news.ycombinator.com | 17 Dec 2023
    Why would you want OpenCL? Pretty sure D3D11 compute shaders gonna be adequate for a Torch backend, and they even work on Linux with Wine: https://github.com/Const-me/Whisper/issues/42 Native Vulkan compute shaders would be even better.

    Why would you want unified address space? At least in my experience, it’s often too slow to be useful. DMA transfers (CopyResource in D3D11, copy command queue in D3D12, transfer queue in VK) are implemented by dedicated hardware inside GPUs, and are way more efficient.

  • Amazon Bedrock Is Now Generally Available
    2 projects | news.ycombinator.com | 28 Sep 2023
    https://github.com/ggerganov/whisper.cpp

    https://github.com/Const-me/Whisper

    I had fun with both of these. They will both do realtime transcription. Bit you will have to download the training data sets…

  • Why Nvidia Keeps Winning: The Rise of an AI Giant
    3 projects | news.ycombinator.com | 6 Jul 2023
    Gamers don’t care about FP64 performance, and it seems nVidia is using that for market segmentation. The FP64 performance for RTX 4090 is 1.142 TFlops, for RTX 3090 Ti 0.524 TFlops. AMD doesn’t do that, FP64 performance is consistently better there, and have been this way for quite a few years. For example, the figure for 3090 Ti (a $2000 card from 2022) is similar to Radeon RX Vega 56, a $400 card from 2017 which can do 0.518 TFlops.

    And another thing: nVidia forbids usage of GeForce cards in data centers, while AMD allows that. I don’t know how specifically they define datacenter, whether it’s enforceable, or whether it’s tested in courts of various jurisdictions. I just don’t want to find out answers to these questions at the legal expenses of my employer. I believe they would prefer to not cut corners like that.

    I think nVidia only beats AMD due to the ecosystem: for GPGPU that’s CUDA (and especially the included first-party libraries like BLAS, FFT, DNN and others), also due to the support in popular libraries like TensorFlow. However, it’s not that hard to ignore the ecosystem, and instead write some compute shaders in HLSL. Here’s a non-trivial open-source project unrelated to CAE, where I managed to do just that with decent results: https://github.com/Const-me/Whisper That software even works on Linux, probably due to Valve’s work on DXVK 2.0 (a compatibility layer which implements D3D11 on top of Vulkan).

  • Ask HN: What is your recommended speech to text/audio transcription tool?
    1 project | news.ycombinator.com | 12 Jun 2023
    Currently, I use a GUI for Whisper AI (https://github.com/Const-me/Whisper) to upload MP3s of interviews to get text transcripts. However, I'm hoping to find another tool that would recognize and split out the text per speaker.

    Does such a thing exist?

  • Da audio a testo, consigli?
    1 project | /r/Universitaly | 8 Jun 2023
  • Ask HN: Any recommendations for cheap, high-quality transcription software
    2 projects | news.ycombinator.com | 29 May 2023
    I just used Whisper over the weekend to transcribe 5 hours of meeting, worked nicely and it can be run on a single GPU locally. https://github.com/ggerganov/whisper.cpp

    There are a few wrappers available with GUI like https://github.com/Const-me/Whisper

  • Voice recognition software for German
    2 projects | /r/software | 20 May 2023
  • Const-me/Whisper: High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
    1 project | /r/thirdbrain | 15 May 2023
  • I built a massive search engine to find video clips by spoken text
    3 projects | /r/videos | 10 May 2023

beaker

Posts with mentions or reviews of beaker. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-02-29.
  • Can We Get More Decentralised Than the Fediverse?
    2 projects | news.ycombinator.com | 29 Feb 2024
    For me, the peak of decentralization efforts were Beaker Browser [1] and Stealth [2].

    But one project didn't make enough money and the author of the other one got doxxed into oblivion, so I guess we can't have nice things.

    A peer to peer browser has so much potential, I wish somebody else might give it a try.

    [1] https://github.com/beakerbrowser/beaker

    [2] https://github.com/tholian-network/stealth

  • Show HN: DiskerNet – Browse the Internet from Your Disk, Now Open Source
    3 projects | news.ycombinator.com | 16 Jul 2023
    I wanted to mention Beaker Browser, but sadly, it's been archived: https://github.com/beakerbrowser/beaker/blob/master/archive-...
  • The AT protocol is the most obtuse crock of s*
    9 projects | news.ycombinator.com | 9 May 2023
    AT proto has some significant similarities to Matrix:

    * Both are work by self-authenticating git-style replication of Merkle trees/DAGs

    * Both define strict data schemas for extensible sets of events (Matrix uses JSON schema - https://github.com/matrix-org/matrix-spec/tree/main/data/eve... and OpenAPI; AT uses Lexicons)

    * Both use HTTPS for client-server and server-server traffic by default.

    * Both are focused on decentralised composable reputation - e.g. https://matrix.org/blog/2020/10/19/combating-abuse-in-matrix... on the Matrix side, or https://paulfrazee.medium.com/the-anti-parler-principles-for... on the bluesky side, etc.

    * Both are designed as big-world communication networks. You don't have the server balkanisation that affects ActivityPub.

    * Both eschew cryptocurrency systems and incentives.

    There are some significant differences too:

    * Matrix aspires to be the secure communication layer for the open web.

    * AT aspires (i think) to be an open decentralised social networking protocol for the internet.

    * AT has portable identity by default. We've been working on this on Matrix (e.g. MSC1228 - https://github.com/matrix-org/matrix-spec-proposals/pull/122... and MSC2787 - https://github.com/matrix-org/matrix-spec-proposals/blob/nei...) and have a new MSC (and implementation on Dendrite) in progress right now which combines the best bits of MSC1228 & MSC2787 into something concrete, at last. In fact the proto-MSC is due to emerge today.

    * AT is proposing a asymmetrical federation architecture where user data is stored on Personal Data Servers (PDS), but indexing/fan-out/etc is done by Big Graph Servers (BGS). Matrix is symmetrical and by default federates full-mesh between all servers participating in a conversation, which on one hand is arguably better from a self-sovereignty and resilience perspective - but empirically has created headaches where an underpowered server joins some massive public chatroom and then melts. Matrix has improved this by steady optimisation of both protocol and implementation (i.e. adding lazy loading everywhere - e.g. https://matrix-org.github.io/synapse/latest/development/syna...), but formalising an asymmetrical architecture is an interesting different approach :)

    * AT is (today) focused on for public conversations (e.g. prioritising big-world search and indexing etc), whereas Matrix focuses both on private and public communication - whether that's public chatrooms with 100K users over 10K servers, or private encrypted group conversations. For instance, one of Matrix's big novelties is decentralised access control without finality (https://matrix.org/blog/2020/06/16/matrix-decomposition-an-i...) in order to enforce access control for private conversations.

    * Matrix also provides end-to-end encryption for private conversations by default, today via Double Ratchet (Olm/Megolm) and in the nearish future MLS (https://arewemlsyet.com). We're also starting to work on post quantum crypto.

    * Matrix is obviously ~7 years older, and has many more use cases fleshed out - whether that's native VoIP/Video a la Element Call (https://element.io/blog/introducing-native-matrix-voip-with-...) or virtual worlds like Third Room (https://thirdroom.io) or shared whiteboarding (https://github.com/toger5/TheBoard) etc.

    * AT's lexicon approach looks to be a more modular to extend the protocol than Matrix's extensible event schemas - in that AT lexicons include both RPC definitions as well as the schemas for the underlying datatypes, whereas in Matrix the OpenAPI evolves separately to the message schemas.

    * AT uses IPLD; Matrix uses Canonical JSON (for now)

    * Matrix is perhaps more sophisticated on auth, in that we're switching to OpenID Connect for all authentication (and so get things like passkeys and MFA for free): https://areweoidcyet.com

    * Matrix has an open governance model with >50% of spec proposals coming from the wider community these days: https://spec.matrix.org/proposals

    * AT has done a much better job of getting mainstream uptake so far, perhaps thanks to building a flagship app from day one (before even finishing or opening up the protocol) - whereas Element coming relatively late to the picture has meant that Element development has been constantly slowed by dealing with existing protocol considerations (and even then we've had constant complaints about Element being too influential in driving Matrix development).

    * AT backs up all your personal data on your client (space allowing), to aid portability, whereas Matrix is typically thin-client.

    * Architecturally, Matrix is increasingly experimenting with a hybrid P2P model (https://arewep2pyet.com) as our long-term solution - which effectively would end up with all your data being synced to your client. I'd assume bluesky is consciously avoiding P2P having been overextended on previous adventures with DAT/hypercore: https://github.com/beakerbrowser/beaker/blob/master/archive-.... Whereas we're playing the long game to slowly converge on P2P, even if that means building our own overlay networks etc: https://github.com/matrix-org/pinecone

    I'm sure there are a bunch of other differences, but these are the ones which pop to the top of my head, plus I'm far from an expert in AT protocol.

    It's worth noting that in the early days of bluesky, the Matrix team built out Cerulean (https://matrix.org/blog/2020/12/18/introducing-cerulean) as a demonstration to the bluesky team of how you could build big-world microblogging on top of Matrix, and that Matrix is not just for chat. We demoed it to Jack and Parag, but they opted to fund something entirely new in the form of AT proto. I'm guessing that the factors that went into this were: a) wanting to be able to optimise the architecture purely for social networking (although it's ironic that ATproto has ended up pretty generic too, similar to Matrix), b) wanting to be able to control the strategy and not have to follow Matrix's open governance model, c) wanting to create something new :)

    From the Matrix side; we keep in touch with the bluesky team and wish them the best, and it's super depressing to see folks from ActivityPub and Nostr throwing their toys in this manner. It reminds me of the unpleasant behaviour we see from certain XMPP folks who resent the existence of Matrix (e.g. https://news.ycombinator.com/item?id=35874291). The reality is that the 'enemy' here, if anyone, are the centralised communication/social platforms - not other decentralisation projects. And even the centralised platforms have the option of seeing the light and becoming decentralised one day if we play our parts well.

    What would be really cool, from my perspective, would be if Matrix ended up being able to help out with the private communication use cases for AT proto - as we obviously have a tonne of prior art now for efficient & audited E2EE private comms and decentralised access control. Moreover, I /think/ the lexicon approach in AT proto could let Matrix itself be expressed as an AT proto lexicon - providing interop with existing Matrix rooms (at least semantically), and supporting existing Matrix clients/SDKs, while using AT proto's ID model and storing data in PDSes etc. Coincidentally, this matches work we've been doing on the Matrix side as part of the MIMI IETF working group to figure out how to layer Matrix on top of other existing protocols: e.g. https://datatracker.ietf.org/doc/draft-ralston-mimi-matrix-t... and https://datatracker.ietf.org/doc/draft-ralston-mimi-matrix-m... - and if I had infinite time right now I'd certainly be trying to map Matrix's CS & SS APIs onto an AT proto lexicon to see what it looks like.

    TL;DR: I think AT proto is cool, and I wish that open projects saw each other as fellow travellers rather than competitors.

  • Ask HN: Those making $0/month or less on side projects – Show and tell
    95 projects | news.ycombinator.com | 27 Jan 2023
    it sounds a lot like you're reinventing what Beaker Browser had built on top of DAT, except that it could do more. For example, they made a distributed Twitter clone as a proof of concept, but folks actually started using it. Definitely included blogging stuff.

    Really cool stuff around taking sites and things other folks had built and using them as a basis for your new thing.

    https://github.com/beakerbrowser/beaker/

  • Secure Scuttlebutt
    5 projects | news.ycombinator.com | 23 Jan 2023
    As a long time patchwork user —April 2017 for the win…— that just recently quit, I could see how the multitude of half finished clients, deprecated functionality would get to that outcome.

    SSB is dead, other than the few trying to make a go financially at it, via either crowdfunding, NLnet grants, or VC.

    I've reverted to Web 1.0 blogging, with none of the bs that is consistent with using a archived client, focus on trying to fit a database into a mobile app — without regard to front end functionality.

    > When I look at Beaker, I think it was probably 50% easy. The initial demo took 2 weeks: 20%. It was a full website editor in about 2 months: 30%. The feedback was great: 50%. The users didn't stick: 50%. We got invited to talks which increased exposure: 51%. A few niche communities took an interest: 53%. Folks liked it enough to donate via OpenCollective and Patreon: 54%. You get the idea. Notably absent is "usage and retention went through the roof: 80%" and then "usage continued to grow for years: 100%."

    Everything that pfrazee wrote here about Beaker Browser at https://github.com/beakerbrowser/beaker/blob/master/archive-... is true for ssb.

  • Beaker Browser is now archived
    1 project | /r/hypeurls | 27 Dec 2022
    5 projects | news.ycombinator.com | 27 Dec 2022
    I'm sad to see this go, a remnant of another web which could have been. I actually spent a lot of time playing with Beaker and hacking it up for my own purposes.

    We actually had a discussion a few years ago where I made a suggestion about change to the default behavior. At the time, you made a perfectly valid response and declined my suggestion, but I'm curious if your thinking is the same today, given how things played out: https://github.com/beakerbrowser/beaker/issues/1444

  • Digital Commons
    6 projects | /r/solarpunk | 21 Aug 2022
    Beaker, Hybercore
  • Ask HN: What relatively new project/movement are you excited about?
    4 projects | news.ycombinator.com | 4 Aug 2022
    Disclosure: It's in Romanian, no cookies, no JS, no trackers

    Beaker Browser https://beakerbrowser.com/ seems dead, loved the concept but it's no longer updated

    Now that you've asked, nope, didn't found anything with a clear future on the "Web3" side of the internet. Vast majority make use of crypto/blockchain and IMHO blockchain is anything but not decentralization.

  • Triple Entry Blogging
    2 projects | news.ycombinator.com | 4 May 2022

What are some alternatives?

When comparing Whisper and beaker you can also consider the following projects:

whisper.cpp - Port of OpenAI's Whisper model in C/C++

ipfs - Peer-to-peer hypermedia protocol

whisper - Robust Speech Recognition via Large-Scale Weak Supervision

ufonet - UFONet - Denial of Service Toolkit

TransformerEngine - A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

pglet - Pglet - build internal web apps quickly in the language you already know!

just-an-email - App to share files & texts between your devices without installing anything

ZeroNet - ZeroNet - Decentralized websites using Bitcoin crypto and BitTorrent network

ggml - Tensor library for machine learning

pjproject - PJSIP project

cookwherever - Cook Wherever is an open source project to attempt to making cooking more accessible and engaging for everyone.

agregore-browser - A minimal browser for the distributed web (Desktop version)