sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation. (by google)

Sentencepiece Alternatives

Similar projects and alternatives to sentencepiece

  1. dalle-mini

    DALL·E Mini - Generate images from a text prompt

  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

  3. stylegan2-pytorch

    Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement

  4. llama.cpp

    LLM inference in C/C++

  5. text-generation-webui

    LLM UI with advanced features, easy setup, and multiple backend support.

  6. whisper.cpp

    Port of OpenAI's Whisper model in C/C++

  7. llama

    Inference code for Llama models

  8. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

  9. jukebox

    Code for the paper "Jukebox: A Generative Model for Music"

  10. tinygrad

    Discontinued. You like pytorch? You like micrograd? You love tinygrad! ❤️ [Moved to: https://github.com/tinygrad/tinygrad] (by geohot)

  11. KoboldAI

    KoboldAI is generative AI software optimized for fictional use, but capable of much more!

  12. tiktoken

    tiktoken is a fast BPE tokeniser for use with OpenAI's models.

  13. glide-text2im

    GLIDE: a diffusion-based text-conditional image synthesis model

  14. aphantasia

    CLIP + FFT/DWT/RGB = text to image/video

  15. tokenmonster

    Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript

  16. CTranslate2

    Fast inference engine for Transformer models

  17. tokenizers

    💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

  18. transformers

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

  19. community-events

    Place where folks can contribute to 🤗 community events

  20. bevy_retro

    Plugin pack for making 2D games with Bevy

  21. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better sentencepiece alternative or higher similarity.

sentencepiece reviews and mentions

Posts with mentions or reviews of sentencepiece. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-06-30.

Stats

Basic sentencepiece repo stats
23
11,074
4.1
13 days ago
