Pure Go implementation of OpenAI's tokenizer

This page summarizes the projects mentioned and recommended in the original post on /r/golang

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • tiktoken

    tiktoken is a fast BPE tokeniser for use with OpenAI's models.

  • I've just published a pure go version of OpenAI's tokenizer (tiktoken). If you are building applications in Go that uses any of OpenAI's API's you will quickly run into issues of requests being rejected due to inputs that exceed the max size–which is measured in tokens. This library can help you count tokens and/or split strings before you send any API requests.

  • tokenizer

    Pure Go implementation of OpenAI's tiktoken tokenizer

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • tokenizer-go

    A Go wrapper for GPT-3 token encode/decode. https://platform.openai.com/tokenizer

  • Note: there is another library that does something very similar but it isn't pure go, it uses a npm package and invokes nodejs to encode/decode.

  • tiktoken-go

    go version of tiktoken

  • It is great! I was searching for a such library several days ago. Unfortunately, I have already found another pure go port: https://github.com/pkoukk/tiktoken-go

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts