Go Text processing

Open-source Go projects categorized as Text processing

Top 23 Go Text processing Projects

  • GitHub repo micro-editor

    A modern and intuitive terminal-based text editor

    Project mention: I use the REAL text editor | reddit.com/r/linuxmemes | 2021-02-28

    I like micro

  • GitHub repo GoQuery

    A little like that j-thing, only in Go.

    Project mention: Travel Advisory Thursday | reddit.com/r/belgium | 2021-01-21

    Depends on your programming language of choice. In Go I like this one: https://github.com/PuerkitoBio/goquery

  • GitHub repo blackfriday

    Blackfriday: a markdown processor for Go

    Project mention: My stack will outlive yours | reddit.com/r/programming | 2021-01-07

    Not being able to re-use html templates was a major problem for me (Web components could solve this when they get rid of the need for JS to use them, which I think will soon happen), and I also needed easy source code highlighting as I mostly write about code. So I wrote a Go generator that did just what I needed and now write my blog posts mostly in markdown, with support for code highlighting thanks to Blackfriday and bfchroma... both of which are simple Go libraries which I "vendor" (copy the source into my own project, so to speak) so if they stop maintaining them, it doesn't affect me much or at all.

  • GitHub repo sh

    A shell parser, formatter, and interpreter with bash support; includes shfmt (by mvdan)

  • GitHub repo toml

    TOML parser for Golang with reflection. (by BurntSushi)

  • GitHub repo go-humanize

    Go Humans! (formatters for units to human friendly sizes)

  • GitHub repo bluemonday

    bluemonday: a fast golang HTML sanitizer (inspired by the OWASP Java HTML Sanitizer) to scrub user generated content of XSS

  • GitHub repo gofeed

    Parse RSS, Atom and JSON feeds in Go

  • GitHub repo xurls

    Extract urls from text

  • GitHub repo commonregex

    🍫 A collection of common regular expressions for Go (by mingrammer)

  • GitHub repo slug

    URL-friendly slugify with multiple languages support.

  • GitHub repo whatlanggo

    Natural language detection library for Go

  • GitHub repo Dataflow kit

    Extract structured data from web sites. Web sites scraping.

  • GitHub repo mxj

    Decode / encode XML to/from map[string]interface{} (or JSON); extract values with dot-notation paths and wildcards. Replaces x2j and j2x packages.

  • GitHub repo Koazee

    A StreamLike, Immutable, Lazy Loading and smart Golang Library to deal with slices.

  • GitHub repo gographviz

    Parses the Graphviz DOT language in golang

  • GitHub repo xpath

    XPath package for Golang, supports HTML, XML, JSON document query.

  • GitHub repo go-runewidth

    wcwidth for golang

  • GitHub repo htmlquery

    htmlquery is golang XPath package for HTML query.

    Project mention: XPath package for HTML Query, No third-party library dependencies | reddit.com/r/golang | 2020-12-30
  • GitHub repo gotext

    Go (Golang) GNU gettext utilities package

  • GitHub repo gotabulate

    Gotabulate - Easily pretty-print your tabular data with Go

  • GitHub repo go-edlib

    Golang string comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...

  • GitHub repo goq

    A declarative struct-tag-based HTML unmarshaling or scraping package for Go built on top of the goquery library

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2021-02-28.


What are some of the best open-source Text processing projects in Go? This list will help you:

Project Stars
1 micro-editor 16,308
2 GoQuery 9,857
3 blackfriday 4,660
4 sh 3,513
5 toml 3,425
6 go-humanize 2,529
7 bluemonday 1,779
8 gofeed 1,561
9 xurls 760
10 commonregex 726
11 slug 628
12 whatlanggo 477
13 Dataflow kit 454
14 mxj 453
15 Koazee 444
16 gographviz 414
17 xpath 370
18 go-runewidth 338
19 htmlquery 332
20 gotext 292
21 gotabulate 259
22 go-edlib 250
23 goq 190