Go Text processing

Open-source Go projects categorized as Text processing

Top 23 Go Text processing Projects

  1. micro-editor

    A modern and intuitive terminal-based text editor

    Project mention: Notepad++ is 21 years old | news.ycombinator.com | 2024-11-01

    Micro editor (https://micro-editor.github.io/) works best for me but it's terminal-based.

  3. GoQuery

    A little like that j-thing, only in Go.

  4. sh

    A shell parser, formatter, and interpreter with bash support; includes shfmt (by mvdan)

  5. blackfriday

    Blackfriday: a markdown processor for Go

  6. toml

    TOML parser for Golang with reflection. (by BurntSushi)

  7. go-humanize

    Go Humans! (formatters for units to human friendly sizes)

  8. goldmark

    :trophy: A markdown parser written in Go. Easy to extend, standard(CommonMark) compliant, well structured.

    Project mention: Create a Hugo Theme from Scratch - III | dev.to | 2024-10-15

    Hugo uses goldmark to render markdown, and render hooks allow developers to override the rendering of specific components. Currently supported are:

  10. bluemonday

    bluemonday: a fast golang HTML sanitizer (inspired by the OWASP Java HTML Sanitizer) to scrub user generated content of XSS

    Project mention: Don't try to sanitize input. Escape output. (2020) | news.ycombinator.com | 2024-07-13
  11. html-to-markdown

    ⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.

    Project mention: Show HN: We made an MCP Server so that Cursor can build anything from API Docs | news.ycombinator.com | 2025-03-24

    I'm frequently constructing context based on up-to-date docs using curl + html2markdown[0] and custom css selectors, which is extremely tedious. MCP servers for docs would be very useful for me.

    That said, I don't really expect the AI itself to come up with docs to read (maybe some day). I want it predominantly so I can manually reference it in my prompt (in e.g. the Zed assistant panel) like `/npmdocs packagename packageversion`.

    But even for AI "self-driven" use-cases, I primarily see the value in read-only MCP servers that provide more context, just in an "as-needed" way, instead of me putting it there explicitly.

    [0]: https://github.com/JohannesKaufmann/html-to-markdown

  12. gofeed

    Parse RSS, Atom and JSON feeds in Go

  13. lingua-go

    The most accurate natural language detection library for Go, suitable for short text and mixed-language text

  14. slug

    URL-friendly slugify with multiple languages support.

  15. xurls

    Extract urls from text

  16. frangipanni

    Program to convert lines of text into a tree structure.

  17. omniparser

    omniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc. (by jf-tech)

    Project mention: Omniparser: A native Golang ETL streaming parser and transform library | news.ycombinator.com | 2024-10-25
  18. commonregex

    🍫 A collection of common regular expressions for Go (by mingrammer)

  19. htmlquery

    htmlquery is an XPath query package for HTML, written in Go.

  20. xpath

    XPath package for Golang, supports HTML, XML, JSON document query.

  21. Dataflow kit

    Extract structured data from websites. Website scraping.

  22. whatlanggo

    Natural language detection library for Go

    Project mention: whatlanggo VS Panlingo - a user suggested alternative | libhunt.com/r/whatlanggo | 2025-02-03
  23. go-runewidth

    wcwidth for golang

  24. mxj

    Decode / encode XML to/from map[string]interface{} (or JSON); extract values with dot-notation paths and wildcards. Replaces x2j and j2x packages.

  25. gographviz

    Parses the Graphviz DOT language in golang

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source Text processing projects in Go? This list will help you:

# Project Stars
1 micro-editor 26,083
2 GoQuery 14,474
3 sh 7,704
4 blackfriday 5,543
5 toml 4,724
6 go-humanize 4,524
7 goldmark 4,009
8 bluemonday 3,368
9 html-to-markdown 2,813
10 gofeed 2,688
11 lingua-go 1,239
12 slug 1,237
13 xurls 1,217
14 frangipanni 1,199
15 omniparser 1,041
16 commonregex 893
17 htmlquery 762
18 xpath 715
19 Dataflow kit 678
20 whatlanggo 653
21 go-runewidth 634
22 mxj 624
23 gographviz 559
