Pix2Text

An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported. (by breezedeus)

Pix2Text Alternatives

Similar projects and alternatives to Pix2Text

  1. tesseract-ocr

    Tesseract Open Source OCR Engine (main repository)

  2. Nutrient

    Nutrient - The #1 PDF SDK Library. Bad PDFs = bad UX. Slow load times, broken annotations, clunky UX frustrates users. Nutrient’s PDF SDKs gives seamless document experiences, fast rendering, annotations, real-time collaboration, 100+ features. Used by 10K+ devs, serving ~half a billion users worldwide. Explore the SDK for free.

    Nutrient logo
  3. LaTeX-OCR

    22 Pix2Text VS LaTeX-OCR

    pix2tex: Using a ViT to convert images of equations into LaTeX code.

  4. LaTeX-OCR

    pix2tex: Using a ViT to convert images of equations into LaTeX code. (by katie-lim)

  5. spark

    Discontinued Arknights OCR tool to automatically create a detailed list of your operators. (by Meph1sto666)

  6. CnOCR

    1 Pix2Text VS CnOCR

    CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTorch/MXNet 的中文/英文 OCR Python 包。】

  7. Cloe

    4 Pix2Text VS Cloe

    Manga OCR snipping application for desktop

  8. deathcounter_ocr

    A python script which detects death messages by using OCR and displays a corrosponding deathcounter. Preconfigured for Elden Ring

  9. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  10. Calliar

    1 Pix2Text VS Calliar

    A dataset for online Arabic calligraphy. A collection of 2500 annotated calligraphic styles.

  11. PyMuPDF-Utilities

    1 Pix2Text VS PyMuPDF-Utilities

    Demos, examples and utilities using PyMuPDF

  12. PaddleOCR

    69 Pix2Text VS PaddleOCR

    Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better Pix2Text alternative or higher similarity.

Pix2Text discussion

Log in or Post with

Pix2Text reviews and mentions

Posts with mentions or reviews of Pix2Text. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-04-16.
  • How do I solve this?
    1 project | /r/LaTeX | 11 Jun 2023
    Use this: https://p2t.behye.com/
  • Github packages/Apps that are must have for Physicists using Linux
    3 projects | /r/AskPhysics | 16 Apr 2023
    I have recently discovered a few very helpful github packages which help me make notes while listening to lectures. These would be 1. pix2tex (allows you to scan an equation and convert it to latex) 2. pix2text (allows you to scan an equation with words in it and converts it to latex and text) 3. Tesseract (not really a physics related package, but it does allow me to copy notes from transcripts easily) 4. Mathpix an app that performs all the above mentioned operations better than the packages above, but one which ain't free.
  • Help with Project Pix2Text
    1 project | /r/LaTeX | 13 Mar 2023
    BTW, you can use this online webpage https://p2t.behye.com/ , which is powered by Pix2Text.
  • How to use the graphical interface of LatexOCR? How to use the Snipping tool?
    3 projects | /r/LaTeX | 4 Mar 2023
    Pix2Text itself trains a mathematical formula detection model to detect mathematical formulas contained in the images. The recognized mathematical formulas patches are handed over to LaTeXOCR for recognition, while the rest text parts are handed over to the OCR engine CnOCR for recognition. More info can be found here: https://github.com/breezedeus/pix2text
  • Pix2Text (P2T): a Free Alternative to Mathpix
    1 project | /r/LaTeX | 17 Feb 2023
    Pix2Text (P2T) is a free open-source Python replacement for Mathpix, and is now able to perform the core functions of Mathpix. Pix2Text supports the recognition of mixed images containing both text and formulas, returning results similar to Mathpix. Its text recognition supports Chinese and English.
  • A note from our sponsor - Nutrient
    nutrient.io | 16 Feb 2025
    Bad PDFs = bad UX. Slow load times, broken annotations, clunky UX frustrates users. Nutrient’s PDF SDKs gives seamless document experiences, fast rendering, annotations, real-time collaboration, 100+ features. Used by 10K+ devs, serving ~half a billion users worldwide. Explore the SDK for free. Learn more →

Stats

Basic Pix2Text repo stats
6
2,190
9.2
2 months ago

breezedeus/Pix2Text is an open source project licensed under MIT License which is an OSI approved license.

The primary programming language of Pix2Text is Jupyter Notebook.


Sponsored
Nutrient - The #1 PDF SDK Library
Bad PDFs = bad UX. Slow load times, broken annotations, clunky UX frustrates users. Nutrient’s PDF SDKs gives seamless document experiences, fast rendering, annotations, real-time collaboration, 100+ features. Used by 10K+ devs, serving ~half a billion users worldwide. Explore the SDK for free.
nutrient.io

Did you know that Jupyter Notebook is
the 13th most popular programming language
based on number of references?