Python data-extraction

Open-source Python projects categorized as data-extraction

Top 11 Python data-extraction Projects

  • flashtext

    Extract keywords from sentences or replace keywords in sentences.

  • Optimus

    Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark (by ironmussa)

  • parsera

    Lightweight library for scraping websites with LLMs

    Project mention: llm-scraper VS parsera - a user suggested alternative | libhunt.com/r/llm-scraper | 2024-10-16
  • hacker-news-digest

    Let ChatGPT Summarize Hacker News for You

    Project mention: HN Summary: Let ChatGPT Summarize Hacker News for You | news.ycombinator.com | 2024-09-02
  • PlotDigitizer

    A Python utility to digitize plots.

  • sayn

    Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).

  • superpipe

    Superpipe - optimized LLM pipelines for structured data

    Project mention: Show HN: Superpipe – optimized LLM pipelines for structured outputs | news.ycombinator.com | 2024-03-26
  • tinvois-parser

    Extract receipt info

  • JSONPATH

    A query expression for extracting data from JSON. (by linw1995)

  • Data Extractor

    Combine XPath, CSS Selectors and JSONPath for web data extraction.

  • Webtap.ai

    AI web-scraping Python library for efficient and reliable web scraping.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python data-extraction related posts

  • Scrapegraph-ai VS parsera - a user suggested alternative

    2 projects | 16 Oct 2024
  • HN Summary: Let ChatGPT Summarize Hacker News for You

    1 project | news.ycombinator.com | 2 Sep 2024
  • Lightweight library for scraping web-sites with LLMs

    1 project | news.ycombinator.com | 17 Aug 2024
  • What's the fun in writing on the internet anymore?

    1 project | news.ycombinator.com | 17 Feb 2024
  • Made an app that summarizes recent popular stories from Hacker News

    2 projects | news.ycombinator.com | 16 Nov 2023
  • Hi, can anyone tell me how to use this repository?

    1 project | /r/learnpython | 26 Oct 2023

Index

What are some of the best open-source data-extraction projects in Python? This list will help you:

#   Project             Stars
1   flashtext           5,597
2   Optimus             1,486
3   parsera               903
4   hacker-news-digest    689
5   PlotDigitizer         121
6   sayn                  121
7   superpipe             108
8   tinvois-parser         42
9   JSONPATH               41
10  Data Extractor         28
11  Webtap.ai              12

Did you know that Python is the 2nd most popular programming language based on number of mentions?