llm_data_parser

This is a proof-of-concept of using an LLM to find and extract meaningful data without parsing the html too much. (by repollo)

Llm_data_parser Alternatives

Similar projects and alternatives to llm_data_parser

  1. text-generation-webui

    A Gradio web UI for Large Language Models with support for multiple inference backends.

  2. Nutrient

    Nutrient – The #1 PDF SDK Library, trusted by 10K+ developers. Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrient’s SDK handles billion-page workloads - so you don’t have to debug PDFs. Used by ~1 billion end users in more than 150 different countries.

    Nutrient logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better llm_data_parser alternative or higher similarity.

llm_data_parser discussion

Log in or Post with

llm_data_parser reviews and mentions

Posts with mentions or reviews of llm_data_parser. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-04-22.
  • [P] I built a tool that auto-generates scrapers for any website with GPT
    3 projects | /r/MachineLearning | 22 Apr 2023
    I actually tried doing this with langchain and gpt-3 and upload it to github a week ago, you can find it here, https://github.com/repollo/llm_data_parser Is really crappy right now because I only wanted to show to rpilocator.com’s owner it was possible, since he’s having to go through each spider/scraper and update it every time a website gets modified. But really cool to see a whole platform for this very purpose! Would be cool to see support for multiple libraries, and programming languages!

Stats

Basic llm_data_parser repo stats
1
28
1.9
almost 2 years ago

The primary programming language of llm_data_parser is Python.


Sponsored
Nutrient – The #1 PDF SDK Library, trusted by 10K+ developers
Other PDF SDKs promise a lot - then break. Laggy scrolling, poor mobile UX, tons of bugs, and lack of support cost you endless frustrations. Nutrient’s SDK handles billion-page workloads - so you don’t have to debug PDFs. Used by ~1 billion end users in more than 150 different countries.
www.nutrient.io