[P] I built a tool that auto-generates scrapers for any website with GPT

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  1. llm_data_parser

    This is a proof-of-concept of using an LLM to find and extract meaningful data without parsing the html too much.

    I actually tried doing this with langchain and gpt-3 and upload it to github a week ago, you can find it here, https://github.com/repollo/llm_data_parser Is really crappy right now because I only wanted to show to rpilocator.com’s owner it was possible, since he’s having to go through each spider/scraper and update it every time a website gets modified. But really cool to see a whole platform for this very purpose! Would be cool to see support for multiple libraries, and programming languages!

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. text-generation-webui

    A Gradio web UI for Large Language Models with support for multiple inference backends.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Consent-O-Matic has been removed from the Chrome Web Store

    1 project | news.ycombinator.com | 16 Jul 2024
  • The internet used to be fun

    1 project | news.ycombinator.com | 12 Feb 2024
  • Block Cookie Banners on Firefox

    1 project | news.ycombinator.com | 7 Dec 2023
  • Tech workers demand high salaries despite hiring slowdown

    1 project | news.ycombinator.com | 15 Sep 2023
  • New Alien movie has wrapped filming ahead of 2024 release

    1 project | /r/movies | 5 Jul 2023