JavaScript Scraping

Open-source JavaScript projects categorized as Scraping

Top 14 JavaScript Scraping Projects

  • kuwala

    Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demograp

    Project mention: Show HN: GeoSage – A ETL Webtool for Geo and Demographics Data from the Open Web | news.ycombinator.com | 2023-10-05

    --> Google Trends Data for Regions (Coming Soon)

    The tool goes beyond our previously published CLI tool (https://github.com/kuwala-io/kuwala/tree/master/kuwala) by providing a hostable solution with a user-friendly interface. We have not open-sourced it yet but a demo is available here: https://geosage.kuwala.io/.

    Urban planners can utilize movement data to analyze foot traffic in different city zones. Marketers can leverage demographic data to tailor campaigns more effectively. Developers can build their apps on top of it.

    To round it up .... GeoSage brings...

    Unified Data Management: Access data from OSM, Facebook, and soon Google, all in one place.

  • gogoanime-api

    Anime Streaming, Discovery API made with Cheerio and Express. Uses data from Gogoanime

    Project mention: I created an anime website . | /r/developersIndia | 2023-06-29
  • SurveyJS

    Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

  • dark-knowledge

    😈📚 A curated library of research papers and presentations for counter-detection and web privacy enthusiasts.

    Project mention: Share some articles you've saved | /r/privsec_dev | 2023-04-28

    "A curated library of research papers and presentations for counter-detection and web privacy enthusiasts": https://github.com/prescience-data/dark-knowledge

  • quetre

    A libre front-end for Quora

    Project mention: How to browse Reddit, Youtube, Quora and twitter in this period of Bac exams without vpn | /r/algeria | 2023-06-12

    Use frontend alternitive websites: Reddit > https://libreddit.domain.glass/ | Youtube > https://docs.invidious.io/instances/ or https://piped.video | Quora > https://quetre.iket.me/ | Twitter > https://nitter.net/ | for linux users use these apps: Smplayer+smtube (youtube) available on Flathub and AUR or Freetube available Flathub and AUR | for android users: newpipe or libretube (youtube) | for more apps for linux, android and windows and more links visit this link for more information about frontend alternative websites watch Eric Murphy's video

  • humanparser

    Parse a human name string into salutation, first name, middle name, last name, suffix.

  • amazon_scraper

    Amazon products scraper with using of rotating proxies and headless Chrome from ScrapingAnt

  • instagram-without-api-node

    A simple Node.js code to get unlimited instagram public pictures by every user without api, without credentials.

    Project mention: Saving Instagram images automatically every hour with Node.js or PHP | news.ycombinator.com | 2023-04-20

    actually the JSON I get from Instagram (by https://github.com/orsifrancesco/instagram-without-api-node) just gives me the images ordered by size.. the first one of the array is always the biggest (but the quality is never amazing)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • garlic

    🧄🧛 protect your website from being scraped by bots. (by velocitatem)

  • sniffagrammers

    Node.js and PHP files to automatically downloading pictures from instagram by https://orsi.me/sniffagram

    Project mention: Saving Instagram images automatically every hour with Node.js or PHP | /r/hypeurls | 2023-04-20
  • bard-unofficial-api

    Google's Bard ChatBot Unofficial NodeJS API

    Project mention: Trying to get "SNlM0e" data from request response, works on local machine but not on GCP VM instance? | /r/googlecloud | 2023-06-05

    The repo that I am using: https://github.com/AdamSEY/bard-unofficial-api

  • Web-Scraper

    Simple Web scraping app to scrape all the Indian Presidents (Name and Birthdays) present on Wikipedia. (by Garima-sharma814)

  • twitter-image-downloader

    This is a tool to download images posted/retweeted from a Twitter user's timeline. (by Kimkykie)

  • memer-telegram-bot

    Memer Telegram Bot - Search & Create memes!

  • viddy

    Find DOM elements using an expressive query syntax, extract text and monitor changes. (by shuckster)

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-10-05.

JavaScript Scraping related posts

Index

What are some of the best open-source Scraping projects in JavaScript? This list will help you:

Project Stars
1 kuwala 755
2 gogoanime-api 649
3 dark-knowledge 495
4 quetre 398
5 humanparser 92
6 amazon_scraper 73
7 instagram-without-api-node 61
8 garlic 50
9 sniffagrammers 38
10 bard-unofficial-api 22
11 Web-Scraper 3
12 twitter-image-downloader 2
13 memer-telegram-bot 0
14 viddy 0
The modern identity platform for B2B SaaS
The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
workos.com