Skyscraper Alternatives

Similar projects and alternatives to skyscraper

Playwright

382 61,953 9.9 TypeScript skyscraper VS Playwright

Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
aider

64 9,705 9.9 Python skyscraper VS aider

aider is AI pair programming in your terminal
InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
cheerio

50 27,801 9.7 TypeScript skyscraper VS cheerio

The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
colly

39 22,205 5.7 Go skyscraper VS colly

Elegant Scraper and Crawler Framework for Golang
rod

20 4,808 7.9 Go skyscraper VS rod

A Devtools driver for web automation and scraping
Nokogiri

20 6,108 9.4 C skyscraper VS Nokogiri

Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby.
shot-scraper

16 1,535 7.1 Python skyscraper VS shot-scraper

A command-line utility for taking automated screenshots of websites
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
CIEL

13 143 6.7 Common Lisp skyscraper VS CIEL

CIEL Is an Extended Lisp. Scripting with batteries included.
roswell

11 1,742 4.9 Common Lisp skyscraper VS roswell

intended to be a launcher for a major lisp environment that just works.
grub-2.0

4 19 0.0 Python skyscraper VS grub-2.0

Grub is an AI powered Web crawler.
WebDumper

2 131 0.0 TypeScript skyscraper VS WebDumper

A tool for scraping, dumping and unpacking (webpacked) javascript source files.
ChromeController

1 209 3.3 Python skyscraper VS ChromeController

Comprehensive wrapper and execution manager for the Chrome browser using the Chrome Debugging Protocol.
reaver

1 144 10.0 Clojure skyscraper VS reaver

A Clojure library for extracting data from HTML.
hickory

1 622 4.8 Clojure skyscraper VS hickory

HTML as data (by clj-commons)
backup-scripts

6 197 10.0 Clojure skyscraper VS backup-scripts

The various scripts I use to back up my home computers using ssh and rsync (by eamonnsullivan)
babashka-sql-pods

2 77 4.8 Clojure skyscraper VS babashka-sql-pods

Babashka pods for SQL databases
pypandoc

5 821 6.8 Python skyscraper VS pypandoc

Thin wrapper for "pandoc" (MIT)
bootleg

2 253 10.0 JavaScript skyscraper VS bootleg

Simple template processing command line tool to help build static websites (by retrogradeorbit)
pod-registry

2 87 8.1 Clojure skyscraper VS pod-registry

Pod manifests describe where pods can be downloaded, etc.
.dotfiles

1 2 6.4 Lua skyscraper VS .dotfiles

My dotfiles (by TimDeve)
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better skyscraper alternative or higher similarity.

Suggest an alternative to skyscraper

skyscraper reviews and mentions

Posts with mentions or reviews of skyscraper. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-02-20.

Web Scraping in Python – The Complete Guide
11 projects | news.ycombinator.com | 20 Feb 2024

Yes!
My Clojure scraping framework [0] facilitates that kind of workflow, and I’ve been using it to scrape/restructure massive sites (millions of pages). I guess I’m going to write a blog post about scraping with it at scale. Although it doesn’t really scale much above that – it’s meant for single-machine loads at the moment – it could be enhanced to support that kind of workflow rather easily.
[0]: https://github.com/nathell/skyscraper
Babashka: GraalVM Helped Create a Scripting Environment for Clojure
10 projects | news.ycombinator.com | 8 Dec 2022

I plan to port my scraping framework (Skyscraper, https://github.com/nathell/skyscraper) to babashka one day. I’m not sure how easy it will be, though, since it uses core.async (which I believe bb has limited support for) and SQLite via clojure.java.jdbc.
Mastering Web Scraping in Python: Crawling from Scratch
6 projects | news.ycombinator.com | 11 Aug 2021

I’ve done a fair share of scraping, and I learned that on a large scale, there are a lot of cross-cutting repetitive concerns. Things like caching, fetching HTML (preferably in parallel), throttling, retries, navigation, emitting the output as a dataset…
My library, Skyscraper [0], attempts to help with these. It’s written in Clojure (based on Enlive or Reaver, both counterparts to Beautiful Soup), but the principles should be readily transferable everywhere.
[0]: https://github.com/nathell/skyscraper
A note from our sponsor - InfluxDB
www.influxdata.com | 7 May 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Stats

Basic skyscraper repo stats

Mentions

Stars

401

Activity

4.9

Last Commit

10 months ago

The primary programming language of skyscraper is Clojure.

Popular Comparisons