warc-proxy

Serving content from a WARC (by alard)

Warc-proxy Alternatives

Similar projects and alternatives to warc-proxy

  • Scrapy

    Scrapy, a fast high-level web crawling & scraping framework for Python.

  • mitmproxy

    An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better warc-proxy alternative or higher similarity.

warc-proxy reviews and mentions

Posts with mentions or reviews of warc-proxy. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-11-11.
  • Ask HN: Best way to keep the raw HTML of scraped pages?
    3 projects | news.ycombinator.com | 11 Nov 2022
    I thought that mitmproxy did this, but cursory searches didn't show anything; that said, their actual format[1] has even more fidelity (I'd guess it's comparable to wireshark)

    One should be aware that WARC is great for preservation, but getting content back out of it would require specialized tooling ala: https://github.com/alard/warc-proxy

    1: https://github.com/mitmproxy/mitmproxy/blob/9.0.1/mitmproxy/...

Stats

Basic warc-proxy repo stats
1
61
10.0
over 11 years ago

The primary programming language of warc-proxy is Python.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com