java-warc

Read Web ARChive (WARC) files in Java. (by bottomless-archive-project)


java-warc reviews and mentions

Posts with mentions or reviews of java-warc. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-01-11.
  • How I archived 100 million PDF documents... (Part 1)
    6 projects | dev.to | 11 Jan 2023
    I found one Java library on GitHub (thanks, Mixnode) that was able to read these files. Unfortunately, it had not been maintained for a couple of years. I picked it up and forked it to make it a little easier to use. (A couple of years later, this repo was moved under the Bottomless Archive project as well.)

Stats

Basic java-warc repo stats
1
5
10.0
over 2 years ago

bottomless-archive-project/java-warc is an open source project licensed under the Apache License 2.0, which is an OSI-approved license.

The primary programming language of java-warc is Java.

