wayback-machine-downloader
reddit-search
wayback-machine-downloader | reddit-search | |
---|---|---|
48 | 16 | |
5,053 | 204 | |
- | - | |
0.0 | 0.0 | |
3 months ago | about 2 years ago | |
Ruby | TypeScript | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
wayback-machine-downloader
-
Ask HN: Cool Useful GitHub Repos?
I just found this https://github.com/hartator/wayback-machine-downloader
anyone have anything similarly interesting/cool/niche-useful ?
-
ArchiveTeam is saving Blogger from Google deletion
Send ArchiveTeam the link on IRC or here and we can save it to archive.org, then later you can use wayback-machine-downloader to grab it from archive.org.
https://github.com/hartator/wayback-machine-downloader
-
My TikTok was Hacked & Deleted and I GOT IT BACK!
This is where it gets tricky, you need to download the code from the wayback machine and he was able to do that by following these steps: https://github.com/hartator/wayback-machine-downloader
-
Is there a way to quick download twitter images on the wayback machine?
Not sure if it will work for twitter, but I have used wayback-machine-downloader to batch download stuff.
-
Forgot to backup my WordPress files before I swapped webhosting provider, am I screwed?
Adding to archive.org, there is a github repo to fetch website data. You can give a try too. Here is the repo link
- Can I please get help downloading and saving a website for offline use?
-
Hey guys, looks like we have a potential hacker on our hands. All of our company's files were deleted from our FTP. :( Is there any way we can get a cache of our website and restore everything? Any help or advice would be greatly appreciated. Thanks in advance!
Edit: Good news! I found a solution that saved me. I was able to download the full website (including images, JS, and CSS files) using this tool: https://github.com/hartator/wayback-machine-downloader
-
Hey guys, so a potential hacker managed to delete all of our company's files from our FTP. Yikes! Is there a way to retrieve a cache of our website and restore it? Any advice or tips would be greatly appreciated. Thanks in advance!
Edit: Thank you to everyone who suggested the Wayback Machine Downloader! It saved the day and allowed me to download the full website, including images, JS, and CSS files.
-
Have a lengthy flight: how to seamlessly mirror couple websites
I've used https://github.com/hartator/wayback-machine-downloader but it sometimes messes up CSS badly
- what Do YOU Recommend?
reddit-search
-
Shreddit is a Python program to remove all your Reddit comments
When I quit reddit (over irritation at being banned from too many subreddits for not aligning properly with the hive mind), I used this tool to extract and save all my comments locally before manually deleting the lot:
https://github.com/camas/reddit-search
...aaand Github has disabled the repository for 'Terms of Service' violations. Go figure, maybe there's a mirror somewhere.
-
The final nail in the coffin.
The GitHub has been officially disabled again for ToS violations.
- Camas reddit-search "This repository has been disabled"
-
An error occured! code isn't exactly "enterprise" so feel free to tell me on Github or use pushshift directly / Cannot read properties of undefined (reading 'split') in r in r
RIP https://github.com/camas/reddit-search/issues/new
-
Daily General Discussion - May 15, 2022
#1: Camas reddit-search "has been disabled by GitHub Staff due to a violation of GitHub's Terms of Service." | 130 comments #2: What happened to removeddit.com? #3: Online Removal Request form for removal requests. Please put your removal request here where it can be processed more quickly.
-
GitHub Reddit Search Deleted
From the repository (https://github.com/camas/reddit-search):
-
Discussion Thread
Access to this repository has been disabled by GitHub Staff due to a violation of GitHub's Terms of Service.
-
Camas reddit-search "has been disabled by GitHub Staff due to a violation of GitHub's Terms of Service."
Actually, I was referring to the notice posted right on the repo:
- Camas "has been disabled by GitHub Staff due to a violation of GitHub's Terms of Service."
-
Checkmate
Man I'm almost smart enough to do all that... I got to this website and couldn't figure out the rest. I will keep trying. Is this what hacking feels like? Glad people have a better memory than I do!
What are some alternatives?
savepagenow - A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service
Pushshift API - Pushshift API
warrick - Recover lost websites from the Web Infrastructure
reddit-search
neocities - Neocities.org - the web site. The entire thing. Yep, we're completely open source.
Hexo - A fast, simple & powerful blog framework, powered by Node.js.
wayback-machine-spn-scripts - Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now
gba-remote-play - 📡 Stream Raspberry Pi games to a GBA via Link Cable.
go-readability - Go package that cleans a HTML page for better readability.
waybackpack - Download the entire Wayback Machine archive for a given URL.
WriteFreely - A clean, Markdown-based publishing platform made for writers. Write together and build a community.
docs - This is a repo of the RetroArch official document page.