archiveweb.page
oldweb-today
archiveweb.page | oldweb-today | |
---|---|---|
7 | 22 | |
739 | 236 | |
3.1% | 1.3% | |
6.3 | 4.2 | |
about 1 month ago | 4 months ago | |
JavaScript | JavaScript | |
GNU Affero General Public License v3.0 | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
archiveweb.page
- Webrecorder: Capture interactive websites and replay them at a later time
- r18 database of metadata
-
Ask HN: What is going on at archive.ph?
I use this from time to time to archive web page for my own use
https://github.com/webrecorder/archiveweb.page
I also convert web snippets I find useful to the markdown format and store it in my Joplin notebook, so that way it lives on even if the website is gone.
- "scrape" a javascript object from a website?
-
GitNoter โ An open source alternative to Evernote (Self Hosted)
There's also ArchiveWeb.page, which records in the same WARC format as archive.org
https://github.com/webrecorder/archiveweb.page
- Archiveweb.page โ A High-Fidelity Web Archiving Extension for Chromium Browsers
oldweb-today
-
Ask HN: What products other than Obsidian share the file over app philosophy?
There are flat-file CMSes (content management systems) like Grav: https://getgrav.org/
I guess, in some vague/broad sense, config-as-code systems also implement something similar? Maybe even OpenAPI schemas could count to some degree...?
In the old days, the "semantic web" movement was an attempt to make more webpages both human- and machine-readable indefinitely by tagging them with proper schema: https://en.wikipedia.org/wiki/Resource_Description_Framework. Even Google was on board for a while, but I guess it never saw much uptake. As far as I can tell it's basically dead now, both because of non-semantic HTML (everything as a React div), general laziness, and LLMs being able to parse things loosely.
-------------
Side thoughts...
Philosophically, I don't know that capturing raw data alone as files is really sufficient to capture the nuances of any particular experience, or the overall zeitgeist of an era. You can archive Geocities pages, but that doesn't really capture the novelty and indie-ness of that era. Similarly, you can save TikTok videos, but absent the cultural environment that created them (and a faithful recreation of the recommendation algorithm), they wouldn't really show future archaeologists how teenagers today lived.
I worked for a natural history museum for a while, and while we were there, one of the interesting questions (well, to me anyway) was whether our web content was in and of itself worth preserving as a cultural artifact -- both so that future generations can see what exhibits were interesting/apropos for the cultures of our times, but also so they could see how our generation found out about those exhibitions to begin with (who knows what the Web will morph into 50 years later). It wasn't enough to simply save the HTML of our web pages, both because they tie into various other APIs and databases (like zoological collections) and because some were interactive experiences, like games designed to be played with a mouse (before phones were popular), or phone chatbots with some of our specimens. To really capture the experience authentically would've required emulating not just our tech stacks and devices, among other things.
Like for the earlier Geocities example, sure you could just save the old HTML and render it with a modern browser, but that's not the same as something like https://oldweb.today/?browser=ns3-mac#http://geocities.com/ , which emulates the whole OS and browser too. And that still isn't the same as having to sit in front of a tiny CRT and wait minutes for everything to download over a 14.4k modem, only to be interrupted when mom had to make a call.
I guess that's a longwinded of critiquing "file over app": It only makes sense for things that are originally files/documents to begin with. Much of our lives now are not flat docs but "experiences" that take much more thought and effort to archive. If the goal is truly to preserve that posterity, it's not enough to just archive their raw data, but to develop ways to record and later emulate entire experiences, both technological and cultural. It ain't easy!
-
Ask HN: It's 1997, how do you build a website?
1. Anything that can emulate Windows 95
2. HotDog, Dreamweaver, and/or Frontpage for HTML editing. They each have their strengths & weaknesses and work relatively well together. Normally I'd lay out pages in Dreamweaver or Frontpage and then clean up the HTML with HotDog. https://en.wikipedia.org/wiki/HotDog. Upload them with BulletproofFTP (http://www.bpftp.com/), an ancient predecessor of Filezilla.
3. Not sure if any of the old hosts are still around. Maybe Neocities.org comes close to the Geocities of old, or else you can get barebones and very cheap but reliable web hosting from https://www.nearlyfreespeech.net/
Get AOL dialup: https://getonline.aol.com/dialup and a 56k modem off eBay, no cheating with megabit connections.
You can emulate old browsers online: https://oldweb.today/
People generally weren't self-hosting their websites back then because phone lines weren't very reliable, and the dialup modems of the day (like today's cable modems) generally had faster download than upload speeds. After the BBS era, commercial internet hosts started becoming pretty common, but they usually used expensive ISDN or T1 lines.
Apache was still in its infancy then, and Microsoft IIS was common.
---------
Overall, I don't think the network stack is really THAT far removed from what we have today. The basics of HTTP were pretty fleshed out already (it was more the HTTPS and DNS security extensions that really evolved), along with better compression and parallelization protocols in HTTP2/3. And of there's been a huge amount of backend server optimizations, caching, reverse proxies, etc. But the old network stack would still work today and you can still run a basic barebones http daemon and firewall the same way you could back then, open up port 80, and watch your box get pwned by the bots. Hell, there are probably still zombie bots leftover from that era running on someone's basement PC, just casually scanning the internet day in and day out, waiting for its chance...
What's changed a lot more (IMO) is more the language of the web itself. HTML was very basic then, as was JS and CSS. Making websites without modern CSS, in particular, is very painful... requiring a lot of nested tables, frames (remember those? they're different from iframes!), etc. A lot of the pain was abstracted away to backend scripting languages (Perl, ASP, Coldfusion) or frontend extensions (Java, Shockwave Flash) and people didn't really work in vanilla HTML/JS very often (because they were so weak by then). Plain HTML is for your basic Astro Marmalade site with a bunch of animated gifs and whatever, but probably real ecommerce sites (such as it were) wouldn't be written in plain HTML :)
-
SURF THE WEB LIKE IT'S 1999!
I like https://oldweb.today ... actually emulates old OS/browser combinations and proxies stuff from archive.org
-
Is there a website that shows chronological versions of websites/apps/aplatforms?
Another option to what you are looking for https://oldweb.today/
- Old Web, Today
-
I miss the old Internet
Good news! You can go back in time https://oldweb.today/
- Casual Friday - ๐ต Talking to the moon ๐ We could talk forever ๐ต
-
How To See What a Website Used To Look Like In 2019?
https://web.archive.org/ does not have a screenshot from that month. https://oldweb.today doesn't work either because it pulls up a broken version (I tried all of the browsers it allows on the left). I tried looking online for another site to be able to pull an archive but was unsuccessful.
I need to see what gravitytransformation.com/macro-calculator looked like in September of 2019. https://web.archive.org/ does not have a screenshot from that month. https://oldweb.today doesn't work either.
-
macOS 8 Running in the Browser
https://oldweb.today/ gives you that experience, using some of the same building blocks.
What are some alternatives?
replayweb.page - Serverless replay of web archives directly in the browser
web-archives - Browser extension for viewing archived and cached versions of web pages, available for Chrome, Edge and Safari
Surfingkeys - Map your keys for web surfing, expand your browser with javascript and keyboard.
pygooglenews - If Google News had a Python library
ungoogled-chromium-extension-installer - Extension for Ungoogled Chromium that allows easy installation of extensions from Chrome webstore.
wrp - Web Rendering Proxy: Use vintage, historical, legacy browsers on modern web
kdeconnect-chrome-extension - A browser extension to send pages and content from your browser to connected KDE Connect devices.
vaporBoy - Gameboy / Gameboy Color Emulator PWA built with Preact. โ๏ธ Powered by wasmBoy. ๐ฎThemed with VaporWave. ๐ด๐ฌ
extension.js - ๐งฉ Plug-and-play, zero-config, cross-browser extension development tool.
neocities - Neocities.org - the web site. Yep, the backend is open source!
markdownload - A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file.
picotcp.js