awesome-web-archiving
zotero
Metric | awesome-web-archiving | zotero |
---|---|---|
Mentions | 13 | 254 |
Stars | 1,818 | 9,264 |
Growth | 2.1% | 2.3% |
Activity | 5.2 | 9.9 |
Last commit | 5 days ago | 1 day ago |
Language | - | JavaScript |
License | Creative Commons Zero v1.0 Universal | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
awesome-web-archiving
-
Show HN: OpenAPI DevTools – Chrome ext. that generates an API spec as you browse
https://github.com/iipc/awesome-web-archiving/blob/main/READ...
-
DPReview.com is going down effective April 10.
People have pasted this around: https://github.com/iipc/awesome-web-archiving. You could probably do it with wget if you had enough time.
- DPReview.com to close on April 10 after 25 years of operation
-
This Layoff Does Not Exist: tech layoff announcements but weird
Maybe something on this list can help you https://github.com/iipc/awesome-web-archiving
-
Software to keep Website pages "alive"?
Awesome Web Archiving has a longer list of tools and software.
-
How to Download All of Wikipedia onto a USB Flash Drive
Not related to the OP topic or zim but I was looking into archiving my bookmarks and other content like documentation sites and wikis. I'll list some of the things I ended up using.
ArchiveBox[1]: Pretty much a self-hosted wayback machine. It can save websites as plain html, screenshot, text, and some other formats. I have my bookmarks archived in it and have a bookmarklet to easily add new websites to it. If you use the docker-compose you can enable a full-text search backend for an easy search setup.
WebRecorder[2]: A browser extension that creates WACZ archives directly in the browser capturing exactly what content you load. I use it on sites with annoying dynamic content that sites like wayback and ArchiveBox wouldn't be able to copy.
ReplayWeb[3]: An interface to browse archive types like WARC, WACZ, and HAR. The interface is just like browsing through your browser. It can be self-hosted as well for the full offline experience.
browsertrix-crawler[4]: A CLI tool to scrape websites and output to WACZ. It's super easy to run with Docker and I use it to scrape entire blogs and docs for offline use. It uses Chrome to load webpages and has some extra features like custom browser profiles, interactive login, and autoscroll/autoplay. I use the `--generateWACZ` parameter so I can use ReplayWeb to easily browse through the final output.
For bookmark and misc webpage archiving, ArchiveBox should be more than enough. Check out this repo for an amazing list of tools and resources: https://github.com/iipc/awesome-web-archiving
[1] https://github.com/ArchiveBox/ArchiveBox
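The browsertrix-crawler workflow described in the comment above (run with Docker, pass `--generateWACZ`, then open the result in ReplayWeb) could look roughly like the sketch below. It only assembles the Docker command line. The helper name `build_crawl_command`, the example URL, the collection name, and the local `crawls` directory are assumptions for illustration and do not come from the comment. The image name `webrecorder/browsertrix-crawler` and the `/crawls/` mount point follow the project's README at the time of writing and may change.

```python
import shlex
from pathlib import Path


def build_crawl_command(url: str, collection: str, output_dir: Path) -> list[str]:
    """Return the argv for a one-off browsertrix-crawler run in Docker.

    Hypothetical helper: it only builds the command, it does not run it.
    """
    return [
        "docker", "run", "--rm",
        # Mount a local directory so the crawl output survives the container.
        "-v", f"{output_dir.resolve()}:/crawls/",
        "webrecorder/browsertrix-crawler", "crawl",
        "--url", url,
        # Package the crawl as a WACZ file that ReplayWeb can open.
        "--generateWACZ",
        "--collection", collection,
    ]


if __name__ == "__main__":
    cmd = build_crawl_command("https://example.com/", "example-site", Path("crawls"))
    # Print the command for inspection; pass `cmd` to subprocess.run(cmd, check=True)
    # to actually start the crawl (requires Docker).
    print(shlex.join(cmd))
```

Building the command as a list rather than a single string avoids shell-quoting problems when the URL contains characters such as `&` or `?`.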
- Self Hosted Roundup #14
- SingleFile: Save a Complete Web Page into a Single HTML File
- [HELP] Starting Out for a Beginner
- Reflections as the Internet Archive turns 25
zotero
-
Google Scholar PDF Reader
Maybe try Zotero[1]. There are many addons which can do what you need.
[1] https://www.zotero.org/
-
I wrote my bibliography manually (Don't ask why). How do I sort it by the first letter of each entry?
And next time, use a real literature management program like Zotero (some university libraries offer classes, there is r/zotero, etc.) or JabRef to create a proper BibTeX file with the references. It is not that difficult, and it keeps you sane (especially if a paper has to be reformatted for a different publisher). See e.g. learnlatex.
-
Ask HN: Who is hiring? (December 2023)
Zotero | Remote | Full-Time or Part-Time | https://www.zotero.org
Zotero is an open-source project that develops software to help people collect, organize, annotate, cite, and share their research. Our software is recommended by most universities and used by millions of students, scholars, scientists, and researchers worldwide.
We're looking for a JavaScript developer to work on Zotero "translators" — the pieces of code that let people click a button in their browser toolbar on any webpage and save high-quality metadata and files to their Zotero libraries. If you like web scraping, APIs, data formats, and exploring sites in the browser devtools, this would be up your alley. As a core Zotero developer, you'll also have the ability to work across Zotero's vast ecosystem and help shape the future of the project.
This is an open-ended contract role that can scale up and down in hours based on availability and workload.
https://www.zotero.org/jobs
-
Show HN: Odin – the integration of LLMs with Obsidian note taking
Zotero is your answer; it even auto-generates your citations.
https://www.zotero.org/
Apparently there are plugins for Logseq and Obsidian as well.
- Ask HN: How do you use your iPad?
-
A collection of useful Mac Apps
Zotero - Price: Free. Free and open-source reference manager that helps you collect, organize, and cite your research sources.
-
Is there an equivalent of calibredb for research papers?
I use the free and open-source Zotero, which I think you'd find very calibre-like, and I manage notes and concept linking with org-roam in Emacs.
-
Will I lose everything on Zotero?
If you can't resist the urge to know, you can check in the Zotero web library whether all of your things are still there.
-
Advice for Thesis students
Resources: ZOTERO. Zotero is a free (you can pay to get more storage), open-source citation manager with optional browser plugins. IT WILL FORMAT CITATIONS FOR YOU. (sometimes you have to edit them, but most of the time it can pull metadata and format things correctly on its own). You can sort your references into folders or with tags, read and annotate PDF copies on your computer or in a mobile app, and make notes - which I used to keep track of specific quotations I wanted to use.
-
Extra Reading for Archaeology / Ancient History
You can also use online resources like The Encyclopedia of Archaeological Sciences, which I think is mostly free, or the Handbook of Archaeological Sciences, which I think is also mostly free. If you can't get hold of those, you can also email the authors/editors and they might send you a free copy, or look them up on Academia.edu and see if they have a free version. Also, if you don't already, use Google Scholar; it's the best resource for finding free articles and topics to read. It's also never too early to start using something like Zotero, Mendeley, or EndNote to keep track of your readings and help you with citations/references in papers. You can literally download the citation, import it into one of those systems, and it automatically formats your referencing.
What are some alternatives?
SingleFileZ - Web Extension to save a faithful copy of an entire web page in a self-extracting ZIP file
calibre - The official source code repository for the calibre ebook manager
ArchiveBox - 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
jabref - Graphical Java application for managing BibTeX and biblatex (.bib) databases
obelisk - Go package and CLI tool for saving web page as single HTML file
obsidian-citation-plugin - Obsidian plugin which integrates your academic reference manager with the Obsidian editor. Search your references from within Obsidian and automatically create and reference literature notes for papers and books.
SingleFile-MV3 - SingleFile version compatible with Manifest V3. The future, right now!
Zettlr - Your One-Stop Publication Workbench
firefox-scrapbook - ScrapBook X – a legacy Firefox add-on that captures web pages to local device for future retrieval, organization, annotation, and edit.
notion-auto-pull - Bash script to automatically download a notion workspace
youtube-dl - Command-line program to download videos from YouTube.com and other video sites
zotero-mdnotes - A Zotero plugin to export item metadata and notes as markdown files