openlibrary
ArchiveBox
Our great sponsors
openlibrary | ArchiveBox | |
---|---|---|
408 | 248 | |
4,831 | 19,737 | |
2.3% | 3.1% | |
9.9 | 9.7 | |
7 days ago | 8 days ago | |
Python | Python | |
GNU Affero General Public License v3.0 | MIT |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
openlibrary
-
Ask HN: Anyone looking for contributors for their open source projects
I'd like to make a pitch for Openlibrary.org the free online library from Internet Archive that includes a fulltext search of millions of books.
I've been volunteering with them on and off for several years and it's always a lovely experience. Their backend is python and frontend mostly from python templates and some Vue for librarian stuff.
Every Tuesday they have a call on Zoom that everyone is welcome to join to share what they're working on, ask for help, and generally chat a bit. It's a great time.
Depending on what you're interested in there's a lot to do from helping build import pipelines for more book entries, writing bots to cleanup data, Performance improvements, better documenting public APIs, etc
I'm currently slowly working on a wikidata integration for their authors page. We also could use some help upgrading to Vue 3, mentors for Google summer of code would be helpful, find of ML projects needing help, moving away from old jQuery libraries, etc.
They can be quite responsive to PRs too like I blogged about here: https://blog.rayberger.org/idea-to-merged-in-less-than-30-mi...
For example, here's a small issue that could use some help on the python side: https://github.com/internetarchive/openlibrary/issues/8928
-
Building an Open Source Decentralized E-Book Search Engine
OpenLibrary does provide search access to full texts. For example: https://openlibrary.org/search/inside?q=%22institutional+thi...
It is open source and they're always looking for contributors. I think they'd especially welcome help improving search!
https://github.com/internetarchive/openlibrary/
- Show HN: Mutable.ai – Turn your codebase into a Wiki
-
MLIS books available digitally?
Check out https://openlibrary.org. You can search ´library science’, librarian’, etc, and something should come up. Just select the ‘ebooks’ option to search for items within the collection. And you can narrow the search by subject, etc.
- HMF a “legal” website to download books
-
NaNoWriMo: National Novel Writing Month
Right now I'm in the middle of the chicken and the egg problem where we don't have enough authors cataloging their publications and b/c of that obviously readers are not interested in using the site.
I've gone back and forth with taking Open Libray's [0] catalog as that would at least flesh out our collection of books but then I'd have to deal with verifying authors to accounts so they can access their books. Which sounds like a major headache and also just defeats the concept of building a community.
Since this is really a weekend project, I'm just going to keep building the tools out to perfection and hope people will trickle in over time.
Luckily for me I just want to write, so the tools I'm building are exactly what works for my writing goals and I think overtime others will find the same value.
[0] https://openlibrary.org
-
is there any way to read books for free?
Here's one: https://openlibrary.org/
-
YSK: You can access many old and out of print hiking books from the Internet Archive's Open Library
The Internet Archive runs what they call the Open Library, which is a unique concept on the traditional library. You can sign-up with minimal details and digitally check out many scanned books from libraries all over the world. The only caveat is that almost all of the books are older editions - ones that would be impossible to find locally. It's great if you're looking for old routes, a look back in time, details about obscure areas, or just prefer to read a book rather than browse AllTrails. Please do still support local authors whenever you can as guidebooks take hundreds of hours to create and are slowly going extinct.
-
🐍🐍 23 issues to grow yourself as an exceptional open-source Python expert 🧑💻 🥇
Repo : https://github.com/internetarchive/openlibrary
-
Searching for a pharmacy book
I want to clarify that I'm a non-US citizen, so accessing physical copies from US libraries or buying it from Amazon might not be feasible for me. To give you some context, my personal research was guided by the wiki section of r/FREEMEDIAHECKYEAH (https://www.reddit.com/r/FREEMEDIAHECKYEAH/wiki/reading/). I've conducted research using various online resources, including the Ebook & Open Source/Access Libraries such as Sci-Hub, Z-Library, Library Genesis, Anna’s Archive, and PDF Drive. Additionally, I've checked Torrent Search Engines like The Pirate Bay and BTDigg. Moreover, I've searched in Internet Archive and its Open Library but again I had no luck. However, I haven't yet explored software-based libraries. Finally I've looked into the Ebay if anyone had the particular book but it looks like both the versions are quite rare, because the book was meant to be only for Pharmarcist and especially for American ones.
ArchiveBox
-
Ask HN: What Underrated Open Source Project Deserves More Recognition?
Two projects I greatly appreciate, allowing me to easily archive my bandcamp and GOG purchases (after the initial setup anyways):
https://github.com/easlice/bandcamp-downloader
https://github.com/Kalanyr/gogrepoc
And I recently learned about archivebox, which I think is going to be a fast favorite and finally let me clear out my mess of tabs/bookmarks: https://github.com/ArchiveBox/ArchiveBox
- YaCy, a distributed Web Search Engine, based on a peer-to-peer network
-
Vice website is shutting down
If you really want to save the content for yourself, use something like https://archivebox.io/
I've been running a local instance for a few years now and download/save tech articles all time. I can search and find them as needed.
-
An Introduction to the WARC File
API is coming soon (relatively, it's still a one-man project)! Stay tuned https://github.com/ArchiveBox/ArchiveBox/issues/496
I have an event-sourcing refactor in progress now to allow us to pluginize functionality like the API (similar to Home Assistant with a plugin app sotre), it will take a month or two. Next up is the REST API using the new plugin system.
-
Ask HN: How can I back up an old vBulletin forum without admin access?
I guess your best chance is to use something like https://archivebox.io/.
-
ArchiveBox – open-source self-hosted web archiving
Yeah this is a cool project but it was discussed 2 days ago.
As mentioned by the maintainer there, they even maintain a list of alternatives, very classy:
https://github.com/ArchiveBox/ArchiveBox/wiki/Web-Archiving-...
- ArchiveBox: Open-source self-hosted web archiving
- Linkhut: A Social Bookmarking Site
- Show HN: Rem: Remember Everything (open source)
- Bookmark manager with a focus on organization?
What are some alternatives?
DeDRM_tools - DeDRM tools for ebooks
Wallabag - wallabag is a self hostable application for saving web pages: Save and classify articles. Read them later. Freely.
calibre - The official source code repository for the calibre ebook manager
paimon-moe - Your best Genshin Impact companion! Help you plan what to farm with ascension calculator and database. Also track your progress with todo and wish counter.
bypass-paywalls-chrome - Bypass Paywalls web browser extension for Chrome and Firefox.
SingleFile - Web Extension for saving a faithful copy of a complete web page in a single HTML file
launcher - Launcher for Flashpoint Archive
ArchivesSpace - The ArchivesSpace archives management tool
stylegan2-pytorch - Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement
grab-site - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
RegExr - RegExr is a HTML/JS based tool for creating, testing, and learning about Regular Expressions.
Archivematica - Free and open-source digital preservation system designed to maintain standards-based, long-term access to collections of digital objects.