Building an Open Source Decentralized E-Book Search Engine

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • ebook-demo

  • IPFSPytorchDataset

    An IPFS Dataset class for PyTorch to load an IPFS node and categorize images based on sub folders

  • Many moons ago I wanted to do something similar for AI data sets and models over IPFS. I don't know the future for IPFS but I do hope the essence of a p2p data sharing infrastructure becomes more accessible to help individuals tackle some of the issues with large datasets with less hardware on hand.

    https://github.com/JakeKalstad/IPFSPytorchDataset

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • load_ipfs_pytorch_model

    Loading a pytorch model from an IPFS CID

  • openlibrary

    One webpage for every book ever published!

  • OpenLibrary does provide search access to full texts. For example: https://openlibrary.org/search/inside?q=%22institutional+thi...

    It is open source and they're always looking for contributors. I think they'd especially welcome help improving search!

    https://github.com/internetarchive/openlibrary/

  • emdash

    πŸ“šπŸ§™β€β™‚οΈ Wisdom indexer β€” use AI to organize text snippets so you can actually remember & learn from what you read

  • I have a side project that aims to organize your ebook highlight collections with on-device semantic search. [1] Right now it only indexes your own content but I'd like to add a mode that allows you to share your collection and let others find relevant ideas via semantic search -- a discovery platform for ideas found in books. It's open source if you want a sense of how it works now. [2]

    [1] https://emdash.ai/

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts