Our great sponsors
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
Camlistore
Perkeep (née Camlistore) is your personal storage system for life: a way of storing, syncing, sharing, modelling and backing up content.
-
spyglass
A personal search engine: Create a searchable library from your personal documents, interests, and more!
-
MeiliSearch
A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
I've been building something along these lines for my own personal data in top of my https://datasette.io project. I call it Dogsheep (it's a pun on Wolfram) - I explained it and gave a demo in this talk: https://simonwillison.net/2020/Nov/14/personal-data-warehouses/
I am long user of sist2 from simon987 for full text search of pdf. It indexes everything (file content and metadata) through elasticsearch while providing a nice GUI. https://github.com/simon987/sist2
If you want to live dangerously, this might eventually be useful: https://perkeep.org/
I believe spyglass is for website indexing but they are adding support for local files too. I haven't tried it yet but it might be helpful to you.