sql.js-httpvfs
s3fs-fuse
sql.js-httpvfs | s3fs-fuse | |
---|---|---|
15 | 57 | |
3,233 | 8,095 | |
- | 1.1% | |
1.5 | 8.8 | |
about 1 year ago | 9 days ago | |
TypeScript | C++ | |
Apache License 2.0 | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sql.js-httpvfs
-
A future for SQL on the web (2021)
I couldn't find do these wrappers support read-only SQLite databases with HTTP range requests, like in this famous post [1]. Phiresky's wrapper supports it, but it seems to be rebuilding the whole sql.js [2], I'd rather have it as VFS on top of sqlite.org's own WASM module. I like the idea of HTTP range requests, but I don't want to run a fork, that will be unmaintained in few years.
[1]: https://news.ycombinator.com/item?id=27016630
[2]: https://github.com/phiresky/sql.js-httpvfs
- Cloud Backed SQLite
-
Transmission 4.0.0 beta 1 is out
Oh that’s an interesting idea. I saw someone built SQLite over HTTP with the Range header: https://phiresky.github.io/blog/2021/hosting-sqlite-database...
So presumably in a similar manner as https://github.com/phiresky/sql.js-httpvfs you could map SQLite pages to leaf torrents too and get the chunking you are looking for.
-
netgrep - An experimental porting of ripgrep on WASM over the HTTP protocol.
But ripgrep has to read the whole file; if your intended use case is searching a blog, I'd recommend using something which makes an index, so you don't need to download everything. For example, sql.js-httpvfs is SQLite ported to wasm, with the DB file read over HTTP. If you use FTS to make an index, it works astonishingly well for text search.
-
Datasette Lite: a server-side Python web application running in a browser
How close is Python SQLite and Datasette Lite to accessing a hosted SQL database using HTTP range requests as can be done in sql.js like https://github.com/phiresky/sql.js-httpvfs?
I put together a Pyodide-based web app where users need a few indexed queries from a 600mb SQLite database but it isn't very practical for them to download the whole thing into the browser.
https://observablehq.com/@thadk/life
-
Ws4sqlite: Query SQLite via HTTP
You can also access sqlite databases directly from an http server that supports range requests (like s3). There are a bunch of implementations of this in different languages including Go[0] and Javascript[1].
[0]: https://github.com/psanford/sqlite3vfshttp
[1]: https://github.com/phiresky/sql.js-httpvfs
-
Static torrent website with peer-to-peer queries over BitTorrent on 2M records
Thanks to SQLite VFS abstraction, it is possible to implement your own file system on which SQLite parks data and structures. Inspired by Phiresky's sql.js-httpvfs which uses HTTP Range requests to lazy load blocks of storage from a static web server, I changed few lines of code to point the VFS read() calls to a database seeded by peers as a torrent. A 300 MiB db with 2 million records can be queried from seeders for full text searches in less than 2 MiB traffic with the BitTorrent protocol, all inside the browser, in a static website.
-
WebVM: Server-less x86 virtual machines in the browser
Hosting SQLite Databases on GitHub Pages" https://news.ycombinator.com/item?id=28021766 : DuckDB can query [and page] Parquet from GitHub, sql.js-httpvfs, sqltorrent, File System Access API (Chrome only so far; IDK about resource quotas and multi-GB datasets), serverless search with WASM workers
https://github.com/phiresky/sql.js-httpvfs :
> sql.js is a light wrapper around SQLite compiled with EMScripten for use in the browser (client-side).
> This repo is a fork of and wrapper around sql.js to provide a read-only HTTP-Range-request based virtual file system for SQLite. It allows hosting an SQLite database on a static file hoster and querying that database from the browser without fully downloading it.
> The virtual file system is an emscripten filesystem with some "smart" logic to accelerate fetching with virtual read heads that speed up when sequential data is fetched. It could also be useful to other applications, the code is in lazyFile.ts. It might also be useful to implement this lazy fetching as an SQLite VFS [*] since then SQLite could be compiled with e.g. WASI SDK without relying on all the emscripten OS emulation.
- Show HN: Link-Archive.org
-
Database-Less Torrent Website
It doesn't have to be range requests, you could split the file instead of depending on the range requests. Essentially it's the same as the chunking instructions for hosters who have a maximum file size as is demonstrated here: https://github.com/phiresky/sql.js-httpvfs/blob/master/creat...
s3fs-fuse
-
Is Posix Outdated?
The author needs to ask themselves: in this cloud technology stack, is there POSIX involved somewhere lower down, where I can't access it? The answer is, of course, "yes". The sort of cloud storage systems described all run on top of POSIX APIs. They provide convenience (cost efficiency is more debatable) compared to the POSIX alternative, but that's because they exist at an entirely different conceptual layer (hence the presence of POSIX anyway, just buried).
Your point about surfacing a POSIX that's actually there but hidden and thus visible to low-level Amazon employees building the S3 service which makes it invisible to S3 end customers is true but isn't the the point of the article. The author is saying there are motivations for a POSIX-like api visible also the end user.
So your explanation of stack looks like 2 layers: POSIX api <-- AWS S3 built on top of that
Author's essay is actually talking about 3 layers: POSIX <-- AWS S3 <-- POSIX
That's why the blog post has the following links to POSIX-on-top-of-S3-objects :
https://github.com/s3fs-fuse/s3fs-fuse
https://github.com/kahing/goofys
https://www.cuno.io/
- Gcsfuse: A user-space file system for interacting with Google Cloud Storage
-
R2 slow PUT file transfer
sudo apt install build-essential libfuse-dev fuse git clone https://github.com/s3fs-fuse/s3fs-fuse.git cd s3fs-fuse sudo apt install libfuse2 sudo apt install libcurl4-openssl-dev sudo apt install libxml2-dev ./autogen.sh ./configure make
- Cloud Backed SQLite
-
Podman and S3 Storage Driver (Audiobookshelf)
Don’t know actually. Here is project page.
- Uploading hundreds to thousands of files to S3
- Linux Client for R2
-
s3fs-fuse - allows to mount your s3/minio bucket link to your local directory
s3fs-fuse
-
AWS Announces Open Source Mountpoint for Amazon S3
How is this different than these other solutions?
https://github.com/kahing/goofys
https://github.com/s3fs-fuse/s3fs-fuse
-
Introducing Mountpoint for Amazon S3 - A file client that translates local file system API calls to S3 object API calls like GET and LIST.
I don’t get it. Why not just improve https://github.com/s3fs-fuse/s3fs-fuse
What are some alternatives?
alasql - AlaSQL.js - JavaScript SQL database for browser and Node.js. Handles both traditional relational tables and nested JSON data (NoSQL). Export, store, and import data from localStorage, IndexedDB, or Excel.
goofys - a high-performance, POSIX-ish Amazon S3 file system written in Go
datasette - An open source multi-tool for exploring and publishing data
rclone - "rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files
sqlite3vfshttp - Go sqlite3 http vfs: query sqlite databases over http with range headers
mountpoint-s3 - A simple, high-throughput file client for mounting an Amazon S3 bucket as a local file system.
duckdb - DuckDB is an in-process SQL OLAP Database Management System
jellyfin-webos - WebOS Client for Jellyfin
wp-sqlite-db - A single file drop-in for using a SQLite database with WordPress. Based on the original SQLite Integration plugin.
jellyfin-tizen - Jellyfin Samsung TV Client
sqltorrent
mediacms - MediaCMS is a modern, fully featured open source video and media CMS, written in Python/Django and React, featuring a REST API.