orjson
datasette
orjson | datasette | |
---|---|---|
22 | 198 | |
7,099 | 10,181 | |
2.2% | 0.8% | |
8.0 | 8.2 | |
18 days ago | about 1 month ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
orjson
-
Web scraping of a dynamic website using Python with HTTP Client
The library already has support for an HTTP client that allows bypassing Cloudflare - CurlImpersonateHttpClient. Since we have to work with JSON responses we could use parsel_crawler added in version 0.3.0, but I think this is excessive for such tasks, besides I like the high speed of orjson.. Therefore, we'll need to implement our crawler rather than using one of the ready-made ones.
- orjson: Fast, correct Python JSON lib supporting dataclasses, datetimes, NumPy
-
JSON extra uses orjson instead of ujson
(https://github.com/ijl/orjson). In this implementation, the same JSON
-
This Week In Python
orjson – Fast, correct Python JSON library
- Orjson: Fast, correct Python JSON library
- JSON dans les projets data science : Trucs & Astuces
-
JSON in data science projects: tips & tricks
orjson is the fastest JSON library available for python. It natively manages dataclass objects, datetime, numpy and UUID objects.
- Segunda linguagem
-
Litestar 2.0
As we began venturing down that road, a few things emerged that would constitute significant changes to some of the core parts of Litestar, but there were two things in particular that started a chain reaction of changes by opening up further possibilities: The new DTOs and our switch from orjson to msgspec.
- orjson: Fast, correct Python JSON lib (supports dataclasses, datetimes, numpy)
datasette
-
Gmail to SQLite
A couple of reasons which pop to mind:
- Searching a plain text data file is O(n). Searching a SQLite database that has been properly indexed, which is very easy to do nowadays with FTS5, is O(log n) worst case scenario and O(1) in the best case. This doesn't explain why SQLite over a dataframe or anything, but it definitely justifies it over plain text for large email collections.
- SQLite is really easy to write custom views and programs around. Virtually every major programming language can work with it without issue. See also: simonw's wonderful https://datasette.io/ .
- SQLite is an accepted archival format by the Library of Congress, if you ever want to go down the rabbit hole of digital preservation.
-
datasette alternatives - csvbase, portaljs, and DataTables
4 projects | 21 Apr 2025
- A Love Letter to the CSV Format
-
Sell Yourself Sell Your Work
It's https://datasette.io - I'm still having so much fun with it, especially since any idea I want to experiment with can be justified as a Datasette plugin!
-
Exploring LLMs: A Blind Trial for Code Completions
SQLite is used because it's lightweight, requires no server setup, and provides a self-contained database solution ideal for this type of data collection. Additionally, Datasette can be used to easily query, visualize, and publish the data for later analysis.
-
Exploring the Paramilitary Leaks
It does seem like a job well suited to simonw's https://github.com/simonw/datasette
-
Retaking the Web Browser, One Small Step at a Time
In theory, you could make it happen with:
- a db
- a db browser
- a {bookmarks,browser,etc}-to-db tool
For instance,
- sqlite
- datasette https://datasette.io/
- pocket-to-sqlite https://datasette.io/tools/pocket-to-sqlite
Furthermore, you could integrate e.g. Claude with this, by creating a lightweight model-context-protocol server that lets Claude browse through datasette. You would probably want to throw in a vector db in-between somewhere, so you could get semantic search over bookmarks to work nicely.
-
SpiceNice – An Open Source Spice Database
Seems like a perfect job for Datasette: SQLite plus web api and UI
“Datasette is a tool for exploring and publishing data. It helps people take data of any shape, analyze and explore it, and publish it as an interactive website and accompanying API.”
https://datasette.io/
-
I Track My Health Data in Markdown: Lessons in Digital Longevity
You might like this: https://datasette.io/ and it might even fill your blogging needs. Fully opensource (and there are extension to use llm's with it).
-
Show HN: SQLite Transaction Benchmarking Tool
I wrote an async wrapper around SQLite in Python - I'm using a thread pool: https://github.com/simonw/datasette/blob/main/datasette/data...
I have multiple threads for reads and a single dedicated thread for writes, which I send operations to via a queue. That way I avoid ever having two writes against the same connection at the same time.
What are some alternatives?
msgspec - A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML
DuckDB - DuckDB is an analytical in-process SQL database management system
ujson
nocodb - 🔥 🔥 🔥 Open Source Airtable Alternative
ormsgpack - Msgpack serialization/deserialization library for Python, written in Rust using PyO3. Reboot of orjson. msgpack.org[Python]
Sequel-Ace - MySQL/MariaDB database management for macOS