Top 23 datasette Open-Source Projects
-
Project mention: Little Data: How do we query personal data? (2013) | news.ycombinator.com | 2024-03-01
I'm a fan of simonw's datasette/dogsheep ecosystem https://datasette.io/
-
Sounds like your problem is with SQLAlchemy, not with SQLite.
My https://sqlite-utils.datasette.io library might be a better fit for you. It's a much thinner abstraction than SQLAlchemy.
-
> If you learn something the hard way, share your findings with others. You have blazed a new trail; now you must mark it for your fellow travellers. Sharing knowledge is an unreasonably effective way of helping others.
This is a really nice philosophy. It's one of the reasons why I have my https://til.simonwillison.net TIL site - any time I search for something and can't find the answer is a hint that there's a tiny gap in the internet which I can help fill.
-
Yeah, I worry about that too. I have SO much of my stuff dependent on my GitHub account now.
I'm slightly reassured by how useful their APIs are. I have automated exports of a lot of my GitHub issues, though I really should shore those up and make sure I'm capturing everything.
That's one of the reasons I built https://github.com/dogsheep/github-to-sqlite
-
Project mention: Show HN: I built an open-source data copy tool called ingestr | news.ycombinator.com | 2024-02-27
I was surprised to see SQLite listed as a source but not as a destination. Any big reasons for that or is it just something you haven't got around to implementing yet?
I've been getting a huge amount of useful work done over the past few years sucking data from other systems into SQLite files on my own computer - I even have my own small db-to-sqlite tool for this (built on top of SQLAlchemy) - https://github.com/simonw/db-to-sqlite
-
dogsheep-beta
Build a search index across content from multiple SQLite database tables and run faceted searches against it using Datasette
-
Project mention: Observable 2.0, a static site generator for data apps | news.ycombinator.com | 2024-02-15
Me too, and that led to developing the « datasette-dashboards » plugin[0]. I use this for my company where all the data is gathered by connectors scheduled in CI, storing data in Git, and triggering a SQLite db build and Datasette deployment. « BI as Code » if you will
-
Hah, Softbank isn't the goal here!
I realized that Datasette is the first project of my entire career where if I was still working on it in 15 years time I wouldn't feel bored yet. There's just SO MUCH scope for interesting applications of the core idea.
As such, I want to work on it for decades. But it's lonely working on it alone (the community around it has been growing and is delightful, but it's not the same as having a full-time team.)
So the question I'm trying to answer is how to make the project financially sustainable in the long-run - not just for myself, but so I can pay for a team to work on it with me.
There are plenty of other examples of open source projects that have turned SaaS hosting into a sustainable business model - WordPress and GitLab are just two of the best examples. It feels like it's a reasonably well-trodden path.
Plus... I want people to be able to use my software. Currently to use Datasette as an individual you either have to "pip" or "brew" install it, or you can try the macOS Electron app - https://datasette.io/desktop - but I want newsrooms to be able to use it to collaborate on data. And most newsrooms aren't well equipped to configure a Linux server.
So I realized that a hosted SaaS version can solve two issues at once: it can help the audience I care about actually benefit from the value of the software so far, and it provides a reasonably realistic path to financial sustainability for the project as a whole.
And yeah, I'd also like to make a ton of money out of it myself too!
-
Project mention: Tuql: Automatically create a GraphQL server from a SQLite database | news.ycombinator.com | 2023-04-25
Impressive how little code is involved here! This is really neat.
The biggest feature I can see that's missing is pagination - it looks like this doesn't have a way to retrieve e.g. ten results, then pass a next token to get back the next set.
Here's how I implemented pagination in my similar datasette-graphql plugin (which also gives you a GraphQL API for an existing SQLite database): https://github.com/simonw/datasette-graphql#pagination
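The next-token idea described above can be sketched as keyset pagination with Python's built-in `sqlite3` module. This is only an illustration under stated assumptions, not how datasette-graphql or Tuql implement it: the `items` table, the `fetch_page` helper, and the choice of the integer primary key as the token are all hypothetical.

```python
import sqlite3


def fetch_page(conn, page_size=10, after=None):
    """Return (rows, next_token) using keyset pagination on the id column.

    Hypothetical helper: `after` is the id of the last row on the previous
    page, or None to request the first page.
    """
    # Ask for one extra row so we can tell whether another page exists.
    if after is None:
        cursor = conn.execute(
            "SELECT id, name FROM items ORDER BY id LIMIT ?",
            (page_size + 1,),
        )
    else:
        cursor = conn.execute(
            "SELECT id, name FROM items WHERE id > ? ORDER BY id LIMIT ?",
            (after, page_size + 1),
        )
    rows = cursor.fetchall()
    has_more = len(rows) > page_size
    rows = rows[:page_size]
    # The token is the last id returned; None signals the final page.
    next_token = rows[-1][0] if has_more else None
    return rows, next_token


def demo():
    """Build a 25-row in-memory table and walk through it page by page."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE items (id INTEGER PRIMARY KEY, name TEXT)")
    conn.executemany(
        "INSERT INTO items (id, name) VALUES (?, ?)",
        [(i, f"item-{i}") for i in range(1, 26)],
    )
    pages = []
    token = None
    while True:
        rows, token = fetch_page(conn, page_size=10, after=token)
        pages.append(rows)
        if token is None:
            break
    return pages
```

Filtering on `id > ?` rather than using `OFFSET` keeps each page query cheap on large tables, since SQLite can seek straight to the token via the primary key.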
-
Project mention: GitHub – GSA/code-gov: An informative repo for all Code.gov repos | news.ycombinator.com | 2023-09-09
https://github.com/simonw/datasette-ripgrep
Seeing as there's already a JSONLD @context (schema) for code.json, CSVW as JSONLD and/or YAMLLD would be an easy way to merge Linked Data graphs of tabular data:
-
covid-19-datasette
Deploys a Datasette instance of COVID-19 data from Johns Hopkins CSSE and the New York Times
-
In my own experiments I've caught ChatGPT running the correct query but then hallucinating the results, because the response was too long for the token context window and got truncated!
I have an open issue about that here: https://github.com/simonw/datasette-chatgpt-plugin/issues/2
More about my explorations: https://simonwillison.net/2023/Mar/24/datasette-chatgpt-plug...
-
laion-aesthetic-datasette
Use Datasette to explore LAION improved_aesthetics_6plus training data used by Stable Diffusion
datasette related posts
- SpatiaLite: Library extending SQLite to support Spatial SQL capabilities
- Duty to Document
- Getting things done (in small increments)
- Use SQL Without Databases
- More people should write (2012)
- Collection of "Today I Learned" notes
- GitHub – GSA/code-gov: An informative repo for all Code.gov repos
Index
What are some of the best open-source datasette projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | datasette | 8,791 |
2 | sqlite-utils | 1,454 |
3 | til | 926 |
4 | csvs-to-sqlite | 855 |
5 | twitter-to-sqlite | 395 |
6 | github-to-sqlite | 373 |
7 | db-to-sqlite | 334 |
8 | datasette-lite | 305 |
9 | dogsheep-beta | 178 |
10 | healthkit-to-sqlite | 175 |
11 | apple-notes-to-sqlite | 167 |
12 | C64-Collection | 154 |
13 | datasette-dashboards | 131 |
14 | tapuino | 130 |
15 | datasette-app | 115 |
16 | pocket-to-sqlite | 99 |
17 | datasette-graphql | 93 |
18 | google-takeout-to-sqlite | 88 |
19 | datasette.io | 79 |
20 | datasette-ripgrep | 69 |
21 | covid-19-datasette | 61 |
22 | datasette-chatgpt-plugin | 60 |
23 | laion-aesthetic-datasette | 58 |