TileDB vs oil

TileDB	Project	oil
12	Mentions	234
1,762	Stars	2,720
2.1%	Growth	1.6%
9.7	Activity	9.9
6 days ago	Latest Commit	3 days ago
C++	Language	Python
MIT License	License	GNU General Public License v3.0 or later

The number of mentions is the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

TileDB

Posts with mentions or reviews of TileDB. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-09-01.

Ask HN: Who is hiring? (September 2023)
14 projects | news.ycombinator.com | 1 Sep 2023

- single cell genomics: in collaboration with the Chan-Zuckerberg Initiative, we recently released TileDB-SOMA for single cell data, with APIs for both Python and R built around a common storage specification: https://tiledb.com/blog/tiledb-101-single-cell
With TileDB, all data — tables, genomics, images, videos, location, time-series — across multiple domains is captured as multi-dimensional arrays. TileDB Cloud implements a totally serverless infrastructure and delivers access control, easier data and code sharing and distributed computing at global scale, eliminating cluster management, minimizing TCO and promoting scientific collaboration and reproducibility.
Website: https://tiledb.com
GitHub: https://github.com/TileDB-Inc/TileDB
Why TileDB as a Vector Database
2 projects | news.ycombinator.com | 2 Aug 2023

Stavros from TileDB here (Founder and CEO). I thought of requesting some feedback from the community on this blog. It was only natural for a multi-dimensional array database like TileDB to offer vector (i.e., 1D array) search capabilities. But the team managed to do it very well and the results surprised us. We are just getting started in this domain and a lot of new algorithms and features are coming up, but the sooner we get feedback the better.
TileDB-Vector-Search Github repo: https://github.com/TileDB-Inc/TileDB-Vector-Search
TileDB-Embedded (core array engine) Github repo: https://github.com/TileDB-Inc/TileDB
TileDB 101: Vector Search (blog to get kickstarted): https://tiledb.com/blog/tiledb-101-vector-search/
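To make the idea of vector search concrete, here is a minimal sketch of exhaustive (brute-force) similarity search in plain Python. It is not the TileDB-Vector-Search API; the function names, the `VECTORS` data, and the choice of cosine similarity are all assumptions made for illustration. Real libraries typically use approximate indexes to avoid scanning every vector.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def top_k(query, vectors, k=2):
    """Score every stored vector against the query and keep the k best keys."""
    scored = [(cosine_similarity(query, vec), key) for key, vec in vectors.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [key for _, key in scored[:k]]


# Hypothetical embeddings, invented for this example.
VECTORS = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.9, 0.1, 0.0],
    "doc_c": [0.0, 1.0, 0.0],
}

nearest = top_k([1.0, 0.0, 0.0], VECTORS, k=2)
```

Here `nearest` holds the two keys whose vectors point most nearly in the same direction as the query.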
Ask HN: Who is hiring? (August 2023)
13 projects | news.ycombinator.com | 1 Aug 2023

TileDB, Inc. | Full-Time | REMOTE | USA | Greece | https://tiledb.com
TileDB is the database for complex data, allowing data scientists, researchers, and analysts to access, analyze, and share any data with any tool at global scale. We have just launched a vector search library leveraging TileDB and TileDB Cloud for powerful local search and seamless scaling to multi-modal organizational datasets and batched computation: https://tiledb.com/blog/why-tiledb-as-a-vector-database
With TileDB, all data — tables, genomics, images, videos, location, time-series — across multiple domains is captured as multi-dimensional arrays. Our vector search library and other offerings are designed to empower these datasets with extreme interoperability via numerous APIs and tool integrations across the data science ecosystem, eliminating the hassles and inefficiencies of data conversion. TileDB Cloud implements a totally serverless infrastructure and delivers access control, easier data and code sharing and distributed computing at global scale, eliminating cluster management, minimizing TCO and promoting scientific collaboration and reproducibility.
Ask HN: Who is hiring? (December 2022)
14 projects | news.ycombinator.com | 1 Dec 2022

TileDB, Inc. | Full-Time | REMOTE | USA | Greece | https://tiledb.com
TileDB transforms the lives of analytics professionals and data scientists with a universal database, allowing them to access, analyze, and share any data with any tool at global scale. TileDB unifies the way we think about data, delivering superior performance and foundational data management capabilities. All data — tables, genomics, images, videos, location, time-series — across multiple domains is captured as multi-dimensional arrays. TileDB offers extreme interoperability via numerous APIs and tool integrations across the data science ecosystem, eliminating the hassles and inefficiencies of data conversion. TileDB Cloud implements a totally serverless infrastructure and delivers access control, easier data and code sharing and distributed computing at global scale, eliminating cluster management, minimizing TCO and promoting scientific collaboration and reproducibility.
TileDB, Inc. was spun out of MIT and Intel Labs in May 2017 and is backed by Two Bear Capital, Nexus Venture Partners, Uncorrelated Ventures, Intel Capital and Big Pi.
Recent HN article: https://news.ycombinator.com/item?id=23896131
Website: https://tiledb.com
GitHub: https://github.com/TileDB-Inc/TileDB
Docs: https://docs.tiledb.com
Blog: https://tiledb.com/blog
Our headquarters are located in Cambridge, MA and we have a subsidiary in Athens, Greece. We offer the ability to work remotely. If you are located outside of the USA and Greece we have options to accommodate this, don't hesitate to apply!
We have several open positions aimed at increasing TileDB’s feature set, growth and adoption. You will have the opportunity to work on innovative technology that creates impact on challenging and exciting problems in Genomics, Geospatial, Time Series, and more. Immediate features on the roadmap for TileDB Cloud include advanced distributed computations, advanced computation pushdown, improved multi-cloud deployments and more.
We are actively seeking:
- Senior Golang Engineer
- Senior Python Engineer
- Site Reliability Engineer
- React Frontend Engineer
Apply today at https://tiledb.workable.com !
Historical weather data API for machine learning, free for non-commercial
1 project | news.ycombinator.com | 6 Jul 2022

Interesting. Have you come across TileDB before?
https://tiledb.com/
Why isn’t there a decent file format for tabular data?
13 projects | news.ycombinator.com | 3 May 2022

Hi folks, Stavros from TileDB here. Here are my two cents on tabular data. TileDB (Embedded) is a very serious competitor to Parquet, the only other sane choice IMO when it comes to storing large volumes of tabular data (especially when combined with Arrow). Admittedly, we haven’t been advertising TileDB’s tabular capabilities, but that’s only because we were busy with much more challenging applications, such as genomics (population and single-cell), LiDAR, imaging and other very convoluted (from a data format perspective) domains.
Similar to Parquet:
* TileDB is columnar and comes with a lot of compressors, checksum and encryption filters.
* TileDB is built in C++ with multi-threading and vectorization in mind
* TileDB integrates with Arrow, using zero-copy techniques
* TileDB has numerous optimized APIs (C, C++, C#, Python, R, Java, Go)
* TileDB pushes compute down to storage, similar to what Arrow does
Better than Parquet:
* TileDB is multi-dimensional, allowing rapid multi-column conditions
* TileDB builds versioning and time-traveling into the format (no need for Delta Lake, Iceberg, etc)
* TileDB allows for lock-free parallel writes / parallel reads with ACID properties (no need for Delta Lake, Iceberg, etc)
* TileDB can handle more than tables, for example n-dimensional dense arrays (e.g., for imaging, video, etc)
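The claim that a multi-dimensional layout speeds up multi-column conditions can be sketched as follows. This is a toy illustration in plain Python, not TileDB's storage engine or API: the `TILE` extent, the function names, and the sample points are assumptions. Cells are grouped into 2-D tiles, and a range query on both columns only visits tiles that overlap the query rectangle.

```python
from collections import defaultdict

TILE = 10  # tile extent along each dimension (hypothetical value)


def build_tiles(points):
    """Group (x, y, value) cells by the 2-D tile that contains them.

    Assumes non-negative integer coordinates.
    """
    tiles = defaultdict(list)
    for x, y, value in points:
        tiles[(x // TILE, y // TILE)].append((x, y, value))
    return tiles


def range_query(tiles, x_range, y_range):
    """Return (sorted values, tiles visited) for an inclusive 2-D range.

    Only tiles overlapping the query rectangle are read; all others are
    skipped without looking at their cells.
    """
    (x_lo, x_hi), (y_lo, y_hi) = x_range, y_range
    results, visited = [], 0
    for tx in range(x_lo // TILE, x_hi // TILE + 1):
        for ty in range(y_lo // TILE, y_hi // TILE + 1):
            cells = tiles.get((tx, ty))
            if cells is None:
                continue  # empty tile, nothing stored there
            visited += 1
            results.extend(
                v for x, y, v in cells
                if x_lo <= x <= x_hi and y_lo <= y <= y_hi
            )
    return sorted(results), visited


# Hypothetical sample data spread across four tiles.
POINTS = [(1, 2, "a"), (3, 15, "b"), (12, 4, "c"), (25, 27, "d"), (14, 8, "e")]
values, tiles_read = range_query(build_tiles(POINTS), (10, 19), (0, 9))
```

In this example the query touches a single tile out of four, which is the pruning effect a condition on two dimensions at once makes possible.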
Useful links:
* Github repo (https://github.com/TileDB-Inc/TileDB)
* TileDB Embedded overview (https://tiledb.com/products/tiledb-embedded/)
* Docs (https://docs.tiledb.com/)
* Webinar on why arrays as a universal data model (https://tiledb.com/blog/why-arrays-as-a-universal-data-model)
Happy to hear everyone’s thoughts.
Genomics data management reimagined. Analyze and share enormous variant datasets with TileDB Cloud.
1 project | /r/u_tiledb | 28 Jan 2022
TileDB VS Activeloop hub - a user suggested alternative
2 projects | 20 Oct 2021
Seeking options for multidimensional data storage
1 project | /r/Database | 12 Aug 2021

It could be worth checking out TileDB: https://github.com/TileDB-Inc/TileDB The entire system, down to the data format itself, is optimized around storing multi-dimensional arrays. It also supports timestamps and real numbers as dimensions, which could be handy given your example data. [Full disclosure: I currently work for TileDB.]
Ask HN: Who is hiring? (January 2021)
15 projects | news.ycombinator.com | 4 Jan 2021

TileDB, Inc. | Full-Time | REMOTE | USA | Greece | https://tiledb.com
TileDB, Inc. is the company behind TileDB, the first universal data engine. TileDB allows analytics professionals and data scientists to access, analyze, and share complex data sets with any tool at extreme scale. TileDB overcomes the constraints of columnar tables, flat files, and SQL-only tools, handling all data with a multi-dimensional array engine and extreme interoperability across the data science ecosystem. TileDB Cloud is a totally serverless offering of TileDB, which delivers access control and enables distributed computing at planet-scale, eliminating all cluster management and minimizing cost. TileDB, Inc. was spun out of MIT and Intel Labs in May 2017 and closed a $15M Series A in July 2020, following a previous $4M Seed Round.
Recent HN article: https://news.ycombinator.com/item?id=23896131
Website: https://tiledb.com
GitHub: https://github.com/TileDB-Inc/TileDB
Docs: https://docs.tiledb.com
Blog: https://tiledb.com/blog
Our headquarters are located in Cambridge, MA and we have a subsidiary in Athens, Greece. We offer the ability to work remotely, but the candidates must reside either in the US or in Greece. US candidates must be US citizens, whereas Greek candidates must be Greek or EU citizens.
We have several open positions aimed at increasing TileDB’s feature set, growth and adoption. You will have the opportunity to work on innovative technology that creates impact on challenging and exciting problems in Genomics, Geospatial, Time Series, and more. A few features on the roadmap include enhancing our TileDB Cloud offering, optimizing our serverless framework, improving integration with JupyterLab, and expanding our marketplace functionality.
We are primarily seeking:
- Senior Golang Engineer
Apply today at https://tiledb.workable.com !

oil

Posts with mentions or reviews of oil. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-03.

Autoconf makes me think we stopped evolving too soon
8 projects | news.ycombinator.com | 3 Apr 2024

will prevent almost all of the "silent footguns".
YSH has strict:all and then a bunch of NEW features.
There's been good feedback recently, which has led to many concrete changes. So your experience can definitely influence the language! https://github.com/oilshell/oil/wiki/Where-To-Send-Feedback
Basic Things
1 project | news.ycombinator.com | 30 Mar 2024

Regarding writing tools/tests/benchmarks in bash+Python, vs. writing tools in your main language:
I think we might eventually concede that something Debian-like is the “standard development environment” (at least for server side stuff, i.e. not iOS apps)
In this case, bash+Python is a non-issue. It works extremely reliably. That’s actually why I use it! Everything else seems to break, or it’s really slow (node.js is a very common alternative).
- Microsoft conceded this back in ~2017, by building Linux into their kernel with WSL, and providing Ubuntu on top
Yes bash + Python is a disaster on Windows (I have scars from it), but Microsoft agrees that the right place to solve that is in Windows :-)
- Every CI system runs Debian/Ubuntu
- Every hosting provider runs Debian/Ubuntu
- Every online dev env like gitpod.io provides Debian/Ubuntu
This is somewhat related to remote dev envs: https://lobste.rs/s/ucirlx/lapdev_self_hosted_remote_dev
One vision for https://www.oilshell.org/ is that the CI environment is the dev environment is the hosting environment.
Everything is just an equal node in a distributed system. BUT it’s more git like, in that you explicitly sync and work “locally”, wherever that is. You don’t have the network chatter and flakiness of “the cloud”.
Oils has a very large set of monotonically increasing properties too - https://www.oilshell.org/release/0.21.0/quality.html
All that is bash+Python that is run on every commit, and it’s extremely good at catching bugs and perf regressions.
I’m skeptical that any project has that level of quality automation written in pure Rust or Zig. More likely it’s a bunch of cloud services with YAML.
Also a bunch of “hard-coded” toolchains that you can’t script with bespoke code. Like some shell commands in your package.json, which is just a worse way of writing a shell script.
Our quality process is all self-hosted, in the repo, and runs on both Github Actions and sourcehut - https://www.oilshell.org/release/0.21.0/pub/metrics.wwz/line...
bash and Python run perfectly on Github Actions and sourcehut, with zero change. Containers also do.
(Although we need to unify the CI and release, because the release runs on 2 different real hardware machines, while CI is cloud only.)
Also, a main point of Oils is that bash now has another highly compatible, spec-driven implementation – OSH. Having 2 independent implementations is something newer languages don’t have.
(copy of lobste.rs comment)
The secret weapon of Bash power users
2 projects | news.ycombinator.com | 24 Mar 2024

in your bashrc to enable it. I've used it for probably ~18 years now.
It also works with https://www.oilshell.org/ since we use GNU readline. Just 'set -o vi' in ~/.config/oils/oshrc
Pipexec – Handling pipe of commands like a single command
6 projects | news.ycombinator.com | 9 Mar 2024

No other shell does that.
But I didn't know it was called MULTIOS until now. (I guess that's read "mult I/O's"? I have a hard time not reading it as multi-OS :) )
It seems a bit niche to be honest, but it's possible to support in Oils.
---
Oils also uses Unix domain sockets already for the headless shell protocol
https://github.com/oilshell/oil/wiki/Headless-Mode
We could do something like dgsh, but so far I haven't seen a lot of uptake / demand. Every time it's mentioned, somebody kinda wants it, and then it kinda peters out again ... still possible though.
I think flat files work fine for a lot of use cases, and once you add streaming, you also want monitoring, more control over backpressure/queue sizes, etc.
Show HN: Hancho – A simple and pleasant build system in ~500 lines of Python
4 projects | news.ycombinator.com | 3 Mar 2024

which works well. You don't have to clean when rebuilding variants. IMO this is 100% essential for writing C++ these days. You need a bunch of test binaries, and all tests should be run with ASAN and UBSAN.
---
I wrote a mini-bazel on top of Ninja with these features:
https://www.oilshell.org/blog/2022/10/garbage-collector.html...
So it's ~1700 lines, but for that you get the build macros like asdl_library() generating C++ and Python (the same as proto_library(), a schema language that generates code)
And it also correctly finds dependencies of code generators. So if you change a .py file that is imported by another .py file that is used to generate a C++ header, everything will work. That was one of the trickier bits, with Ninja implicit dependencies.
I also use the Bazel-target syntax like //core/process
This build file example mixes low level Ninja n.rule() and n.build() with high level r.cc_library() and so forth. I find this layering really does make it scale better for bigger projects
https://github.com/oilshell/oil/blob/master/asdl/NINJA_subgr...
Some more description - https://lobste.rs/s/qnb7xt/ninja_is_enough_build_system#c_tu...
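The point about code-generator dependencies comes down to following dependencies transitively. The sketch below is not the Oils build code; the `DEPS` graph and file names are hypothetical. It shows why a change to an imported helper must invalidate a header produced by the generator that imports it.

```python
def transitive_inputs(target, deps):
    """Return every file `target` depends on, directly or indirectly."""
    seen = set()
    stack = [target]
    while stack:
        node = stack.pop()
        for dep in deps.get(node, ()):
            if dep not in seen:
                seen.add(dep)
                stack.append(dep)
    return seen


def affected_nodes(changed_file, deps):
    """List the graph nodes that must be rebuilt when `changed_file` changes."""
    return sorted(t for t in deps if changed_file in transitive_inputs(t, deps))


# Hypothetical graph: a generated header depends on its generator script,
# and the generator script depends on the module it imports.
DEPS = {
    "syntax.h": ["gen_cpp.py", "syntax.asdl"],
    "gen_cpp.py": ["helpers.py"],
    "main.o": ["main.cc", "syntax.h"],
}

to_rebuild = affected_nodes("helpers.py", DEPS)
```

Editing `helpers.py` marks the generator, the generated header, and the object file that includes the header, even though only the generator mentions `helpers.py` directly.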
Re2c
4 projects | news.ycombinator.com | 22 Feb 2024

This is sort of a category error...
re2c is a lexer generator, and YAML and Python are recursive/nested formats.
You can definitely use re2c to lex them, but it's not the whole solution.
I use it for everything possible in https://www.oilshell.org, and it's amazing. It really reduces the amount of fiddly C code you need to parse languages, and it drops in anywhere.
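The distinction between lexing and parsing a nested format can be shown with a small sketch. Python's `re` module stands in for a generated lexer here (re2c itself emits C code), and the bracketed-list grammar is a made-up example. The lexer is a flat pass over the text, while the nesting has to be handled by a separate recursive step.

```python
import re

# Flat token patterns: brackets, commas, and integers.
# Whitespace and any other characters are silently skipped by findall.
TOKEN_RE = re.compile(r"\[|\]|,|\d+")


def lex(text):
    """Flat pass: split the input into a list of token strings."""
    return TOKEN_RE.findall(text)


def parse(tokens, pos=0):
    """Recursive pass: build a nested list from the token stream.

    Returns (value, next_position). Assumes well-formed input; a sketch
    with no error handling.
    """
    tok = tokens[pos]
    if tok != "[":
        return int(tok), pos + 1
    items, pos = [], pos + 1
    while tokens[pos] != "]":
        item, pos = parse(tokens, pos)  # recursion handles the nesting
        items.append(item)
        if tokens[pos] == ",":
            pos += 1
    return items, pos + 1


tokens = lex("[1, [2, 3], []]")
tree, _ = parse(tokens)
```

The regular expression alone cannot check that brackets balance; that job belongs to the recursive `parse` function, which is the part a lexer generator does not provide.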
Ask HN: Looking for a project to volunteer on? (February 2024)
15 projects | news.ycombinator.com | 1 Feb 2024

SEEKING VOLUNTEERS - https://www.oilshell.org/ - https://github.com/oilshell/oil/
I'm looking for people to help fill out the "standard library" for Oils/YSH. We're implementing a shell for Python and JavaScript programmers who avoid shell!
On the surface, this is writing some very simple functions in typed Python. But I've realized that the hardest parts are specifying, TESTING, and documenting what the functions do.
---
The most recent release announcement also asks for help - https://www.oilshell.org/blog/2024/01/release-0.19.0.html (long)
If you find all those details interesting (if maybe overwhelming), you might have a mind for language design, and could be a good person to help.
Surveying what Python and JavaScript do is very helpful, e.g. for the recent Str.replace() function, which is nontrivial (takes a regex or string, replacement template or string)
But there are also very simple methods to get started, like Dict.values() and List.indexOf(). Other people have already contributed code. Examples:
https://github.com/oilshell/oil/commit/58d847008427dba2e60fe...
https://github.com/oilshell/oil/commit/8f38ee36d01162593e935...
This can also be useful to tell if you'll have fun working on the project - https://github.com/oilshell/oil/wiki/Where-Contributors-Have...
More on #help-wanted on Zulip (requires login) - https://oilshell.zulipchat.com/#narrow/stream/417617-help-wa...
Please send a message on Github or Zulip! Or e-mail me andy at oilshell dot org.
The rust project has a burnout problem
3 projects | news.ycombinator.com | 17 Jan 2024

This is true, but then the corollary is that new PRs need to come with this higher and rigorous level of test coverage.
And then that becomes a bit of a barrier to contribution -- that's a harness
I often write entirely new test harnesses for features, e.g. for https://www.oilshell.org, many of them linked here. All of these run in the CI - https://www.oilshell.org/release/latest/quality.html
The good thing is that it definitely helps me accept PRs faster. Current contributors are good at this kind of exhaustive testing, but many PRs aren't
Unix as IDE: Introduction (2012)
3 projects | news.ycombinator.com | 27 Dec 2023
Oils
1 project | news.ycombinator.com | 8 Dec 2023

What are some alternatives?

When comparing TileDB and oil you can also consider the following projects:

ClickHouse - ClickHouse® is a free analytics DBMS for big data

nushell - A new type of shell

RocksDB - A library that provides an embeddable, persistent key-value store for fast storage.

fish-shell - The user-friendly command line shell.

MongoDB C Driver - The Official MongoDB driver for C language

elvish - Powerful scripting language & Versatile interactive shell

LevelDB - LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.

xonsh - Python-powered, cross-platform, Unix-gazing shell.

libmdbx - One of the fastest embeddable key-value ACID database without WAL. libmdbx surpasses the legendary LMDB in terms of reliability, features and performance.

PowerShell - PowerShell for every system!

MongoDB Libbson

ShellCheck - ShellCheck, a static analysis tool for shell scripts
