Our great sponsors
-
Activeloop Hub
Discontinued Data Lake for Deep Learning. Build, manage, query, version, & visualize datasets. Stream data real-time to PyTorch/TensorFlow. https://activeloop.ai [Moved to: https://github.com/activeloopai/deeplake] (by activeloopai)
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
For readers' context: zarr is a self-describing n-dimensional array hierarchy format specification which can sit over more or less any key-value store. If you've ever used HDF5, it's basically that, but array chunks are exploded over the file system/ cloud store, and all the metadata is JSON. It's gaining traction in the biological imaging and geo/meteorological data communities, among other places. Work on the v3 specification is in progress, which aims to abstract away a generic protocol, as well as fold in the community behind N5, an almost-identical format used by a small but vocal number of bio-imaging labs.
Related posts
- What are good alternatives to zip files when working with large online image datasets?
- [N] Access Google Objectron (~1.92 TBs) in less than 5 seconds with Activeloop Hub
- Win up to $1000 by using activeloopai/Hub to make the data preprocessing easier for CVPR Kaggle challenges
- Win up to $1000 by using activeloopai/Hub to make the data preprocessing easier for CVPR Kaggle challenges
- Win up to $1000 by using activeloopai/Hub to make the data preprocessing easier for CVPR Kaggle challenges