Activeloop Hub
Discontinued Data Lake for Deep Learning. Build, manage, query, version, & visualize datasets. Stream data real-time to PyTorch/TensorFlow. https://activeloop.ai [Moved to: https://github.com/activeloopai/deeplake] (by activeloopai)
What solution have you used and liked as a data scientist when working with large datasets? Is there a standard Python API for accessing the data, or some other approach? If anyone has used https://github.com/activeloopai/Hub or a similar API, I'd be interested to hear about your experience with it!
We host image datasets on our platform. Until recently the stored datasets were relatively small (several hundred images, a few GB), so we only offered export as zip files containing images and labels in COCO or YOLO format. As the average dataset size grows, exporting a zip is no longer convenient.
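For illustration, here is a minimal sketch of the kind of Python access that could replace a full zip export: iterating over a COCO-style annotation index in batches, so that a client only pulls the images it needs. The function name `iter_coco_batches` and the tiny in-memory index are hypothetical and made up for this example. It uses only the standard library and is not the API of Hub or any other tool mentioned here. Fetching the actual image bytes (for example over HTTP, by `file_name`) is left out.

```python
from collections import defaultdict
from typing import Dict, Iterator, List


def iter_coco_batches(coco: Dict, batch_size: int) -> Iterator[List[Dict]]:
    """Yield batches of {"file_name", "annotations"} records from a COCO-style dict.

    Hypothetical helper for illustration only.
    """
    # Group annotations by image id so each yielded record is self-contained.
    by_image = defaultdict(list)
    for ann in coco.get("annotations", []):
        by_image[ann["image_id"]].append(ann)

    batch: List[Dict] = []
    for img in coco.get("images", []):
        batch.append({
            "file_name": img["file_name"],
            "annotations": by_image[img["id"]],
        })
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:  # emit the final, possibly smaller, batch
        yield batch


# Tiny made-up index standing in for a real COCO annotations file.
coco = {
    "images": [
        {"id": 1, "file_name": "a.jpg"},
        {"id": 2, "file_name": "b.jpg"},
        {"id": 3, "file_name": "c.jpg"},
    ],
    "annotations": [
        {"image_id": 1, "bbox": [0, 0, 10, 10], "category_id": 1},
        {"image_id": 1, "bbox": [5, 5, 20, 20], "category_id": 2},
        {"image_id": 3, "bbox": [1, 1, 4, 4], "category_id": 1},
    ],
}

batches = list(iter_coco_batches(coco, batch_size=2))
```

Because the function is a generator, a consumer can stop after the first few batches without touching the rest of the dataset, which is the main difference from downloading one large archive.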
Related posts
- [N] Access Google Objectron (~1.92 TBs) in less than 5 seconds with Activeloop Hub
- Win up to $1000 by using activeloopai/Hub to make the data preprocessing easier for CVPR Kaggle challenges
- [D] How to handle big datasets in computer vision ?