petastorm

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code. (by uber)

Petastorm Alternatives

Similar projects and alternatives to petastorm

  • horovod

    8 petastorm VS horovod

    Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

  • Activeloop Hub

    Discontinued Data Lake for Deep Learning. Build, manage, query, version, & visualize datasets. Stream data real-time to PyTorch/TensorFlow. https://activeloop.ai [Moved to: https://github.com/activeloopai/deeplake] (by activeloopai)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • d2l-en

    6 petastorm VS d2l-en

    Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

  • best-of-ml-python

    🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

  • nni

    5 petastorm VS nni

    An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

  • jina

    126 petastorm VS jina

    ☁️ Build multimodal AI applications with cloud-native stack

  • wandb

    16 petastorm VS wandb

    🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • SysML-v2-Release

    The latest incremental release of SysML v2. Start here.

  • data-toolset

    Upgrade from avro-tools and parquet-tools jars to a more user-friendly Python package.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better petastorm alternative or higher similarity.

petastorm reviews and mentions

Posts with mentions or reviews of petastorm. We have used some of these posts to build our list of alternatives and similar projects.

Stats

Basic petastorm repo stats
2
1,751
3.7
5 months ago

uber/petastorm is an open source project licensed under Apache License 2.0 which is an OSI approved license.

The primary programming language of petastorm is Python.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com