Scala Parquet

Open-source Scala projects categorized as Parquet

Scala Parquet Projects

  • parquet4s

    Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.

    Project mention: Advice for storing tick data in Google Cloud | reddit.com/r/scala | 2021-01-25

    I don't have experience with cloud, but we used https://github.com/mjakubowski84/parquet4s for storing data to HDFS with Akka Streams. It uses Hadoop libraries for actual writing so it supports various object stores like S3 and Google Cloud Storage.

  • Schemer

    Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2021-01-25.

Index

Rank Project Stars
1 parquet4s 138
2 Schemer 97