Python Avro

Open-source Python projects categorized as Avro

Top 3 Python Avro Projects

  • DataProfiler

    What's in your data? Extract schema, statistics and entities from datasets

  • Project mention: LongRoPE: Extending LLM Context Window Beyond 2M Tokens | news.ycombinator.com | 2024-02-22

    It's been possible to skip tokenization for a long time, my team and I did it here - https://github.com/capitalone/DataProfiler

    For what it's worth, we actually were working with LSTMs with nearly a billion params back in 2016-2017 area. Transformers made it far more effective to train and execute, but ultimately LSTMs are able to achieve similar results, though slow & require more training data.

  • clickhouse-sink-connector

    Replicate data from MySQL, Postgres and MongoDB to ClickHouse

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • data-toolset

    Upgrade from avro-tools and parquet-tools jars to a more user-friendly Python package.

  • Project mention: data-toolset: Upgrade from avro-tools and parquet-tools jars to a more user-friendly Python package. | /r/dataengineering | 2023-09-22
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Avro related posts

Index

What are some of the best open-source Avro projects in Python? This list will help you:

Project Stars
1 DataProfiler 1,357
2 clickhouse-sink-connector 168
3 data-toolset 1

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com