Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 4 apachespark Open-Source Projects
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
FLiPStackWeekly
FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
Project mention: Getting Started with Flink SQL, Apache Iceberg and DynamoDB Catalog | dev.to | 2023-12-18Apache Iceberg is one of the three types of lakehouse, the other two are Apache Hudi and Delta Lake.
NOTE:
The open source projects on this list are ordered by number of github stars.
The number of mentions indicates repo mentiontions in the last 12 Months or
since we started tracking (Dec 2020).
apachespark related posts
- For those of you with Lakehouse Architectures, how do you handle duplicate records?
- AWS ACID data lakehouse
- PySpark: A brief analysis to the most common words in Dracula, by Bram Stoker
- Data n00b looking for guidance on how to setup data lake/warehouse
- apache/hudi: Upserts, Deletes And Incremental Processing on Big Data.
- Big Data file formats
- What do you use for Data versioning?
-
A note from our sponsor - InfluxDB
www.influxdata.com | 28 Apr 2024
Index
What are some of the best open-source apachespark projects? This list will help you:
Project | Stars | |
---|---|---|
1 | hudi | 5,066 |
2 | SparkSQL.jl | 25 |
3 | FLiPStackWeekly | 14 |
4 | dracula | 0 |
Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com