tispark
spline
Metric | tispark | spline |
---|---|---|
Mentions | 2 | 1 |
Stars | 878 | 582 |
Growth | 0.0% | 1.2% |
Activity | 5.1 | 6.3 |
Latest commit | 26 days ago | 6 days ago |
Language | Scala | Scala |
License | Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tispark
- A simple way to import TiSpark into Databricks to load TiDB data
dbutils.fs.mkdirs("dbfs:/databricks/scripts/")
dbutils.fs.put(
    "/databricks/scripts/tispark-install.sh",
    """
#!/bin/bash
wget --quiet -O /mnt/driver-daemon/jars/tispark-assembly-3.2_2.12-3.1.0-SNAPSHOT.jar https://github.com/pingcap/tispark/releases/download/v3.1.0/tispark-assembly-3.2_2.12-3.1.0.jar
""",
    True,
)
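The init script in the snippet above hard-codes one release. As a rough sketch, the same script text could be generated for other versions. The helper name `render_tispark_install_script` and its parameters are hypothetical, and the URL pattern is only generalized from the single URL in the snippet, so it may not hold for every TiSpark release. Unlike the snippet, this sketch names the local jar after the remote file.

```python
def render_tispark_install_script(
    tispark_version: str = "3.1.0",
    spark_version: str = "3.2",
    scala_version: str = "2.12",
    jar_dir: str = "/mnt/driver-daemon/jars",
) -> str:
    """Return the text of a cluster init script that downloads a TiSpark jar.

    Assumption: release assets follow the naming seen in the snippet above,
    tispark-assembly-<spark>_<scala>-<tispark>.jar under the tag v<tispark>.
    """
    jar_name = f"tispark-assembly-{spark_version}_{scala_version}-{tispark_version}.jar"
    url = (
        "https://github.com/pingcap/tispark/releases/download/"
        f"v{tispark_version}/{jar_name}"
    )
    # The shebang must be the first line of the script file.
    return f"#!/bin/bash\nwget --quiet -O {jar_dir}/{jar_name} {url}\n"


if __name__ == "__main__":
    script = render_tispark_install_script()
    print(script)
    # Inside a Databricks notebook the text could then be written with:
    #   dbutils.fs.put("/databricks/scripts/tispark-install.sh", script, True)
```

Keeping the rendering step separate from `dbutils` makes it possible to inspect the script text outside Databricks before uploading it.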
- Understand TiSpark pushdown
TiSpark is a thin layer built for running Apache Spark on top of TiKV/TiFlash to answer complex OLAP queries. It supports reading, writing, and deleting from TiKV/TiFlash with the guarantee of transactions.
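For Spark to read from TiKV/TiFlash as described above, the session has to load the TiSpark extension and know where the cluster's Placement Driver (PD) lives. The sketch below assembles the Spark settings that, as far as I recall from the TiSpark 3.x user guide, are the usual ones; treat the exact keys as an assumption and check them against the documentation for your release. The helper names and the PD addresses `pd0:2379` and `pd1:2379` are hypothetical.

```python
from typing import Dict, List


def tispark_spark_conf(
    pd_addresses: List[str], catalog_name: str = "tidb_catalog"
) -> Dict[str, str]:
    """Build Spark settings for TiSpark (keys assumed from the TiSpark 3.x docs)."""
    if not pd_addresses:
        raise ValueError("at least one PD address is required")
    joined = ",".join(pd_addresses)
    return {
        # Registers TiSpark's extensions with Spark SQL.
        "spark.sql.extensions": "org.apache.spark.sql.TiExtensions",
        # Tells TiSpark where to find the TiKV cluster's Placement Driver.
        "spark.tispark.pd.addresses": joined,
        # Exposes TiDB databases through a Spark catalog.
        f"spark.sql.catalog.{catalog_name}": "org.apache.spark.sql.catalyst.catalog.TiCatalog",
        f"spark.sql.catalog.{catalog_name}.pd.addresses": joined,
    }


def to_spark_config_lines(conf: Dict[str, str]) -> str:
    """Render settings as 'key value' lines, the layout of spark-defaults.conf."""
    return "\n".join(f"{key} {value}" for key, value in conf.items())


if __name__ == "__main__":
    # Hypothetical PD endpoints; replace with the real cluster addresses.
    print(to_spark_config_lines(tispark_spark_conf(["pd0:2379", "pd1:2379"])))
```

The rendered lines can be pasted into a cluster's Spark configuration, or the dictionary entries can be passed one by one to `SparkSession.builder.config(key, value)`.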
spline
- Show HN: First open source data discovery and observability platform
We found a way to leverage the Spline Agent (https://github.com/AbsaOSS/spline): we take its execution plans, transform them into a data model suited to our requirements, and explore the resulting relationships in a UI we developed. We also open-sourced our approach in a
What are some alternatives?
incubator-livy - Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
odd-platform - First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
kyuubi - Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
opendatadiscovery-specification - ODD Specification is a universal open standard for collecting metadata.
seq-datasource-v2 - Sequence Data Source for Apache Spark
parquet4s - Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Clustering4Ever - C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.