thanos-remote-read vs spark-extension
Metric | thanos-remote-read | spark-extension
---|---|---
Mentions | 1 | 1
Stars | 34 | 172
Growth | - | 4.7%
Activity | 3.5 | 8.3
Last commit | 2 days ago | 19 days ago
Language | Go | Scala
License | Apache License 2.0 | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
spark-extension
Data diffs: Algorithms for explaining what changed in a dataset (2022)
We're doing an env migration and I've been using the Spark diff extension to reconcile data. It's amazing; we've discovered bugs in the data logic so quickly.
Here is the extension if anyone is interested: https://github.com/G-Research/spark-extension/blob/master/DI...
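To illustrate the kind of reconciliation the post describes, here is a minimal sketch of a keyed diff between two small datasets in plain Python. It is not the spark-extension API: the function name `diff_rows`, the status labels, and the sample rows are all hypothetical and chosen only for illustration. The idea is to match rows by an id column and classify each one as unchanged, changed, deleted, or inserted.

```python
from typing import Any, Dict, List, Optional, Tuple

Row = Dict[str, Any]


def diff_rows(
    left: List[Row], right: List[Row], key: str
) -> List[Tuple[str, Any, Optional[Row], Optional[Row]]]:
    """Match rows by `key` and label each key with a diff status.

    Hypothetical helper for illustration; assumes `key` is unique per dataset.
    """
    left_by_key = {row[key]: row for row in left}
    right_by_key = {row[key]: row for row in right}
    result = []
    # Walk the union of keys so rows missing on either side are reported too.
    for k in sorted(set(left_by_key) | set(right_by_key)):
        l, r = left_by_key.get(k), right_by_key.get(k)
        if l is None:
            status = "inserted"   # only present in the new dataset
        elif r is None:
            status = "deleted"    # only present in the old dataset
        elif l == r:
            status = "unchanged"  # same values on both sides
        else:
            status = "changed"    # same key, different values
        result.append((status, k, l, r))
    return result


# Made-up sample data standing in for the old and new environments.
old_env = [{"id": 1, "value": "one"}, {"id": 2, "value": "two"}, {"id": 3, "value": "three"}]
new_env = [{"id": 1, "value": "one"}, {"id": 2, "value": "Two"}, {"id": 4, "value": "four"}]

for status, k, l, r in diff_rows(old_env, new_env, "id"):
    print(status, k, l, r)
```

Rows labeled as changed, deleted, or inserted are the ones worth inspecting after a migration; on large datasets the same classification would be done with a distributed join rather than in-memory dictionaries.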
What are some alternatives?
armada - A multi-cluster batch queuing system for high-throughput workloads on Kubernetes.
deep-diff2 - Deep diff Clojure data structures and pretty print the result
fsharp-formatting-conventions - G-Research F# code formatting guidelines
handy_sql_queries
pyspark-starter - Starter pyspark code with a working combination of all versions
recidiffist - Diffs for structured data
macrobase-diff - Minimal implementation of Macrobase Diff
ExplainDaV
Azure-Databricks-NYC-Taxi-Workshop - An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset
Apache Calcite
lakeFS - Data version control for your data lake | Git for data