Dblink Alternatives
Similar projects and alternatives to dblink
-
mmlspark
Discontinued Simple and Distributed Machine Learning [Moved to: https://github.com/microsoft/SynapseML]
-
CodeRabbit
CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
-
delight
Discontinued A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
-
entity-embed
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
-
splink
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
-
-
sparkMeasure
This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination of Spark metrics, making it a practical choice for both developers and data engineers.
-
-
InfluxDB
InfluxDB high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.
-
dblink discussion
dblink reviews and mentions
-
[D] Machine Learning and "Record Linkage"
Felligi-Sunter is the baseline model in record linkage research. It is implemented in R in fastLink and RecordLinkage, but you will need training data. There are some other options, e.g. dblink, that use Bayesian methods and a latent variable set up so you don’t need training data.
Stats
cleanzr/dblink is an open source project licensed under GNU General Public License v3.0 or later which is an OSI approved license.
The primary programming language of dblink is Scala.