Dblink Alternatives
Similar projects and alternatives to dblink based on common topics and language
-
entity-embed
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
-
splink
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
mmlspark
Discontinued Simple and Distributed Machine Learning [Moved to: https://github.com/microsoft/SynapseML]
-
sparkMeasure
This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination of Spark metrics, making it a practical choice for both developers and data engineers.
-
delight
A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
dblink reviews and mentions
-
[D] Machine Learning and "Record Linkage"
Felligi-Sunter is the baseline model in record linkage research. It is implemented in R in fastLink and RecordLinkage, but you will need training data. There are some other options, e.g. dblink, that use Bayesian methods and a latent variable set up so you don’t need training data.
Stats
cleanzr/dblink is an open source project licensed under GNU General Public License v3.0 or later which is an OSI approved license.
The primary programming language of dblink is Scala.
Popular Comparisons
Sponsored