What is the best approach to removing duplicate person records if the only identifier is person firstname middle name and last name? These names are entered in varying ways to the DB, thus they are free-fromatted.

This page summarizes the projects mentioned and recommended in the original post on /r/SQL

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • OpenRefine

    OpenRefine is a free, open source power tool for working with messy data and improving it

  • It's not suited to SQL, use Open Refine or python fuzzywuzzy.

    https://moj-analytical-services.github.io/splink/ is a FOSS python package (but it runs against your db using SQL).

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • What you need to know about the future of Mozilla Hubs

    1 project | news.ycombinator.com | 15 Feb 2024
  • OpenRefine

    1 project | /r/patient_hackernews | 23 Oct 2023
  • OpenRefine

    2 projects | news.ycombinator.com | 21 Oct 2023
  • java string equals returns false, even for identical strings

    1 project | /r/javahelp | 8 Sep 2023
  • UIUC MCS - CS 513 Review - Theory and Practice of Data Cleaning

    1 project | dev.to | 4 Sep 2023