Jellyfish Alternatives
Similar projects and alternatives to jellyfish
-
TextDistance
📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
-
Levenshtein
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity.
jellyfish reviews and mentions
-
Python Libraries
For sounds, something like https://github.com/jamesturk/jellyfish?
-
Comparing Strings (Street Names) With Machine Learning
When comparing strings (in our case, street names), there are plenty of off-the-shelf features that can be used, such as the string metrics provided by the jellyfish package. This package also provides a number of phonetic encodings. We can combine an encoding with a metric, such as Levenshtein distance, to measure the phonetic similarity between two street names: encode both names first, then compute the metric on the encoded forms.
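To make the "encoding plus metric" idea concrete, here is a minimal pure-Python sketch. It does not call jellyfish itself (the package ships its own implementations of these algorithms); the helper names `soundex`, `levenshtein`, `phonetic_key` and `phonetic_distance` are hypothetical, and American Soundex is just one assumed choice of phonetic encoding.

```python
def soundex(word: str) -> str:
    """American Soundex: first letter followed by three digits."""
    codes = {}
    for group, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
        for ch in group:
            codes[ch] = digit
    letters = [c for c in word.upper() if c.isalpha()]
    if not letters:
        return ""
    result = letters[0]
    prev = codes.get(letters[0], "")
    for ch in letters[1:]:
        digit = codes.get(ch, "")
        if digit and digit != prev:
            result += digit
        if ch not in "HW":  # H and W do not separate letters with equal codes
            prev = digit
    return (result + "000")[:4]


def levenshtein(a: str, b: str) -> int:
    """Edit distance via the classic row-by-row dynamic programme."""
    prev_row = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        row = [i]
        for j, cb in enumerate(b, start=1):
            row.append(min(prev_row[j] + 1,                # deletion
                           row[j - 1] + 1,                 # insertion
                           prev_row[j - 1] + (ca != cb)))  # substitution
        prev_row = row
    return prev_row[-1]


def phonetic_key(name: str) -> str:
    """Encode each word of a (street) name separately."""
    return " ".join(soundex(word) for word in name.split())


def phonetic_distance(a: str, b: str) -> int:
    """Levenshtein distance between the phonetic encodings of two names."""
    return levenshtein(phonetic_key(a), phonetic_key(b))


print(phonetic_distance("Meyer Street", "Meier Street"))  # spelling variants
print(phonetic_distance("Main Street", "Oak Street"))     # different names
```

Spelling variants that sound alike collapse to the same key and get a distance of zero, while unrelated names keep a positive distance. Whether Soundex is the right encoding for a given data set is something to test empirically.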
-
How to match names which differ slightly?
You can use a library like jellyfish, which implements a bunch of string comparison algorithms; you'd just have to experiment and see which one gives the best results for you. I think I've had the best luck with Jaro-Winkler: compute the similarity score, then pick a cutoff above which I have good confidence that the match is real. It's still not perfect, and I really don't see how your last example would work with just about any automated comparison.
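The score-and-cutoff approach described above could look roughly like the following. This is a pure-Python sketch of Jaro-Winkler, not the jellyfish implementation; the function names, the 0.9 cutoff and the lower-casing step are assumptions for illustration, and the cutoff in particular should be tuned against your own data.

```python
def jaro(s1: str, s2: str) -> float:
    """Jaro similarity in [0, 1]."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    # Characters count as matching only within this sliding window.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        for j in range(max(0, i - window), min(len2, i + window + 1)):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Matched characters that appear in a different order are transpositions.
    k = 0
    out_of_order = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    t = out_of_order // 2
    return (matches / len1 + matches / len2 + (matches - t) / matches) / 3


def jaro_winkler(s1: str, s2: str, scaling: float = 0.1) -> float:
    """Jaro similarity boosted for a shared prefix of up to four characters."""
    score = jaro(s1, s2)
    prefix = 0
    for c1, c2 in zip(s1[:4], s2[:4]):
        if c1 != c2:
            break
        prefix += 1
    return score + prefix * scaling * (1 - score)


def is_match(a: str, b: str, cutoff: float = 0.9) -> bool:
    """Treat two names as the same if their score reaches the cutoff."""
    return jaro_winkler(a.strip().lower(), b.strip().lower()) >= cutoff


print(round(jaro_winkler("MARTHA", "MARHTA"), 4))
print(is_match("Martha", "Marhta"), is_match("Martha", "Xyz"))
```

A higher cutoff trades recall for precision, so it is worth inspecting borderline pairs by hand before settling on a value.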
Stats
jamesturk/jellyfish is an open source project licensed under MIT License which is an OSI approved license.
The primary programming language of jellyfish is Jupyter Notebook.