dash-tools vs intertext
| | dash-tools | intertext |
|---|---|---|
| Mentions | 1 | 1 |
| Stars | 88 | 110 |
| Growth | - | 0.0% |
| Activity | 4.3 | 0.0 |
| Last commit | 6 months ago | 12 months ago |
| Language | Python | Python |
| License | MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Ask HN: Tool to find text reuse, similar paragraphs, fuzzy/near dupes in folder?
Do you know of any tool that I can use to compare my own notes and documents vault in search of copied paragraphs or nearly identical phrases? Normal diffing/hashing wouldn't work, as we're talking about the contents of slightly modified documents and the comparison of each file against all others.
I found the following tools that seem related yet not quite there; maybe I'm missing a particular term of art?
https://github.com/YaleDHLab/intertext
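One common way to approach the problem described in the post is to split each paragraph into overlapping word windows ("shingles") and score every pair by Jaccard similarity. The sketch below is only an illustration of that idea, not how intertext or any other listed tool works internally; the function names, the sample notes, and the 0.5 threshold are all hypothetical.

```python
from itertools import combinations
import re


def shingles(text, k=3):
    """Return the set of k-word shingles (overlapping word windows) in text."""
    words = re.findall(r"\w+", text.lower())
    if len(words) < k:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets: size of intersection over size of union."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def near_duplicates(paragraphs, threshold=0.5, k=3):
    """Compare every paragraph against every other; keep pairs at or above threshold."""
    sets = {name: shingles(text, k) for name, text in paragraphs.items()}
    hits = []
    for (n1, s1), (n2, s2) in combinations(sets.items(), 2):
        score = jaccard(s1, s2)
        if score >= threshold:
            hits.append((n1, n2, score))
    # Most similar pairs first.
    return sorted(hits, key=lambda h: h[2], reverse=True)


# Hypothetical notes; in practice these would be paragraphs read from files.
notes = {
    "a.md": "The quick brown fox jumps over the lazy dog near the river bank.",
    "b.md": "The quick brown fox jumps over the lazy dog near the old river bank.",
    "c.md": "Completely unrelated text about cooking pasta with tomato sauce.",
}
pairs = near_duplicates(notes)
for first, second, score in pairs:
    print(f"{first} ~ {second}: {score:.2f}")
```

The all-pairs loop grows quadratically with the number of paragraphs, which is fine for a small vault but is the reason larger collections usually turn to hashing schemes such as MinHash with LSH.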
What are some alternatives?
dash-cytoscape - Interactive network visualization in Python and Dash, powered by Cytoscape.js
sourmash - Quickly search, compare, and analyze genomic and metagenomic data sets.
vizro - Vizro is a toolkit for creating modular data visualization applications.
neardup - Near-duplicate detection
hiitpi - A workout trainer Dash/Flask app that helps track your HIIT workouts by analyzing real-time video streaming from your sweet Pi using machine learning and Edge TPU.
LSH - Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
Dash.jl - Dash for Julia - A Julia interface to the Dash ecosystem for creating analytic web applications in Julia. No JavaScript required.
Neural-Scam-Artist - Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.
python-explorer - A Python environment exploration interface.
datasketch - MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
ipyvizzu-story - Build, present and share animated data stories in Jupyter Notebook and similar environments.
dash - Data Apps & Dashboards for Python. No JavaScript Required.
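Several of the alternatives above (LSH, datasketch) are built around MinHash. As a rough illustration of the idea, and not the API of any of those libraries, the following standard-library sketch builds a MinHash signature per document and estimates Jaccard similarity from the fraction of matching signature slots. The helper names, the choice of `blake2b` with a per-slot salt, and the 128-slot signature size are assumptions made for this example.

```python
import hashlib


def word_shingles(text, k=3):
    """Set of k-word shingles; short texts yield a single shingle."""
    words = text.lower().split()
    return {" ".join(words[i:i + k]) for i in range(max(len(words) - k + 1, 1))}


def minhash_signature(items, num_perm=128):
    """For each of num_perm salted hash functions, keep the minimum hash over items."""
    signature = []
    for seed in range(num_perm):
        salt = seed.to_bytes(4, "big")  # a different salt simulates a different hash function
        signature.append(min(
            int.from_bytes(
                hashlib.blake2b(item.encode("utf-8"), digest_size=8, salt=salt).digest(),
                "big",
            )
            for item in items
        ))
    return signature


def estimate_jaccard(sig_a, sig_b):
    """Fraction of signature slots that agree; approximates Jaccard similarity."""
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)


doc_a = "The quick brown fox jumps over the lazy dog near the river bank."
doc_b = "The quick brown fox jumps over the lazy dog near the old river bank."
doc_c = "Completely unrelated text about cooking pasta with tomato sauce."

sig_a = minhash_signature(word_shingles(doc_a))
sig_b = minhash_signature(word_shingles(doc_b))
sig_c = minhash_signature(word_shingles(doc_c))

similar = estimate_jaccard(sig_a, sig_b)
different = estimate_jaccard(sig_a, sig_c)
print(f"a vs b: {similar:.2f}, a vs c: {different:.2f}")
```

Signatures have a fixed length regardless of document size, so they can be stored and compared cheaply; an LSH index would then bucket signatures so that only likely matches are compared at all.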