Ask HN: Tool to find text reuse, similar paragraphs, fuzzy/near dupes in folder?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • intertext

    Detect and visualize text reuse

  • Do you know of any too that I can use to compare my own notes and documents vault in search for copied paragraphs or almost similar phrases? Normal diffing/hashing wouldn't work as we're talking about the contents of slightly modified documents, and the comparison of each file against all others.

    I found the following tools that seem related yet not quite there, maybe I'm missing a particular term of art?

    https://github.com/YaleDHLab/intertext

  • neardup

    Near-duplicate detection

  • Python app. Requires to load and tag a corpus of text, it is used to compare different works in a visual way.

    https://github.com/e-orlov/neardup

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Show HN: Pi-C.A.R.D, a Raspberry Pi Voice Assistant

    3 projects | news.ycombinator.com | 13 May 2024
  • Open Source Tool Changer for FDM 3D Printers

    4 projects | news.ycombinator.com | 13 May 2024
  • everything-ai: the power of AI, on your computer

    2 projects | dev.to | 14 May 2024
  • Ask HN: Founders who offer free/OS and paid SaaS, how do you manage your code?

    17 projects | news.ycombinator.com | 13 May 2024
  • Show HN: Julep: A platform to manage memories, knowledge and tools for LLM apps

    1 project | news.ycombinator.com | 14 May 2024