dictomaton VS PSI

Compare dictomaton vs PSI and see what are their differences.

PSI

Private Set Intersection Cardinality protocol based on ECDH and Bloom Filters (by OpenMined)
Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
dictomaton PSI
2 3
129 125
- 1.6%
1.8 5.2
about 2 years ago 18 days ago
Java C++
Apache License 2.0 Apache License 2.0
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

dictomaton

Posts with mentions or reviews of dictomaton. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-09-11.
  • Calculate the difference and intersection of any two regexes
    4 projects | news.ycombinator.com | 11 Sep 2023
    Say you want to compute all strings of length 5 that the automaton can generate. Conceptually the nicest way is to create an automaton that matches any five characters and then compute the intersection between that automaton and the regex automaton. Then you can generate all the strings in the intersection automaton. Of course, IRL, you wouldn't actually generate the intersection (you can easily do this on the fly), but you get the idea.

    Automata are really a lost art in modern natural language processing. We used to do things like store a large vocabulary in an deterministic acyclic minimized automaton (nice and compact, so-called dictionary automaton). And then to find, say all words within Levenshtein distance 2 of hacker, create a Levenshtein automaton for hacker and then compute (on the fly) the intersection between the Levenshtein automaton and the dictionary automaton. The language of the automaton is then all words within the intersection automaton.

    I wrote a Java package a decade ago that implements some of this stuff:

    https://github.com/danieldk/dictomaton

  • Ask HN: What are some 'cool' but obscure data structures you know about?
    54 projects | news.ycombinator.com | 21 Jul 2022
    Also related: Levenshtein automata - automata for words that match every word within a given Levenshtein distance. The intersection of a Levenshtein automaton of a word and a DAWG gives you an automaton of all words within the given edit distance.

    I haven't done any Java in years, but I made a Java package in 2013 that supports: DAWGs, Levenshtein automata and perfect hash automata:

    https://github.com/danieldk/dictomaton

PSI

Posts with mentions or reviews of PSI. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-07-21.
  • Can a new form of cryptography solve the internet’s privacy problem?
    1 project | news.ycombinator.com | 30 Oct 2022
    There are other techniques that aren't generally included in the "Zero Knowledge Proofs" set of techniques that are perhaps more practical for general development.

    For example, I fine private set intersection[1] as implemented by OpenMined a really useful primative a bunch of privacy enhancing applications can be built on top of.

    My colleagues and I recently published a pre-print[2] showing how to use this for sharing locations you and another person have had in common, without being able to see other locations. The paper talks about a social network built around this but I also think there are useful applications in things like real-world games (scavenger hunts etc)

    [1] https://github.com/OpenMined/PSI/blob/master/private_set_int...

    [2] https://arxiv.org/abs/2210.01927

  • Ask HN: What are some 'cool' but obscure data structures you know about?
    54 projects | news.ycombinator.com | 21 Jul 2022
    I came here to say Golomb compressed sets except now I see it's part of the question!

    They are used by default in the OpenMined implementation of Private Set Intersection[1] - a multi-party computation technique.

    [1] https://github.com/OpenMined/PSI/blob/master/private_set_int...

  • Is there a Private Set Intersection protocol where the server learns the length of the intersection?
    1 project | /r/crypto | 3 Sep 2021
    I was using OpenMinded/PSI exploring some PSI implementations, but I would like a way for the server to know the intersection size. Say Signal wants to calculate the average number of users from one person's address book (or whatever).

What are some alternatives?

When comparing dictomaton and PSI you can also consider the following projects:

ann-benchmarks - Benchmarks of approximate nearest neighbor libraries in Python

ctrie-java - Java implementation of a concurrent trie

sdsl-lite - Succinct Data Structure Library 2.0

AspNetCoreDiagnosticScenarios - This repository has examples of broken patterns in ASP.NET Core applications

RVS_Generic_Swift_Toolbox - A Collection Of Various Swift Tools, Like Extensions and Utilities

t-digest - A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means

multiversion-concurrency-contro

cheerp-meta - Cheerp - a C/C++ compiler for Web applications - compiles to WebAssembly and JavaScript

minisketch - Minisketch: an optimized library for BCH-based set reconciliation

swift - the multiparty transport protocol (aka "TCP with swarming" or "BitTorrent at the transport layer")

TablaM - The practical relational programing language for data-oriented applications

pvfmm - A parallel kernel-independent FMM library for particle and volume potentials