Unredactor

In this project we are tryinbg to create unredactor. Unredactor will take a redacted document and the redacted flag as input, inreturn it will give the most likely candidates to fill in redacted location. In this project we are only considered about unredacting names only. The data that we are considering is imdb data set with many review files. These files are used to buils corpora for finding tfidf score. Few files are used to train and in these files names are redacted and written into redacted folder. These redacted files are used for testing and different classification models are built to predict the probabilies of each class. Top 5 classes i.e names similar to the test features are written at the end of text in unreddacted foleder. (by gt0410)

Unredactor Alternatives

Similar projects and alternatives to Unredactor

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better Unredactor alternative or higher similarity.

Unredactor reviews and mentions

Posts with mentions or reviews of Unredactor. We have used some of these posts to build our list of alternatives and similar projects.
  • Redacted and Sanitized
    1 project | /r/conspiracyNOPOL | 24 Oct 2022
    Interestingly, some years back (perhaps 12-15 years?) someone developed a program that would examine the font a physically redacted document was written in, and the spacing to try to unredact it, with some relatively decent success as only a set combination of words/letters etc. could fill a specific redacted portion. Of course the larger the redacted block, the harder it becomes. It was interesting none the less, not sure what happened to it though. This: https://github.com/gt0410/Unredactor is similar, but not what I was thinking of, and this: https://hackaday.com/2008/08/01/exposing-poorly-redacted-pdfs/ may also prove interesting for you.

Stats

Basic Unredactor repo stats
1
0
10.0
over 2 years ago

gt0410/Unredactor is an open source project licensed under GNU General Public License v3.0 only which is an OSI approved license.

The primary programming language of Unredactor is Python.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com