torchextractor VS Unredactor

Compare torchextractor vs Unredactor and see what are their differences.

Unredactor

In this project we are tryinbg to create unredactor. Unredactor will take a redacted document and the redacted flag as input, inreturn it will give the most likely candidates to fill in redacted location. In this project we are only considered about unredacting names only. The data that we are considering is imdb data set with many review files. These files are used to buils corpora for finding tfidf score. Few files are used to train and in these files names are redacted and written into redacted folder. These redacted files are used for testing and different classification models are built to predict the probabilies of each class. Top 5 classes i.e names similar to the test features are written at the end of text in unreddacted foleder. (by gt0410)
Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
torchextractor Unredactor
1 1
99 0
- -
4.2 10.0
about 3 years ago over 2 years ago
Python Python
Apache License 2.0 GNU General Public License v3.0 only
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

torchextractor

Posts with mentions or reviews of torchextractor. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-03-11.
  • [P] Pytorch: Intermediate Feature Extraction
    2 projects | /r/MachineLearning | 11 Mar 2021
    Recently I worked on torchextrator, a standalone python package that makes it simple to extract features in PyTorch. You no longer need to duplicate code and rewrite the forward function. Also the extractor supports nested modules, custom caching operations and is ONNX compatible!

Unredactor

Posts with mentions or reviews of Unredactor. We have used some of these posts to build our list of alternatives and similar projects.
  • Redacted and Sanitized
    1 project | /r/conspiracyNOPOL | 24 Oct 2022
    Interestingly, some years back (perhaps 12-15 years?) someone developed a program that would examine the font a physically redacted document was written in, and the spacing to try to unredact it, with some relatively decent success as only a set combination of words/letters etc. could fill a specific redacted portion. Of course the larger the redacted block, the harder it becomes. It was interesting none the less, not sure what happened to it though. This: https://github.com/gt0410/Unredactor is similar, but not what I was thinking of, and this: https://hackaday.com/2008/08/01/exposing-poorly-redacted-pdfs/ may also prove interesting for you.

What are some alternatives?

When comparing torchextractor and Unredactor you can also consider the following projects:

stringlifier - Stringlifier is on Opensource ML Library for detecting random strings in raw text. It can be used in sanitising logs, detecting accidentally exposed credentials and as a pre-processing step in unsupervised ML-based analysis of application text data.

awesome-gradient-boosting-papers - A curated list of gradient boosting research papers with implementations.

muzero-general - MuZero

wordview - A Python package for Exploratory Data Analysis (EDA) for text-based data.

nni - An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

DeepMalwareDetector - A Deep Learning framework that analyses Windows PE files to detect malicious Softwares.

merged_depth - Monocular Depth Estimation - Weighted-average prediction from multiple pre-trained depth estimation models

obsei - Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand image analysis, comparative study and more .

carbon - :black_heart: Create and share beautiful images of your source code

mljar-supervised - Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation

mapextrackt - Pytorch Feature Map Extractor

orange - 🍊 :bar_chart: :bulb: Orange: Interactive data analysis