Unredactor
wordview
Unredactor | wordview | |
---|---|---|
1 | 1 | |
0 | 11 | |
- | - | |
10.0 | 9.4 | |
over 2 years ago | 6 days ago | |
Python | Python | |
GNU General Public License v3.0 only | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Unredactor
-
Redacted and Sanitized
Interestingly, some years back (perhaps 12-15 years?) someone developed a program that would examine the font a physically redacted document was written in, and the spacing to try to unredact it, with some relatively decent success as only a set combination of words/letters etc. could fill a specific redacted portion. Of course the larger the redacted block, the harder it becomes. It was interesting none the less, not sure what happened to it though. This: https://github.com/gt0410/Unredactor is similar, but not what I was thinking of, and this: https://hackaday.com/2008/08/01/exposing-poorly-redacted-pdfs/ may also prove interesting for you.
wordview
-
[P] Open-source Python package for Exploratory Data Analysis for modern NLP applications looking for contributors.
If you're working on NLP and you're serious about understanding your data, check out Wordview.
What are some alternatives?
awesome-gradient-boosting-papers - A curated list of gradient boosting research papers with implementations.
NLP-Model-for-Corpus-Similarity - A NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
torchextractor - Feature extraction made simple with torchextractor
tf-idf - Term Frequency-Inverse Document Frequency from Scratch
DeepMalwareDetector - A Deep Learning framework that analyses Windows PE files to detect malicious Softwares.
pandas-profiling - Create HTML profiling reports from pandas DataFrame objects [Moved to: https://github.com/ydataai/pandas-profiling]
obsei - Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand image analysis, comparative study and more .
ydata-profiling - 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
mljar-supervised - Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
orange - 🍊 :bar_chart: :bulb: Orange: Interactive data analysis