presidio
randomizer
presidio | randomizer | |
---|---|---|
5 | 1 | |
3,102 | 17 | |
4.1% | - | |
8.9 | 5.6 | |
3 days ago | about 1 month ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
presidio
-
You can't build a moat with AI
Perhaps de-identification before training could be helpful here.
Microsoft does seem active in this, e.g. https://microsoft.github.io/presidio/
- Presidio β Data Protection and De-Identification SDK
-
Show HN: Cape API β Keep your sensitive data private while using GPT-4
Something like https://github.com/microsoft/presidio for stripping out PII might fill the role I expected https://github.com/capeprivacy/private-ai to do.
-
Handling PII data in Azure
Depending on your use case, you may want to check out Presidio as well. Itβs a Microsoft product for PII scrubbing. Perfect for ADF and Synapse pipelines.
-
Data Anonymization
There's an API from Microsoft, named Presidio which is used for Anonymization. This is the Github link.
randomizer
What are some alternatives?
DataProfiler - What's in your data? Extract schema, statistics and entities from datasets
adversarial-robustness-toolbox - Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
exodus - Platform to audit trackers used by Android application
whoogle-search - A self-hosted, ad-free, privacy-respecting metasearch engine
databunker - A secure user directory built for developers to comply with the GDPR [Moved to: https://github.com/securitybunker/databunker]
Shynet - Modern, privacy-friendly, and detailed web analytics that works without cookies or JS.
Databunker - Secure SDK/vault for personal records/PII built to comply with GDPR
tribler - Privacy enhanced BitTorrent client with P2P content discovery
techbench-json-dump - Dump Tech Bench metadata to a JSON file.
PySyft - Perform data science on data that remains in someone else's server
PrivacyEngCollabSpace - Privacy Engineering Collaboration Space
hosts - π Consolidating and extending hosts files from several well-curated sources. Optionally pick extensions for porn, social media, and other categories.