presidio
PrivacyEngCollabSpace
presidio | PrivacyEngCollabSpace | |
---|---|---|
5 | 1 | |
3,102 | 220 | |
4.1% | 2.3% | |
8.9 | 7.4 | |
4 days ago | about 1 month ago | |
Python | Python | |
MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
presidio
-
You can't build a moat with AI
Perhaps de-identification before training could be helpful here.
Microsoft does seem active in this, e.g. https://microsoft.github.io/presidio/
- Presidio – Data Protection and De-Identification SDK
-
Show HN: Cape API – Keep your sensitive data private while using GPT-4
Something like https://github.com/microsoft/presidio for stripping out PII might fill the role I expected https://github.com/capeprivacy/private-ai to do.
-
Handling PII data in Azure
Depending on your use case, you may want to check out Presidio as well. It’s a Microsoft product for PII scrubbing. Perfect for ADF and Synapse pipelines.
-
Data Anonymization
There's an API from Microsoft, named Presidio which is used for Anonymization. This is the Github link.
PrivacyEngCollabSpace
-
What format / templates do you (CISOs/ISOs) use for your risk assessments of the org?
I would look into some NIST-provided tools like this one: https://github.com/usnistgov/PrivacyEngCollabSpace/tree/master/tools/risk-assessment/NIST-Privacy-Risk-Assessment-Methodology-PRAM. Haven't used it myself but it looks like it might fit your use-case.
What are some alternatives?
DataProfiler - What's in your data? Extract schema, statistics and entities from datasets
differential-privacy-library - Diffprivlib: The IBM Differential Privacy Library
exodus - Platform to audit trackers used by Android application
attack-control-framework-mappings - 🚨ATTENTION🚨 The NIST 800-53 mappings have migrated to the Center’s Mappings Explorer project. See README below. This repository is kept here as an archive.
databunker - A secure user directory built for developers to comply with the GDPR [Moved to: https://github.com/securitybunker/databunker]
PyDP - The Python Differential Privacy Library. Built on top of: https://github.com/google/differential-privacy
Databunker - Secure SDK/vault for personal records/PII built to comply with GDPR
gretel-synthetics - Synthetic data generators for structured and unstructured text, featuring differentially private learning.
techbench-json-dump - Dump Tech Bench metadata to a JSON file.
tern - Tern is a software composition analysis tool and Python library that generates a Software Bill of Materials for container images and Dockerfiles. The SBOM that Tern generates will give you a layer-by-layer view of what's inside your container in a variety of formats including human-readable, JSON, HTML, SPDX and more.
private-ai - Repo for Udacity's Secure & Private AI course
transformer-smaller-training-vocab - Temporary remove unused tokens during training to save ram and speed.