Python Privacy

Open-source Python projects categorized as Privacy

Top 23 Python Privacy Projects

  • hosts

    🔒 Consolidating and extending hosts files from several well-curated sources. Optionally pick extensions for porn, social media, and other categories.

  • Project mention: Does PiHole block porn? | /r/pihole | 2023-12-06

    Not by default but a blocklist can be found here https://github.com/StevenBlack/hosts

  • macOS-Security-and-Privacy-Guide

    Guide to securing and improving privacy on macOS

  • Project mention: Hardening macOS | /r/MacOS | 2023-07-03
  • ungoogled-chromium

    Google Chromium, sans integration with Google

  • Project mention: console.log(DOOM) | news.ycombinator.com | 2024-02-25
  • PySyft

    Perform data science on data that remains in someone else's server

  • Project mention: A Better Mastodon Client | news.ycombinator.com | 2023-12-21

    https://github.com/OpenMined/PySyft - Federated Learning data science

    Incentives are much harder but smart contracts can handle the tech part.

    Going this route eventually you quickly have "quantum AI app store" and your system of government is a 12GB download. Can't even say if it's a good idea compared to e.g. anarcho-primitivism.

  • whoogle-search

  • Project mention: So I deployed Whoogle on my NAS.... | /r/selfhosted | 2023-12-08
  • tribler

    Privacy enhanced BitTorrent client with P2P content discovery

  • Project mention: Tribler 7.13.0 | news.ycombinator.com | 2023-08-30

    > Towards making Bittorrent anonymous and impossible to shut down.

    > We use our own dedicated Tor-like network for anonymous torrent downloading. We implemented and enhanced the Tor protocol specifications. Tribler includes our own Tor-like onion routing network with hidden services based seeding and end-to-end encryption.

    https://github.com/Tribler/tribler#readme (GPLv3 although also LGPL)

    I first thought "Is Justin Bieber gay?" in their release was some kind of vandalism of their release, but no, they pose that question as a vehicle for how their software is attempting to solve(?) trusted tagging

  • adversarial-robustness-toolbox

    Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams

  • FedML

    FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, FEDML Nexus AI (https://fedml.ai) is your generative AI platform at scale.

  • Project mention: [Experiment] The future of AI is open-source, and here is the plan | /r/samkoesnadi | 2023-06-05

    FedML https://github.com/FedML-AI/FedML might already provide a lot of tools to do the job

  • ProxyBroker

    Proxy [Finder | Checker | Server]. HTTP(S) & SOCKS 🎭

  • presidio

    Context aware, pluggable and customizable data protection and de-identification SDK for text and images

  • Project mention: You can't build a moat with AI | news.ycombinator.com | 2024-04-11

    Perhaps de-identification before training could be helpful here.

    Microsoft does seem active in this, e.g. https://microsoft.github.io/presidio/

  • Shynet

    Modern, privacy-friendly, and detailed web analytics that works without cookies or JS.

  • Project mention: It Took Me a Decade to Find the Perfect Personal Website Stack – Ghost+Fathom | news.ycombinator.com | 2023-07-09

    +1 on shynet! I use it for my personal website and my blog, and it's been working great.

    I got it up and running with Podman, so no need to install and run the Docker daemon. I also fixed SQLite support [1], so no need for an additional DB server.

    I analyzed available open-source web analytics tools [2] and AFAIK there is no simpler solution for web analytics that doesn't involve a third party.

    [1] https://github.com/milesmcc/shynet/issues/208

    [2] https://blog.fidelramos.net/software/privacy-respecting-self...

  • bleachbit

    BleachBit system cleaner for Windows and Linux

  • Project mention: Change in "Web Data" Autofill file under User Data\Default | /r/techsupport | 2023-08-19

    For those who want the complete story, this is a recent issue with the BleachBit cleaner, tracked on GitHub: https://github.com/bleachbit/bleachbit/issues/1518

  • email2phonenumber

    An OSINT tool to obtain a target's phone number just by having their email address

  • Project mention: FLaNK Stack Weekly for 20 Nov 2023 | dev.to | 2023-11-20
  • privacy

    Library for training machine learning models with privacy for training data

  • noisy

    Simple random DNS, HTTP/S internet traffic noise generator

  • hosts

    Hostfile blocklist for ads and tracking, updated regularly (by lightswitch05)

  • Project mention: DNS server set to Pihole but no traffic | /r/pihole | 2023-06-24

    I've added the Ads & Tracking list and the AMP Hosts list from Developer Dan to the default list; any others you recommend I add? It's hard to tell if the ads coming through are a 'my blocklist isn't good enough' problem or a 'my pihole's not set up properly yet' problem.

  • DataProfiler

    What's in your data? Extract schema, statistics and entities from datasets

  • Project mention: LongRoPE: Extending LLM Context Window Beyond 2M Tokens | news.ycombinator.com | 2024-02-22

    It's been possible to skip tokenization for a long time, my team and I did it here - https://github.com/capitalone/DataProfiler

    For what it's worth, we were actually working with LSTMs with nearly a billion params back around 2016-2017. Transformers made it far more effective to train and execute, but ultimately LSTMs are able to achieve similar results, though they are slower and require more training data.

  • OpenWPM

    A web privacy measurement framework

  • tf-encrypted

    A Framework for Encrypted Machine Learning in TensorFlow

  • no-google

    Completely block Google and its services

  • Project mention: Is there a filter or otherwise a way to block all google domains except YouTube? | /r/Adguard | 2023-05-12

    Yes

  • Shreddit

    Remove your comment history on Reddit as deleting an account does not do so.

  • Project mention: Reddit Fulfilled My Data Copy Request - What's the best script to use this to nuke? | /r/privacy | 2023-07-11

    Some scripts like https://github.com/x89/Shreddit look promising, and I'm getting ready to pull the trigger on it just once I make sure my whitelist IDs are good. However, it's probably not thorough enough to hit all my content. My reddit data has over 68,000 comments.

  • Social-Amnesia

    Forget the past. Social Amnesia makes sure your social media accounts only show your posts from recent history, not from "that phase" 5 years ago.

  • concrete-ml

    Concrete ML: Privacy Preserving ML framework built on top of Concrete, with bindings to traditional ML frameworks.

  • Project mention: Show HN: Logistic Regression Training on Encrypted Data with FHE | news.ycombinator.com | 2024-02-06
NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source Privacy projects in Python? This list will help you:

Rank  Project                           Stars
1     hosts                             25,463
2     macOS-Security-and-Privacy-Guide  20,878
3     ungoogled-chromium                18,764
4     PySyft                            9,253
5     whoogle-search                    8,789
6     tribler                           4,476
7     adversarial-robustness-toolbox    4,447
8     FedML                             4,052
9     ProxyBroker                       3,707
10    presidio                          3,077
11    Shynet                            2,806
12    bleachbit                         2,701
13    email2phonenumber                 1,947
14    privacy                           1,868
15    noisy                             1,621
16    hosts                             1,485
17    DataProfiler                      1,357
18    OpenWPM                           1,311
19    tf-encrypted                      1,193
20    no-google                         1,173
21    Shreddit                          985
22    Social-Amnesia                    799
23    concrete-ml                       770
