sayn
ghostpii_client
sayn | ghostpii_client | |
---|---|---|
2 | 3 | |
117 | 23 | |
0.9% | - | |
6.8 | 1.1 | |
5 days ago | about 1 year ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sayn
-
Average reply times from some of my Facebook friends over the last few years [OC], full article here: https://medium.com/@timsugaipov/taking-your-facebook-messenger-data-further-f9da079b1409?source=friends_link&sk=3bd04bb35ad9a4b6f586300e52f96e4f
Data Processing: SAYN
-
Introducing SAYN: A Simple Yet Powerful Data Processing Framework.
We believe simplicity to be crucial when maintaining pipelines at scale. However, we also believe that simplicity should not come at the expense of flexibility. This is why we have built our own open source data processing framework: SAYN. SAYN is designed to empower analytics teams by being simple, flexible and centralised. It democratises the contribution to data processes within an analytics team, enables full flexibility and helps save a lot of time through automation.
ghostpii_client
-
Help me spread the word, or at least play with a free toy
I am an entrepreneur trying to get a movement going to really start using this tech at big corporations to keep them out of trouble. I am guessing the conversation in here is a little more abstract than my usual day-to-day (although I am a reformed mathematician) but I wanted to introduce myself nonetheless.
If anybody is interested we maintain a software library, implemented in Python, that is designed to let relatively everyday people (software engineers, data scientists, etc.) use these privacy-enhancing techniques in a familiar interface without a rocket science course. If you go to the GitHub page I link below there is a Binder server where you can play with it right now via a Jupyter notebook over the web with basically no work or commitment.
https://github.com/capnion/ghostpii_client
I also put a ton of content out on LinkedIn, mostly oriented towards why businesses should adopt these things, what to do with them, and how they relate to other trends.
https://www.linkedin.com/in/alexander-c-mueller-phd-0272a6108/
I would greatly appreciate engagement of any kind: test-drivers, early-adopters, complainers, design feedback, likes, reshares, stars, emails. I am a true believer trying to this tech out where it can do some good and I need to spread the word.
- help me spread the word, or at least play with a free toy
What are some alternatives?
dbt-databricks - A dbt adapter for Databricks.
python-fpe - FPE - Format Preserving Encryption with FF3 in Python
dataform - Dataform is a framework for managing SQL based data operations in BigQuery
linkedin-visualizer - The missing feature in LinkedIn
tinvois-parser - Extract receipt info
dagster - An orchestration platform for the development, production, and observation of data assets.
data-engineering-wiki - The best place to learn data engineering. Built and maintained by the data engineering community.
Apache Superset - Apache Superset is a Data Visualization and Data Exploration Platform [Moved to: https://github.com/apache/superset]
beneath - Beneath is a serverless real-time data platform ⚡️
modin - Modin: Scale your Pandas workflows by changing a single line of code
yaetos - Write data & AI pipelines in (SQL, Spark, Pandas) and deploy to the cloud, simplified
versatile-data-kit - One framework to develop, deploy and operate data workflows with Python and SQL.