amazon-s3-find-and-forget vs awesome-aws
 | amazon-s3-find-and-forget | awesome-aws |
---|---|---|
Mentions | 3 | 2 |
Stars | 232 | 12,165 |
Growth | 0.9% | - |
Activity | 7.3 | 2.6 |
Last commit | 8 days ago | about 2 months ago |
Language | Python | Python |
License | Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
amazon-s3-find-and-forget
- Deleting particular data from S3 External Tables
  Take a look at this: https://github.com/awslabs/amazon-s3-find-and-forget We use it for GDPR compliance; it opens a file, deletes a row, and packs it back. It modifies the file, so watch out if you are using Glue job bookmarks. Because you are using external tables, the manifest file will also have to be updated with the correct length for the new, updated file. If you have hundreds of tables and thousands of files and need to do this on a regular basis, this would be the scalable solution, but if you have only a few files, honestly, I would do it manually.
- Update S3 Files
  Have a look at S3 Find and Forget.
- How to handle GDPR requests for data stored in S3?
  S3 Find and Forget is probably worth looking into, even if just to get ideas on how to implement a similar solution for yourself.
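
The first mention above describes the basic cycle: open a file, delete the matching rows, write it back, then correct the file's length in the manifest. A minimal sketch of that cycle follows. It is not how amazon-s3-find-and-forget is implemented. It works on in-memory bytes holding newline-delimited JSON instead of real S3 objects, and it assumes a manifest shaped like `{"entries": [{"url": ..., "meta": {"content_length": ...}}]}`. The function names `forget_rows` and `update_manifest`, the bucket URL, and the column name are all hypothetical.

```python
import json


def forget_rows(data: bytes, column: str, match_ids: set) -> bytes:
    """Return the file contents without rows whose `column` value is in `match_ids`.

    Assumes newline-delimited JSON; a real data lake file would more
    likely be Parquet and need a Parquet reader/writer instead.
    """
    kept = []
    for line in data.decode("utf-8").splitlines():
        if not line.strip():
            continue  # skip blank lines
        row = json.loads(line)
        if row.get(column) not in match_ids:
            kept.append(json.dumps(row))
    if not kept:
        return b""
    return ("\n".join(kept) + "\n").encode("utf-8")


def update_manifest(manifest: dict, url: str, new_length: int) -> dict:
    """Return a copy of the manifest with content_length refreshed for `url`.

    The manifest layout here is an assumption for illustration.
    """
    entries = []
    for entry in manifest["entries"]:
        if entry["url"] == url:
            meta = {**entry.get("meta", {}), "content_length": new_length}
            entry = {**entry, "meta": meta}
        entries.append(entry)
    return {**manifest, "entries": entries}


# Hypothetical object URL and contents.
url = "s3://example-bucket/data/part-0.json"
original = b'{"user_id": "u1", "v": 1}\n{"user_id": "u2", "v": 2}\n'
manifest = {"entries": [{"url": url, "meta": {"content_length": len(original)}}]}

rewritten = forget_rows(original, "user_id", {"u1"})
manifest = update_manifest(manifest, url, len(rewritten))
```

Because the rewritten object is shorter than the original, a manifest left with the old length would no longer describe the file, which is the pitfall the comment warns about.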
awesome-aws
- There are 40,000+ quality AWS open source repositories on GitHub, but they are completely unorganized. I made a search engine and browser for all of them, all curated carefully with 1000+ filters.
  There is also https://github.com/donnemartin/awesome-aws
- Scope of GCP in India
What are some alternatives?
DataEngineeringProject - Example end to end data engineering project.
web-client-for-aws-transfer-family - This solution creates a web portal for your customers to access your corporate Secure Shell File Transfer Protocol (SFTP) environment. It combines the benefits of using AWS Transfer for SFTP with an intuitive web browser interface for your non-technical users.
isp-data-pollution - ISP Data Pollution to Protect Private Browsing History with Obfuscation
aws-cli - Universal Command Line Interface for Amazon Web Services
data-toolset - Upgrade from avro-tools and parquet-tools jars to a more user-friendly Python package.
aws-sso-util - Smooth out the rough edges of AWS SSO (temporarily, until AWS makes it better).
s3-credentials - A tool for creating credentials for accessing S3 buckets
PMapper - A tool for quickly evaluating IAM permissions in AWS.
cookiecutter-django-ecs-github - Complete Walkthrough: Blue/Green Deployment to AWS ECS using Cookiecutter-Django using GitHub actions
streamalert - StreamAlert is a serverless, realtime data analysis framework which empowers you to ingest, analyze, and alert on data from any environment, using datasources and alerting logic you define.
awesome-kubernetes - A curated list for awesome kubernetes sources :ship::tada:
taskcat - Test all the CloudFormation things! (with TaskCat)