condenser
NetVendor
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
condenser
-
Jailer, a unique open-source database tool
[1]: https://github.com/TonicAI/condenser
-
Ask HN: What are the open source tools for database subsetting?
[1]: https://github.com/TonicAI/condenser
-
Recommendation for tool or script for sanitizing data
This may be overkill but we have used Tonic for this: https://www.tonic.ai/
-
Is it atypical to have a dev DB service on your local environment?
A tool like https://www.tonic.ai/ might help.
- Launch HN: JumpWire (YC W22) – Easily encrypt customer data in your databases
-
Anonymize test data?
I attended a presentation a month or so ago where a co-worker was advocating for tonic. I've never used it myself, but, definitely looks as though it would cover your bases. I do agree with onomazein though that this should really be handled on the infrastructure side if at all possible. Dev's, testers, anyone else should be able to pull from an already anonymized location, but, but, someone else should be responsible for setting up the environment and the initial synchronization.
-
What's the coolest automation tool you've built or been involved in?
So I built a gitlab pipeline to create a backup of this upstream db without downtime using various SQL utils. This archive is then staged into an image so the data will unpack and load on startup. I then used a data subsetter called condenser to create datasets for certain use cases. Now devs can load reliable dev data quicker, test against data that QA uses but within their unique envs (local and preview) and create datasets for their own use cases.
- Preserve the unique relationships between data columns while wiping sensitive information from those columns using randomization.
- Don't let your test data suffer - meet the Tonic and Google BigQuery partnership.
- Don't let your test data suffer - meet the Tonic.ai and Amazon Redshift partnership for Real. Fake. Data.
NetVendor
-
What's the coolest automation tool you've built or been involved in?
Needing to benchmark what was in my network I built NetVendor that takes network router output called an arp table and turns into actionable data of what exactly is on the network
- NetVendor - app I made that ingests a switch / router IP ARP table and tells you what exactly is on your network
What are some alternatives?
Replibyte - Seed your development database with real data ⚡️
wordscapes-bot - A Bot that Completes Levels on the Videogame WordScapes
boxball - Prebuilt Docker images with Retrosheet's complete baseball history data for many analytical frameworks. Includes Postgres, cstore_fdw, MySQL, SQLite, Clickhouse, Drill, Parquet, and CSV.
nango - A single API for all your integrations.
prisma-field-encryption - Transparent field-level encryption at rest for Prisma
profiles
mocki-ui - Mocki is a HTTP benchmarking and data generation tool.
sitcom-simulator-cli - A tool that combines GPT-3, Stable Diffusion, and FakeYou to create fully automated video. [Moved to: https://github.com/joshmoody24/sitcom-simulator]
infrastructure-tools - JumpWire deployment and installation scripts
Docker Compose - Define and run multi-container applications with Docker
afum - audio file upload manager gui for tttweb