We have some preliminary work in this direction: https://github.com/gretelai/multi-table
I love the idea of "table space" though. It would be fun to traverse this space and output a new database at each step, like a VAE.
Thanks for the shoutout, cush! And yup, our platform Tonic enables developers to realistically de-identify their data while preserving relationships and consistency across tables within their DBs, to optimize dev and test with real fake data. You can sign up for a sandbox here: https://www.tonic.ai/
We've also recently released a new platform called Djinn that is specifically designed for data science workflows. It enables you to query from tables across your DB to build customized views of only the data you need and synthesize high-fidelity data based on models trained on those views. Relationships are fully preserved and no external scripting is required. You can create an account and take it for a spin here: https://djinn.tonic.ai/?signup
Full disclosure, I'm Chiara Colombi, Product Marketing Manager at Tonic.ai. Cheers!
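For anyone wondering what "preserving relationships and consistency across tables" can mean in practice, here is a minimal sketch of one common approach: keyed, deterministic pseudonymization. To be clear, this is not how Tonic, Djinn, or Gretel implement it; the table layout, the `SECRET_KEY`, and the `pseudonymize` / `deidentify` helpers are all made up for illustration. The idea is only that the same input always maps to the same token, so joins between tables still line up after the real values are gone.

```python
import hashlib
import hmac

# Hypothetical secret; in a real setup it would live outside the codebase.
SECRET_KEY = b"demo-only-secret"


def pseudonymize(value: str, prefix: str = "user") -> str:
    """Map a sensitive value to a stable token using a keyed hash (HMAC-SHA256)."""
    digest = hmac.new(SECRET_KEY, value.encode("utf-8"), hashlib.sha256).hexdigest()
    return f"{prefix}_{digest[:12]}"


def deidentify(rows, column):
    """Return copies of the rows with one column replaced by its pseudonym."""
    return [{**row, column: pseudonymize(row[column])} for row in rows]


# Two toy "tables" that share the email column as a join key (illustrative data).
users = [
    {"email": "alice@example.com", "plan": "pro"},
    {"email": "bob@example.com", "plan": "free"},
]
orders = [
    {"email": "alice@example.com", "total": 30},
    {"email": "alice@example.com", "total": 12},
    {"email": "bob@example.com", "total": 5},
]

safe_users = deidentify(users, "email")
safe_orders = deidentify(orders, "email")

# The join key still matches across tables, so per-user totals can be computed.
totals = {}
for order in safe_orders:
    totals[order["email"]] = totals.get(order["email"], 0) + order["total"]
```

Because the mapping is deterministic, anyone holding the key can recompute it, so this is closer to pseudonymization than to synthetic data generation. Tools in this space generally go further, for example by also handling the non-key columns.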