Top 9 Jupyter Notebook synthetic-data Projects
genalog
Genalog is an open-source, cross-platform Python package for generating synthetic document images with custom degradations and text alignment capabilities.
REaLTabFormer
A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
awesome-python-for-data-science
A curated list of awesome resources such as books, tutorials, courses, open-source libraries, exercises, and other materials that support Pythonistas in the making, and Pythonistas migrating into Data Science! 📊
synthetic-data-genomics
Proof of concept code from Gretel.ai and Illumina using generative neural networks to create synthetic versions of mouse genotype and phenotype data.
nist-crc-2023
NIST Collaborative Research Cycle on Synthetic Data. Learn about Synthetic Data week by week!
Index
What are some of the best open-source synthetic-data projects in Jupyter Notebook? This list will help you:
# | Project | Stars |
---|---|---|
1 | machine-learning-for-trading | 15,194 |
2 | ydata-synthetic | 1,558 |
3 | awesome-data-centric-ai | 336 |
4 | genalog | 330 |
5 | REaLTabFormer | 231 |
6 | anonymeter | 89 |
7 | awesome-python-for-data-science | 85 |
8 | synthetic-data-genomics | 37 |
9 | nist-crc-2023 | 27 |