The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Top 9 Jupyter Notebook synthetic-data Projects
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
genalog
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
-
REaLTabFormer
A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
-
synthetic-data-genomics
Proof of concept code from Gretel.ai and Illumina using generative neural networks to create synthetic versions of mouse genotype and phenotype data.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
nist-crc-2023
NIST Collaborative Research Cycle on Synthetic Data. Learn about Synthetic Data week by week!
Project mention: Machine Learning for Trading: Notebooks, resources and references accompanying the book Machine Learning for Algorithmic Trading. Courses - star count:10678.0 | /r/algoprojects | 2023-11-20
Project mention: Coding Wonderland: Contribute to YData Profiling and YData Synthetic in this Advent of Code | dev.to | 2023-12-05Send us your North ⭐️: "On the first day of Christmas, my true contributor gave to me..." a star in my GitHub tree! 🎵 If you love these projects too, star ydata-profiling or ydata-synthetic and let your friends know why you love it so much!
Project mention: [P] SkinDeep 2.0 : SkinDeep Tattoo removal project now powered by ControlNet. GitHub Link in comments/Images. | /r/MachineLearning | 2023-04-29
Project mention: Thoughts: Continue current degree with one year left, or start anew with degree apprenticeship | /r/cscareerquestionsuk | 2023-07-13I would finish the degree anyway. It's only one year left. If teachers miss classes, I would disregard that and try to learn on my own, and then yes, I would move on to an internship (or even do It at the same time if it's possible). If you like, come as meet us at the Data-Centric AI Community and we can do some projects together :)
Project mention: Assessing the Quality of Synthetic Data with Data-centric AI | /r/ArtificialInteligence | 2023-07-13Data Quality is key for all applications and models, and LLMs are no exception :) I've been working on a small community project with synthetic data using ydata-synthetic, and it really shows! Underrepresentation (category imbalance) and missing data are two of the main issues!
Jupyter Notebook synthetic-data related posts
- ydata-synthetic: NEW Data - star count:1083.0
- I absolutely hate my internship
- Assessing the Quality of Synthetic Data with Data-Centric AI
- SOMEBODY HELP ME!
- Help for Data Scientist position
- How to become a beast in DS ?
- Hey guys, I have a few questions
-
A note from our sponsor - WorkOS
workos.com | 19 Apr 2024
Index
What are some of the best open-source synthetic-data projects in Jupyter Notebook? This list will help you:
Project | Stars | |
---|---|---|
1 | machine-learning-for-trading | 11,750 |
2 | ydata-synthetic | 1,279 |
3 | SkinDeep | 929 |
4 | awesome-data-centric-ai | 300 |
5 | genalog | 294 |
6 | REaLTabFormer | 182 |
7 | synthetic-data-genomics | 32 |
8 | nist-crc-2023 | 27 |
9 | multi-table | 7 |