SDV
genalog
Metric | SDV | genalog
---|---|---
Mentions | 59 | 1
Stars | 2,117 | 295
Growth | 14.0% | 2.7%
Activity | 9.3 | 0.0
Latest commit | 7 days ago | 3 months ago
Language | Python | Jupyter Notebook
License | GNU General Public License v3.0 or later | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
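The page does not give the formulas behind these numbers, so the following is only a minimal sketch of how such metrics could be computed. The exponential decay, the 30-day half-life, the percentile mapping, and all function names are assumptions made for illustration, not the site's actual method.

```python
from datetime import date


def month_over_month_growth(stars_now, stars_month_ago):
    """Percentage change in stars relative to the count one month earlier."""
    if stars_month_ago <= 0:
        raise ValueError("stars_month_ago must be positive")
    return 100.0 * (stars_now - stars_month_ago) / stars_month_ago


def recency_weighted_commits(commit_dates, today, half_life_days=30.0):
    """Sum of commit weights; a commit's weight halves every half_life_days.

    The decay shape and the half-life are assumptions for this sketch.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_score(raw_score, all_raw_scores):
    """Map a raw score to a 0-10 scale by percentile rank (assumed mapping).

    A result of 9.0 means 90% of tracked projects have a lower raw score,
    i.e. the project sits in the top 10%.
    """
    below = sum(1 for s in all_raw_scores if s < raw_score)
    return 10.0 * below / len(all_raw_scores)
```

With made-up inputs, `month_over_month_growth(110, 100)` gives a growth of 10%, and a project whose raw score beats 9 of 10 tracked projects gets an activity of 9.0, which matches the "top 10%" reading in the text.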
SDV
- Synthetic data generation for tabular data
Can someone help me understand the licensing of this?
https://github.com/sdv-dev/SDV/blob/main/LICENSE
It was MIT licensed up until 2022, when it was changed to the current license, and they say it will become MIT again 4 years after release... but is that counted from when the license was changed, or from the first release of the software on GitHub?
- SDV: NEW Data - star count:1441.0
- FLaNK Stack Weekly for 30 April 2023
- SDV: NEW Data - star count:1196.0
genalog
- Microsoft Unveils Genalog: An Open Source, AI Cross-Platform Python Package For Generating Document Images With Synthetic Noise
GitHub: https://github.com/microsoft/genalog
What are some alternatives?
CTGAN - Conditional GAN for generating synthetic tabular data.
deep-text-recognition-benchmark - Text recognition (optical character recognition) with deep learning methods, ICCV 2019
gretel-python-client - The Gretel Python Client allows you to interact with the Gretel REST API.
synthetic-data-genomics - Proof of concept code from Gretel.ai and Illumina using generative neural networks to create synthetic versions of mouse genotype and phenotype data.
machine-learning-for-trading - Code for Machine Learning for Algorithmic Trading, 2nd edition.
Copulas - A library to model multivariate data using copulas.
tsfresh - Automatic extraction of relevant features from time series.
docutron - Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.
ML-For-Beginners - 12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
TimeSynth - A Multipurpose Library for Synthetic Time Series Generation in Python
nist-crc-2023 - NIST Collaborative Research Cycle on Synthetic Data. Learn about Synthetic Data week by week!