awesome-python-for-data-science
ydata-synthetic
Metric | awesome-python-for-data-science | ydata-synthetic |
---|---|---|
Mentions | 7 | 60 |
Stars | 68 | 1,309 |
Growth | - | 4.0% |
Activity | 7.3 | 7.3 |
Last commit | 6 months ago | 5 days ago |
Language | Jupyter Notebook | Jupyter Notebook |
License | - | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
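As a rough illustration of the Growth metric, the sketch below computes month-over-month growth as a simple percentage change between two star counts. The page does not state the exact formula it uses, so this is an assumption; the function name and the star counts are hypothetical and chosen only for the example.

```python
def month_over_month_growth(previous_stars: int, current_stars: int) -> float:
    """Return the percentage change in stars from one month to the next.

    Assumption: growth is a plain percentage change; the tracking site
    may compute it differently.
    """
    if previous_stars <= 0:
        raise ValueError("previous_stars must be positive")
    return (current_stars - previous_stars) / previous_stars * 100.0


# Hypothetical star counts, used only for illustration.
growth = month_over_month_growth(previous_stars=1000, current_stars=1040)
print(f"{growth:.1f}%")
```

A project with no earlier star count (shown as "-" in the table) has no defined growth under this definition, which is why the function rejects a non-positive previous count.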
awesome-python-for-data-science
-
[D] Best tools to learn data science nowadays?
We're updating our awesome-python-for-data-science repository.
-
Embarking on a Journey of 99 Data Science Projects - From Beginner to Expert
Sounds like an amazing journey! Feel free to add your projects on our awesome-python-for-data-science repo as you go! And in case you need a hand or feedback on the projects, we'll be happy to help at the Data-Centric AI Community.
-
[D] What is the best way to learn machine learning?
We've started a nice repo on the DS roadmap: https://github.com/Data-Centric-AI-Community/awesome-python-for-data-science/tree/main
-
Where can I find data science projects to gain more experience.
Hey! You can find several resources online; check out this repo. Also, if you're up for it, we are running a project on synthetic data (instructions are given weekly) on the Data-Centric AI Community. You'll find the #ds-projects channel and the #nist-challenge project, which we're currently working on.
-
Hands-on Data-Centric AI: Data Preparation tuning - Why and how?
We made a tutorial following a fully Data-Centric AI pipeline for fraud detection! The material is freely available, let us know what you think! :)
- Hands-On Data-Centric Preparation Tuning – Why and How?
-
I'm new to data science. Where to start?
You're very welcome to join the Data-Centric AI Community. Take a look at our awesome-python-for-data-science repo: https://github.com/Data-Centric-AI-Community/awesome-python-for-data-science
ydata-synthetic
-
Coding Wonderland: Contribute to YData Profiling and YData Synthetic in this Advent of Code
Send us your North ⭐️: "On the first day of Christmas, my true contributor gave to me..." a star in my GitHub tree! 🎵 If you love these projects too, star ydata-profiling or ydata-synthetic and let your friends know why you love it so much!
- ydata-synthetic: NEW Data - star count: 1083
-
I absolutely hate my internship
1: Try to work with what you have and augment your dataset (honestly, 10 points is crap)
-
Assessing the Quality of Synthetic Data with Data-Centric AI
Data Quality is key for all applications and models, and LLMs are no exception :) I've been working on a small community project with synthetic data (https://github.com/ydataai/ydata-synthetic) using ydata-synthetic, and it really shows! Underrepresentation (category imbalance) and missing data are two of the main issues!
-
SOMEBODY HELP ME!
The Data-Centric AI Community creates community projects from time to time and is probably willing to help you in your project.
-
Help for Data Scientist position
Join nice data communities and start networking.
-
How to become a beast in DS ?
You know what they say: "Tell me who your friends are, and I'll tell you who you are!". Hang out with DS beasts and learn from them :)
-
Hey guys, I have a few questions
Interesting question! I think our AI/ML devs at the Data-Centric AI Community could offer some nice perspectives to help you decide :)
-
Embarking on a Journey of 99 Data Science Projects - From Beginner to Expert
Sounds like an amazing journey! Feel free to add your projects on our awesome-python-for-data-science repo as you go! And in case you need a hand or feedback on the projects, we'll be happy to help at the Data-Centric AI Community.
-
Data science problems
The best thing to do is to get started with end-to-end projects in a collaborative environment (somewhat approaching real-world settings). You may find some interesting resources in this GitHub repository. The Data-Centric AI Community actually has a nice support system for this.
What are some alternatives?
rgb-to-hex - Python script to convert an RGB text sequence into HEX Code
REaLTabFormer - A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
ultimate-python - Ultimate Python study guide for newcomers and professionals alike. :snake: :snake: :snake:
Copulas - A library to model multivariate data using copulas.
ml-earth-observation-101 - An introduction to applying machine learning to satellite imagery (remote sensing).
DeepRL-TensorFlow2 - 🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
mud-pi - A simple MUD server in Python, for teaching purposes, which could be run on a Raspberry Pi
Conditional-Sig-Wasserstein-GANs
python-tutorial - A Python 3 programming tutorial for beginners.
pytorch-forecasting - Time series forecasting with PyTorch
Data-Science-Resources - Data Science related resources and cheatsheets
gretel-python-client - The Gretel Python Client allows you to interact with the Gretel REST API.