DataScienceProjects
awesome-data-centric-ai
Our great sponsors
DataScienceProjects | awesome-data-centric-ai | |
---|---|---|
1 | 7 | |
0 | 302 | |
- | 2.0% | |
5.1 | 3.2 | |
2 months ago | 5 months ago | |
Jupyter Notebook | Jupyter Notebook | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DataScienceProjects
awesome-data-centric-ai
-
Thoughts: Continue current degree with one year left, or start anew with degree apprenticeship
I would finish the degree anyway. It's only one year left. If teachers miss classes, I would disregard that and try to learn on my own, and then yes, I would move on to an internship (or even do It at the same time if it's possible). If you like, come as meet us at the Data-Centric AI Community and we can do some projects together :)
-
Data science projects
Definitely a lot of growth in the AI space, and it will evolve rapidly in the next few years. There several paid propositions at the Data-Centric AI Community discord, check them out.
-
I absolutely hate my internship
2: Tbh, quit (?) We have open jobs at the Data-Centric AI Community. Bonus points: you can vent there as much as you want
-
Prioritise Data Science Projects
Let me invite you to the Data-Centric AI Community we have several code along sessions and projects and a lot of beginners that are starting to learn DS that you can connect with.
-
Imbalanced data
If you need specific help with your project you can find me at the Data-Centric AI Community and we'll be happy to take a look and give you some tips to move forward :)
-
Building my first Porfolio
You can share with us your progress on the Data-Centric AI Community and ask someone to review it, we often do that with CVs as well and help each other out.
-
[Q] How to generate synthetic dataset for anomaly detection?
Maybe you can use a synthetic data generator and use your current dataset as input? I believe there are a lot of GAN-based models for this purpose out there. The ones listed on https://github.com/Data-Centric-AI-Community/awesome-data-centric-ai are mostly focused on structured data, but I'm sure there are similar packages for images.
What are some alternatives?
machine-learning-for-trading - Code for Machine Learning for Algorithmic Trading, 2nd edition.
ydata-synthetic - Synthetic data generators for tabular and time-series data
Deep-Learning-Machine-Learning-Stock - Deep Learning and Machine Learning stocks represent a promising long-term or short-term opportunity for investors and traders. [Moved to: https://github.com/LastAncientOne/Deep_Learning_Machine_Learning_Stock]
machine_learning_complete - A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.
Deep_Learning_Machine_Learning_Stock - Deep Learning and Machine Learning stocks represent promising opportunities for both long-term and short-term investors and traders.
walkalongs - Resources and solutions of various technologies that I am currently learning
linear-regression-from-scratch - A data science project for part II physics project E (surveying using stars)
awesome-generative-ai-companies - A curated list of Gеnerative AI companies, sorted by focus area and total fundraised amount.
data-analytics-project-template - A python project starter template for data-analytics and data-science.
fullnamematchscore-go - Generates a match score of two person names from 0-100, where 100 is the highest, on how closely two individual full names match. The scoring is based on a series of tests, algorithms, AI, and an ever-growing body of Machine Learning-based generated knowledge
datadoubleconfirm - Simple datasets and notebooks for data visualization, statistical analysis and modelling - with write-ups here: http://projectosyo.wix.com/datadoubleconfirm.
COVID-US - Open benchmark dataset of COVID-19 related ultrasound imaging data, curated and systematically validated — Ensemble de données de référence ouvert d'imagerie échographique liées à la COVID-19, organisé et systématiquement validé