nist-crc-2023 VS awesome-data-centric-ai

Compare nist-crc-2023 vs awesome-data-centric-ai and see how they differ.

                nist-crc-2023      awesome-data-centric-ai
Mentions        7                  7
Stars           27                 305
Growth          -                  2.0%
Activity        4.3                3.2
Last commit     11 months ago      5 months ago
Language        Jupyter Notebook   Jupyter Notebook
License         MIT License        MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.


Posts with mentions or reviews of nist-crc-2023. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-01.


Posts with mentions or reviews of awesome-data-centric-ai. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-07-13.
  • Thoughts: Continue current degree with one year left, or start anew with degree apprenticeship
    1 project | /r/cscareerquestionsuk | 13 Jul 2023
    I would finish the degree anyway. It's only one year left. If teachers miss classes, I would disregard that and try to learn on my own, and then yes, I would move on to an internship (or even do it at the same time if it's possible). If you like, come and meet us at the Data-Centric AI Community and we can do some projects together :)
  • Data science projects
    1 project | /r/datascience | 13 Jul 2023
    Definitely a lot of growth in the AI space, and it will evolve rapidly in the next few years. There are several paid propositions at the Data-Centric AI Community discord, check them out.
  • I absolutely hate my internship
    2 projects | /r/csMajors | 13 Jul 2023
    2: Tbh, quit (?) We have open jobs at the Data-Centric AI Community. Bonus points: you can vent there as much as you want
  • Prioritise Data Science Projects
    1 project | /r/learnprogramming | 13 Jul 2023
    Let me invite you to the Data-Centric AI Community. We have several code-along sessions and projects, and a lot of beginners who are starting to learn DS that you can connect with.
  • Imbalanced data
    1 project | /r/learnmachinelearning | 13 Jul 2023
    If you need specific help with your project you can find me at the Data-Centric AI Community and we'll be happy to take a look and give you some tips to move forward :)
  • Building my first Portfolio
    2 projects | /r/learnmachinelearning | 12 Jul 2023
    You can share your progress with us on the Data-Centric AI Community and ask someone to review it. We often do that with CVs as well and help each other out.
  • [Q] How to generate synthetic dataset for anomaly detection?
    1 project | /r/statistics | 8 May 2023
    Maybe you can use a synthetic data generator and use your current dataset as input? I believe there are a lot of GAN-based models for this purpose out there. The ones listed on are mostly focused on structured data, but I'm sure there are similar packages for images.

What are some alternatives?

When comparing nist-crc-2023 and awesome-data-centric-ai you can also consider the following projects:

tdk-demo - This is a collection of TDK demo projects that use different databases and options

ydata-synthetic - Synthetic data generators for tabular and time-series data

awesome-python-for-data-science - A curated list of awesome resources such as books, tutorials, courses, open-source libraries, exercises, and other materials that support Pythonistas in the making, and Pythonistas migrating into Data Science! 📊

machine_learning_complete - A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.

genalog - Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.

walkalongs - Resources and solutions of various technologies that I am currently learning

SDV - Synthetic data generation for tabular data


gan-vae-pretrained-pytorch - Pretrained GANs + VAEs + classifiers for MNIST/CIFAR in pytorch.


fullnamematchscore-go - Generates a match score of two person names from 0-100, where 100 is the highest, on how closely two individual full names match. The scoring is based on a series of tests, algorithms, AI, and an ever-growing body of Machine Learning-based generated knowledge

awesome-generative-ai-companies - A curated list of Generative AI companies, sorted by focus area and total fundraised amount.