BotLibre
dcai-lab
BotLibre | dcai-lab | |
---|---|---|
1 | 10 | |
561 | 401 | |
-0.7% | 3.2% | |
6.6 | 5.4 | |
about 1 month ago | 4 months ago | |
Java | Jupyter Notebook | |
Eclipse Public License 1.0 | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
BotLibre
dcai-lab
-
Resources to learn practical/industry-focused ML (preferably using TensorFlow)?
Data-Centric AI honestly if you've been working on ML pipelines this might be familiar to you
-
Andrew NG, github courses
Another great resource inspired by the Andrew Ng data-centric AI movement is the Introduction to Data-Centric AI course taught this past semester at MIT by PhDs.
-
Good Beginner Courses for ML?
Data-centric AI course. Brand new, taught the 1st time a few months ago by MIT PhD grads. This covers how to ensure good data quality for your models. More data science havy.
-
[P] We are building a curated list of open source tooling for data-centric AI workflows, looking for contributions.
Thanks for the kind words! Make sure to check out the current open MIT course if you are just starting out: https://dcai.csail.mit.edu/
-
The Missing Semester of Your CS Education
Introduction to Data-Centric AI https://dcai.csail.mit.edu
- Introduction to Data-Centric AI
-
MIT Introduction to Data-Centric AI
Course homepage | Lecture videos on YouTube | Lab Assignments
What are some alternatives?
learn - Neuro-symbolic interpretation learning (mostly just language-learning, for now)
snorkel - A system for quickly generating training data with weak supervision
refinery - The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
cleanlab - The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
DKPro Core - Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.
llm-course - Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
deodel - A mixed attributes predictive algorithm implemented in Python.
simplenlg - Java API for Natural Language Generation. Originally developed by Ehud Reiter at the University of Aberdeen’s Department of Computing Science and co-founder of Arria NLG. This git repo is the official SimpleNLG version.
chordviz - A convolutional neural network trained using PyTorch to predict the next chord (as tablature) on a guitar based on image data. Includes labeling software for the image data as well as an iOS app for hosting and running the model.
sematle - NLU service that converts plain English to known and structured data.
UBB-INFO - All projects from university.