shabby-pages
cia
shabby-pages | cia | |
---|---|---|
1 | 2 | |
44 | 3 | |
- | - | |
3.7 | 0.0 | |
5 days ago | almost 2 years ago | |
Jupyter Notebook | Jupyter Notebook | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
shabby-pages
-
[P][R] Announcing: Dataset & Denoising Shabby Pages Competition
Into machine learning? Want a chance to earn a new MacBook Pro? Check out the Denoising ShabbyPages competition! The ShabbyPages dataset is being produced as a way to help train, test, and calibrate computer vision machine learning algorithms designed for working with documents. Enter the competition by training a model to remove the noise, and be awarded a MacBook Pro or some swag in the process! Check out the short paper introducing the dataset, and learn more about the competition at denoising-shabby.com.
cia
-
CIA Factbook - 250 countries & 66 Columns of Dataset & API
While you can download and use this dataset for free through https://github.com/woosal1337/cia, you can also prefer using Kaggle Page, whereas both of the pages are going to stay updated to the latest versions accordingly.
For each separate file, folder please click here, if you want to visit the file where all of the columns were combined together (over 66 columns), then please click here.
What are some alternatives?
layout-parser - A Unified Toolkit for Deep Learning Based Document Image Analysis
hate-speech-and-offensive-language - Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
Data-Science-Cheatsheet - A helpful 5-page machine learning cheatsheet to assist with exam reviews, interview prep, and anything in-between.
visuallayer - Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, mislabels and others.
HugsVision - HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision
covid19za - Coronavirus COVID-19 (2019-nCoV) Data Repository and Dashboard for South Africa
label-studio - Label Studio is a multi-type data labeling and annotation tool with standardized output format
whylogs - An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
DataProfiler - What's in your data? Extract schema, statistics and entities from datasets
openbrewerydb - 🍻 An open-source dataset of breweries, cideries, brewpubs, and bottleshops.