datasets
datumaro
Our great sponsors
datasets | datumaro | |
---|---|---|
15 | 2 | |
18,376 | 483 | |
1.7% | 4.6% | |
9.5 | 9.4 | |
5 days ago | 4 days ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
datasets
- 🐍🐍 23 issues to grow yourself as an exceptional open-source Python expert 🧑💻 🥇
- Mastering ROUGE Matrix: Your Guide to Large Language Model Evaluation for Summarization with Examples
-
How to Train Large Models on Many GPUs?
https://github.com/huggingface/datasets
https://github.com/huggingface/transformers
-
[D] Can we use Ray for distributed training on vertex ai ? Can someone provide me examples for the same ? Also which dataframe libraries you guys used for training machine learning models on huge datasets (100 gb+) (because pandas can't handle huge data).
https://huggingface.co/docs/datasets backed with an Arrow file or buffer
- Need help with a data science project
-
Is there a text evaluation metric that does not need reference text?
I'm looking for an automatic evaluation metric that can score the first text higher (since it's more grammatically correct/better for other reasons). All the metrics for NLG I found require some reference text to match the generated text with, which I don't have.
-
FauxPilot – an open-source GitHub Copilot server
And then pass that my_code.json as the dataset name.
[1] https://github.com/huggingface/datasets
-
Hugging Face Introduces ‘Datasets’: A Lightweight Community Library For Natural Language Processing (NLP)
Code for https://arxiv.org/abs/2109.02846 found: https://github.com/huggingface/datasets
Quick Read | Paper | Github
- Datasets: A Community Library for Natural Language Processing
datumaro
-
Does anyone use CVAT for image annotation?
1) CVAT has internal inference for models. If you upload model there in the correct format, then it will be able to generate the detection box itself - https://onepanel.medium.com/train-an-object-detection-model-from-scratch-and-run-inference-on-it-in-10-minutes-16147ef656aa 2) Yes you can upload your prediction. But last time i did it - there were some problems and it took me several hours. It seems to me that you just need to load the markup in one of the formats that it supported by CVAT. If your format is not supported, then you will need to convert. For example like this - https://github.com/openvinotoolkit/datumaro
- Can I know what kind of dataset annotation for object detection is this?
What are some alternatives?
sentence-transformers - Multilingual Sentence & Image Embeddings with BERT
cocojson - Utility scripts for COCO json annotation format
cypress-realworld-app - A payment application to demonstrate real-world usage of Cypress testing methods, patterns, and workflows.
sahi - Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
edex-ui - A cross-platform, customizable science fiction terminal emulator with advanced monitoring & touchscreen support.
DA-RetinaNet - Official Detectron2 implementation of DA-RetinaNet of our Image and Vision Computing 2021 work 'An unsupervised domain adaptation scheme for single-stage artwork recognition in cultural sites'
first-contributions - 🚀✨ Help beginners to contribute to open source projects
label-studio - Label Studio is a multi-type data labeling and annotation tool with standardized output format
frankmocap - A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
UCF-SST-CitySim1-Dataset - Official github page of UCF SST CitySim Dataset
evaluate - 🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
GeoCOCO - Tool for converting GIS annotations to Microsoft's Common Objects In Context (COCO) datasets