Jupyter Notebook Dataset

Open-source Jupyter Notebook projects categorized as Dataset

Top 23 Jupyter Notebook Dataset Projects

  • covid-chestxray-dataset

    We are building an open database of COVID-19 cases with chest X-ray or CT images.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • whylogs

    An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈

  • datasets

    🎁 5,400,000+ Unsplash images made available for research and machine learning (by unsplash)

    Project mention: AI-Powered Image Search with CLIP, pgvector, and Fast API | dev.to | 2024-02-12

    Here's a live demo with a simple React frontend. It's searching against an S3 bucket containing Unsplash's open source dataset of 25,000 images, plus a few of my own.

  • fma

    FMA: A Dataset For Music Analysis

  • clusterdata

    cluster data collected from production clusters in Alibaba for cluster management research

  • raccoon_dataset

    The dataset is used to train my own raccoon detector and I blogged about it on Medium

  • torchxrayvision

    TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.

  • ThoughtSource

    A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/

  • hate-speech-and-offensive-language

    Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

  • OpenAI-CLIP

    Simple implementation of OpenAI CLIP model in PyTorch.

    Project mention: Simple Implementation of OpenAI Clip (Tutorial) | news.ycombinator.com | 2024-02-21
  • TACO

    🌮 Trash Annotations in Context Dataset Toolkit (by pedropro)

  • SKAB

    SKAB - Skoltech Anomaly Benchmark. Time-series data for evaluating Anomaly Detection algorithms.

  • Awesome_Satellite_Benchmark_Datasets

    Supplementary material for our paper "THERE IS NO DATA LIKE MORE DATA" is provided.

  • covid19za

    Coronavirus COVID-19 (2019-nCoV) Data Repository and Dashboard for South Africa

  • roboflow-100-benchmark

    Code for replicating Roboflow 100 benchmark results and programmatically downloading benchmark datasets

  • ImageNetV2

    A new test set for ImageNet

  • alis

    [ICCV 2021] Aligning Latent and Image Spaces to Connect the Unconnectable (by universome)

  • goodreads

    code samples for the goodreads datasets (by MengtingWan)

  • mnist1d

    A 1D analogue of the MNIST dataset for measuring spatial biases and answering Science of Deep Learning questions.

  • clip-italian

    CLIP (Contrastive Language–Image Pre-training) for Italian

  • openbrewerydb

    🍻 An open-source dataset of breweries, cideries, brewpubs, and bottleshops.

  • medmcqa

    A large-scale (194k), Multiple-Choice Question Answering (MCQA) dataset designed to address realworld medical entrance exam questions.

  • Tegridy-MIDI-Dataset

    Tegridy MIDI Dataset for precise and effective Music AI models creation.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Jupyter Notebook Dataset discussion

Log in or Post with

Jupyter Notebook Dataset related posts

Index

What are some of the best open-source Dataset projects in Jupyter Notebook? This list will help you:

Project Stars
1 covid-chestxray-dataset 2,998
2 whylogs 2,657
3 datasets 2,443
4 fma 2,212
5 clusterdata 1,619
6 raccoon_dataset 1,268
7 torchxrayvision 936
8 ThoughtSource 899
9 hate-speech-and-offensive-language 779
10 OpenAI-CLIP 640
11 TACO 603
12 SKAB 328
13 Awesome_Satellite_Benchmark_Datasets 323
14 covid19za 255
15 roboflow-100-benchmark 248
16 ImageNetV2 240
17 alis 243
18 goodreads 250
19 mnist1d 199
20 clip-italian 180
21 openbrewerydb 179
22 medmcqa 174
23 Tegridy-MIDI-Dataset 158

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com

Did you konow that Jupyter Notebook is
the 13th most popular programming language
based on number of metions?