Jupyter Notebook Data

Open-source Jupyter Notebook projects categorized as Data

Top 23 Jupyter Notebook Data Projects

  • data

    Data and code behind the articles and graphics at FiveThirtyEight

  • Project mention: [USMNT] It only took 20 caps for Jesus Ferreira to get double-digit goals. The fastest in #USMNT history. | /r/MLS | 2023-06-29

    You of course already know this answer, but just to put it into more perspective. Here are the SPI ranking equivalents to what he did with these 11 goals in Scotland and Switzerland.

  • datasets

    🎁 5,400,000+ Unsplash images made available for research and machine learning (by unsplash)

  • Project mention: AI-Powered Image Search with CLIP, pgvector, and Fast API | dev.to | 2024-02-12

    Here's a live demo with a simple React frontend. It's searching against an S3 bucket containing Unsplash's open source dataset of 25,000 images, plus a few of my own.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • quilt

    Quilt is a data mesh for connecting people with actionable data

  • spyql

    Query data on the command line with SQL-like SELECTs powered by Python expressions

  • Project mention: Fq: Jq for Binary Formats | news.ycombinator.com | 2023-06-03

    I prefer a SQL-like format. It’s not as complete but it cover most of the day-to-day use cases. Take a look at https://github.com/dcmoura/spyql (I am the author). Congrats on fq!

  • pdpipe

    Easy pipelines for pandas DataFrames.

  • Reactors

    🌱 Join a community of developers at Microsoft Reactor and connect with people, skills, and technology to build your career or personal learning. We offer free livestreams, on-demand content, and hybrid/in-person events daily around the world. Access our projects and code here.

  • ydata-quality

    Data Quality assessment with one line of code

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • awesome-data-centric-ai

    Open-Source Software, Tutorials, and Research on Data-Centric AI 🤖

  • Project mention: Thoughts: Continue current degree with one year left, or start anew with degree apprenticeship | /r/cscareerquestionsuk | 2023-07-13

    I would finish the degree anyway. It's only one year left. If teachers miss classes, I would disregard that and try to learn on my own, and then yes, I would move on to an internship (or even do It at the same time if it's possible). If you like, come as meet us at the Data-Centric AI Community and we can do some projects together :)

  • PANDAS-TUTORIAL

    Jupyter Notebooks and Data Sets for Pandas Library (by TirendazAcademy)

  • uawardata

    The data behind uawardata.com

  • visuallayer

    Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, mislabels and others.

  • Project mention: VL Datasets - A Free Collection of Clean Computer Vision Datasets | /r/computervision | 2023-06-28

    To learn more visit our GitHub repository - https://github.com/visual-layer/visuallayer

  • vulcan-sql-examples

    Curated VulcanSQL show cases

  • Project mention: VulcanSQL and LLMs | news.ycombinator.com | 2023-08-25

    https://github.com/Canner/vulcan-sql-examples/tree/main/hugg...

  • rihal-challenges

    This repository is used to house Rihal's challenges for hiring.

  • Project mention: [P] Would like to see other peoples solutions to this, to see what i could have done better | /r/MachineLearning | 2023-09-24
  • German-NER-BERT

    German NER on Legal Data using BERT

  • PARSE-CLIP

    A simple CLIP based project for combining images from multiple datasets.

  • ACC_Data_2

    A tool for recording telemetry from Assetto Corsa Competitzione (on PC) for post-session analysis

  • search-engine

    Full stack search-engine created from youtube videos obtained using "web-scraping"

  • KunOnYomiFrequency

    The most common possible readings of the most frequently used Kanji characters.

  • walmart-stores-coffee-analysis

    Walmart Coffee Exploratory Data Analysis. Data Extracted with SerpApi 🧡

  • russo-ukraine-war-prediction-losses

    Highlights rusian losses with predictions based on historic data from Ministry Defence of Ukraine 🐱‍👤

  • onpe2021

    2021 presidential election's data extractor. This script collect all the data from official ONPE page.

  • global-temp-change-animation

    This animated map shows the change in surface temperature around the world from 1970 to 2021, based on data from Kaggle.

  • DataScienceProjects

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Jupyter Notebook Data related posts

Index

What are some of the best open-source Data projects in Jupyter Notebook? This list will help you:

Project Stars
1 data 16,631
2 datasets 2,299
3 quilt 1,313
4 spyql 902
5 pdpipe 715
6 Reactors 506
7 ydata-quality 406
8 awesome-data-centric-ai 300
9 PANDAS-TUTORIAL 158
10 uawardata 113
11 visuallayer 65
12 vulcan-sql-examples 17
13 rihal-challenges 11
14 German-NER-BERT 7
15 PARSE-CLIP 3
16 ACC_Data_2 3
17 search-engine 1
18 KunOnYomiFrequency 1
19 walmart-stores-coffee-analysis 1
20 russo-ukraine-war-prediction-losses 0
21 onpe2021 0
22 global-temp-change-animation 0
23 DataScienceProjects 0

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com