Jupyter Notebook Data Mining

Open-source Jupyter Notebook projects categorized as Data Mining

Top 4 Jupyter Notebook Data Mining Projects

  • python-machine-learning-book

    The "Python Machine Learning (1st edition)" book code repository and info resource

  • fraud-detection-handbook

    Reproducible Machine Learning for Credit Card Fraud Detection - Practical Handbook

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • BrewPOTS

    The tutorials for PyPOTS.

    Project mention: We're building PyPOTS: a Python toolbox for data mining on Partially-Observed Time Series | /r/learnprogramming | 2023-06-19

    Due to all kinds of reasons like failures of collection sensors, communication errors, and unexpected malfunctions, missing values are common to see in time series from the real-world environment. No matter whether we like them or not, missing data makes partially-observed time series (POTS) a pervasive problem in open-world modeling and prevents advanced data analysis. Although this problem is important, the area of data mining on POTS still lacks a dedicated toolkit. PyPOTS is created to fill in this gap. PyPOTS (pronounced "Pie Pots") is the first (and so far the only) Python toolbox/library specifically designed for data mining and machine learning on partially-observed time series (POTS), namely, incomplete time series with missing values, A.K.A. irregularly-sampled time series, supporting tasks of imputation, classification, clustering, and forecasting on POTS datasets. It is born to become a handy toolbox that is going to make data mining on POTS easy rather than tedious, to help engineers and researchers focus more on the core problems in their hands rather than on how to deal with the missing parts in their data. PyPOTS will keep integrating classical and the latest state-of-the-art data mining algorithms for partially-observed multivariate time series. For sure, besides various algorithms, PyPOTS has unified APIs together with detailed documentation and interactive examples across algorithms as tutorials. Feedback, questions, and contributions are all very welcome! Website: https://pypots.com Paper link: https://arxiv.org/abs/2305.18811 GitHub repo: https://github.com/WenjieDu/PyPOTS Tutorials: https://github.com/WenjieDu/BrewPOTS Docs: https://docs.pypots.com

  • VevestaX

    2 Lines of code to track ML experiments + EDA + check into Github

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-06-19.

Jupyter Notebook Data Mining related posts

Index

What are some of the best open-source Data Mining projects in Jupyter Notebook? This list will help you:

Project Stars
1 python-machine-learning-book 12,076
2 fraud-detection-handbook 420
3 BrewPOTS 35
4 VevestaX 27
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com