We built PyPOTS: an open-source toolbox for data mining on partially-observed time series

This page summarizes the projects mentioned and recommended in the original post on /r/pythontips

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • PyPOTS

    A Python toolbox/library for reality-centric machine/deep learning and data mining on partially-observed time series with PyTorch, including SOTA neural network models for science tasks of imputation, classification, clustering, forecasting & anomaly detection on incomplete (irregularly-sampled) multivariate time series with NaN missing values/data

  • Due to all kinds of reasons like failure of collection sensors, communication error, and unexpected malfunction, missing values are common to see in time series from the real-world environment. This makes partially-observed time series (POTS) a pervasive problem in open-world modelling and prevents advanced data analysis. Although this problem is important, the area of data mining on POTS still lacks a dedicated toolkit. PyPOTS is created to fill in this gap. PyPOTS (pronounced "Pie Pots") is the first (and so far the only) Python toolbox/library specifically designed for data mining and machine learning on partially-observed time series (POTS), namely, incomplete time series with missing values, A.K.A. irregularly-sampled time series, supporting tasks of imputation, classification, clustering, and forecasting on POTS datasets. It is born to become a handy toolbox that is going to make data mining on POTS easy rather than tedious, to help engineers and researchers focus more on the core problems in their hands rather than on how to deal with the missing parts in their data. PyPOTS will keep integrating classical and the latest state-of-the-art data mining algorithms for partially-observed multivariate time series. For sure, besides various algorithms, PyPOTS has unified APIs together with detailed documentation and interactive examples across algorithms as tutorials. Feedback and contributions are very welcome! Website: https://pypots.com Paper link: https://arxiv.org/abs/2305.18811 GitHub repo: https://github.com/WenjieDu/PyPOTS

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • [P] PyPOTS: a Python toolbox for data mining on Partially-Observed Time Series

    1 project | /r/MachineLearning | 28 Jun 2023
  • Missing values in time series collected from the real world are common to see and very pesky. A new state-of-the-art and fast neural network called SAITS is proposed to impute missing data in partially-observed multivariate time series. The code is open source on GitHub.

    2 projects | /r/datascience | 28 Jun 2023
  • We're building PyPOTS: a Python toolbox for data mining on Partially-Observed Time Series (GitHub repo: https://github.com/WenjieDu/PyPOTS, Paper link: https://arxiv.org/abs/2305.18811)

    1 project | /r/technology | 20 Jun 2023
  • We built PyPOTS, an open-source toolbox for data mining on partially-observed time series

    2 projects | /r/datascience | 14 Jun 2023
  • PyPOTS: NEW Data - star count:182.0

    1 project | /r/algoprojects | 14 Jan 2023