We are the developers behind pandas, currently preparing for the 2.0 release :) AMA

This page summarizes the projects mentioned and recommended in the original post on /r/Python

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • Pandas

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

  • Personally polars' strictness is making me think about situations when in pandas we end up with object dtype, which we should probably avoid. Here's an example: https://github.com/pandas-dev/pandas/issues/50887 (polars would just error in such a case, which I think is the correct thing to do)

  • pandas-stubs

    Public type stubs for pandas

  • There is a typing effort that is led by some core members (unfortunately none of them takes part today). You can check the stubs package out at https://github.com/pandas-dev/pandas-stubs. I am not really familiar with the progress there

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • scikit-learn

    scikit-learn: machine learning in Python

  • There's an issue here about that https://github.com/scikit-learn/scikit-learn/discussions/25450

  • bench-warmers

    DigThatData's Public Brainstorming space

  • you've sort of become victims of your own success: as another pandas dev mentioned, you want to preserve backwards compatibility and this significantly complicates any restructuring. I'm sympathetic and am not sure what the best solution here would be. I had this idea last night but i'm not sure I like this approach either.

  • python-bigquery-pandas

    Google BigQuery connector for pandas

  • I'm not sure if there is already support for all Arrow complex types in pandas 2.0, but we have some support of lists for sure, and I think structs too. For the bigquery part, I think you can ask this to the developers of this repo: https://github.com/googleapis/python-bigquery-pandas We basically wrap that library with the read_gbq() function. but there is not much big query specific in pandas other than that, so not much idea.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Read files from s3 using Pandas/s3fs or AWS Data Wrangler?

    3 projects | /r/dataengineering | 6 Dec 2023
  • The Design Philosophy of Great Tables (Software Package)

    7 projects | news.ycombinator.com | 4 Apr 2024
  • Welcome to 14 days of Data Science!

    1 project | dev.to | 7 Mar 2024
  • Data Science for Beginners - A Curriculum

    1 project | /r/programming | 8 Sep 2023
  • How to Build and Deploy a Machine Learning model using Docker

    5 projects | dev.to | 30 Jul 2023