Python Pandas

Open-source Python projects categorized as Pandas

Top 23 Python Panda Projects

  • Pandas

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

  • Project mention: Deploying a Serverless Dash App with AWS SAM and Lambda | dev.to | 2024-03-04

    Dash is a Python framework that enables you to build interactive frontend applications without writing a single line of Javascript. Internally and in projects we like to use it in order to build a quick proof of concept for data driven applications because of the nice integration with Plotly and pandas. For this post, I'm going to assume that you're already familiar with Dash and won't explain that part in detail. Instead, we'll focus on what's necessary to make it run serverless.

  • 30-Days-Of-Python

    30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw

  • Project mention: Top GitHub Resources to Level Up Your Python game | dev.to | 2023-11-27

    πŸŽ‡ Repository Link: 30 Days of Python

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • tqdm

    :zap: A Fast, Extensible Progress Bar for Python and CLI

  • Project mention: Neat Parallel Output in Python | news.ycombinator.com | 2024-02-25

    yeah my code needs to use multiprocessing, which does not play nice with tqdm. thanks for the tip about positions though, that helped me search more effectively and came up with two promising comments. unmerged / require some workarounds, but might just work:

    https://github.com/tqdm/tqdm/issues/1000#issuecomment-184208...

  • data-science-ipython-notebooks

    Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • datasets

    πŸ€— The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

  • Project mention: 🐍🐍 23 issues to grow yourself as an exceptional open-source Python expert πŸ§‘β€πŸ’» πŸ₯‡ | dev.to | 2023-10-19
  • ydata-profiling

    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

  • Project mention: FLaNK 25 December 2023 | dev.to | 2023-12-26
  • Dask

    Parallel computing with task scheduling

  • Project mention: The Distributed Tensor Algebra Compiler (2022) | news.ycombinator.com | 2023-06-15
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • seaborn

    Statistical data visualization in Python

  • Project mention: Apache Superset | news.ycombinator.com | 2024-02-26

    If you are doing data analysis I don't think any of the 3 pieces of software you mentioned are going to be that helpful.

    I see these products as tools for data visualization and reporting i.e. presenting prepared datasets to users in a visually appealing way. They aren't as well suited for serious analytics.

    I can't comment on Superset or Tableau but I am familiar with Power BI (it has been rolled out across my org), the type of statistics you can do with it are fairly rudimentary. If you need to do any thing beyond summarizing (counts, averages, min, max etc). It is not particularly easy.

    For data analysis I use SAS or R. This software allows you do things like multivariate regression, timeseries forecasting, PCA, Cluster analysis etc. There is also plotting capability.

    Both these products are kind of old school, I've been using them since early 2000's, the "new school" seems to be Python. Pretty much all the recent data science people in my organization use Python. Particularly Pandas and libraries like Seaborn (https://seaborn.pydata.org/).

    The "power" users of Power BI in my organization tend to be finance/HR people for use cases like drill down into cost figures or Interactively presenting KPI's and other headline figures to management things like that.

  • yfinance

    Download market data from Yahoo! Finance's API

  • Project mention: How to catch exceptions in library? | /r/learnpython | 2023-07-06

    If you check the file here - https://github.com/ranaroussi/yfinance/blob/main/yfinance/base.py - you can see this is communicated via the "raise Exception('%s: %s' % (self.ticker, err_msg))" line. I'm trying to use the following to catch the exception but no luck.

  • pygwalker

    PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis

  • Project mention: Show HN: Use an "eraser" to clean data on flight without breaking your workflow | news.ycombinator.com | 2024-03-15
  • modin

    Modin: Scale your Pandas workflows by changing a single line of code

  • Project mention: The Distributed Tensor Algebra Compiler (2022) | news.ycombinator.com | 2023-06-15
  • mlcourse.ai

    Open Machine Learning Course

  • Project mention: Open Machine Learning Course | news.ycombinator.com | 2023-10-22
  • visidata

    A terminal spreadsheet multitool for discovering and arranging data

  • Project mention: Fx – Terminal JSON Viewer | news.ycombinator.com | 2023-09-19

    [4] "Is it possible to "flatten" structured data (like JSON?)": https://github.com/saulpw/visidata/discussions/1605

  • lux

    Automatically visualize your pandas dataframe via a single print! πŸ“Š πŸ’‘ (by lux-org)

  • pandas-ta

    Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 150+ Indicators

  • Project mention: Help recreating ta-lib python MACDFIX in pure python | /r/algotrading | 2023-05-03

    I do not know what is the difference between MACD and MACDFIX but maybe you can take a look how MACD is implemented in pandas_ta library and modify it a bit to achive a behavior you want.

  • orange

    🍊 :bar_chart: :bulb: Orange: Interactive data analysis

  • Project mention: Hierarchical Clustering | news.ycombinator.com | 2024-04-20

    I know I've tooted its horn before, but Orange3 is a pretty neat Python-based GUI platform that makes this and a metric buttload of other statistical/ML techniques available to non-programmer types.

    Just watch out for null character `x00` in the corpus. That always seems to kill it stone dead.

    https://orangedatamining.com/

    https://orange3.readthedocs.io/projects/orange-visual-progra...

  • Mimesis

    Mimesis is a powerful Python library that empowers developers to generate massive amounts of synthetic data efficiently.

  • geopandas

    Python tools for geographic data

  • Project mention: Geopandas spatial predicate performance increase? | /r/gis | 2023-05-12

    Interesting! Full honesty I don’t know, lol. But if I were to guess… something to do with implementation of pygeos and indexing. Recent geopandas versions I believe handle joins with shapely vs. pygeos differently. Maybe if there is a declared predicate, it treats the spatial index differently. https://github.com/geopandas/geopandas/pull/1421

  • alpha_vantage

    A python wrapper for Alpha Vantage API for financial data.

  • Project mention: alpha_vantage: NEW Data - star count:3917.0 | /r/algoprojects | 2023-05-15
  • ibis

    the portable Python dataframe library

  • Project mention: Show HN: Hashquery, a Python library for defining reusable analysis | news.ycombinator.com | 2024-04-23

    I really don't understand the appeal of dbt vs a proper programming language. The templating approach leads to massive spaghetti. I look forward to trying out something like Ibis [0]

    0: https://ibis-project.org/

  • AWS Data Wrangler

    pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

  • Project mention: Read files from s3 using Pandas/s3fs or AWS Data Wrangler? | /r/dataengineering | 2023-12-06

    I had no problem with awswrangler (https://github.com/aws/aws-sdk-pandas) and it supports reading and writing partitions which was really helpful and a few other optimizations that made it a great tool

  • missingno

    Missing data visualization module for Python.

  • Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials

    A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Pandas related posts

Index

What are some of the best open-source Panda projects in Python? This list will help you:

Project Stars
1 Pandas 41,923
2 30-Days-Of-Python 31,031
3 tqdm 27,405
4 data-science-ipython-notebooks 26,459
5 datasets 18,376
6 ydata-profiling 12,022
7 Dask 11,982
8 seaborn 11,946
9 yfinance 11,778
10 pygwalker 9,759
11 modin 9,465
12 mlcourse.ai 9,390
13 visidata 7,409
14 lux 4,915
15 pandas-ta 4,732
16 orange 4,604
17 Mimesis 4,300
18 geopandas 4,177
19 alpha_vantage 4,155
20 ibis 4,074
21 AWS Data Wrangler 3,797
22 missingno 3,771
23 Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials 3,638

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com