Python Pandas

Open-source Python projects categorized as Pandas

Top 23 Python Pandas Projects

  • Pandas

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

    Project mention: Is this the right way to approach the problem? | reddit.com/r/learnpython | 2022-01-23

    First of all, make sure you export the data from Excel into CSV (Comma Separated Values) format, which is going to make it much easier to work with from Python. Another commenter gave a nice link to the Python csv docs. If you wanted to get crazy, you could also check out the popular data science library Pandas, which has a read_csv function that should import your data pretty easily.

  • data-science-ipython-notebooks

    Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • tqdm

    A Fast, Extensible Progress Bar for Python and CLI

    Project mention: tqdm (Python) | news.ycombinator.com | 2021-12-16

    That's a reasonable request. Discussion about a feature along those lines seems to be happening in https://github.com/tqdm/tqdm/issues/614; perhaps you could weigh in there?

  • datasets

    🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

    Project mention: Hugging Face Introduces ‘Datasets’: A Lightweight Community Library For Natural Language Processing (NLP) | reddit.com/r/artificial | 2021-11-08

    Code for https://arxiv.org/abs/2109.02846 found: https://github.com/huggingface/datasets

  • Dask

    Parallel computing with task scheduling

    Project mention: What does it mean to scale your python powered pipeline? | dev.to | 2022-01-03

    Dask: Distributed data frames, machine learning and more

  • seaborn

    Statistical data visualization in Python

    Project mention: Scanned Black Holes | reddit.com/r/eliteexplorers | 2022-01-23

    Graphs, heatmaps and palettes using seaborn

  • modin

    Modin: Speed up your Pandas workflows by changing a single line of code

    Project mention: TIL about modin.pandas which significantly speeds up pandas if you import modin.pandas instead of pandas. | reddit.com/r/u_pygsm | 2021-06-30

  • yfinance

    Download market data from Yahoo! Finance's API

    Project mention: Download market data from Yahoo Finance's API | news.ycombinator.com | 2022-01-20

  • visidata

    A terminal spreadsheet multitool for discovering and arranging data

    Project mention: My high school students' first exposure to Garuda. They were blown away with how cool everything is. | reddit.com/r/linux | 2022-01-22

    VisiData has eliminated my last remaining use cases for Excel.

  • alpha_vantage

    A python wrapper for Alpha Vantage API for financial data.

    Project mention: Yesterday I came across Awesome-Quant repository and it was great. I went ahead and dig through all the backtesting & AI repos from Python and created a list of repo which are most updated & maintained. Let me know if I missed your favorite. | reddit.com/r/algotrading | 2021-05-27

  • lux

    Automatically visualize your pandas dataframe via a single print! 📊 💡

    Project mention: Do you see SQL being under threat in any way as a way of querying databases? I know it's possibly a dumb question but wondering. | reddit.com/r/BusinessIntelligence | 2021-09-27

  • orange

    🍊 📊 💡 Orange: Interactive data analysis

    Project mention: ETL Library for Python | reddit.com/r/Python | 2021-09-27

    "On the simpler side". Do you mean with a graphical interface? Then, orange would be a nice solution. https://orangedatamining.com/

  • koalas

    Koalas: pandas API on Apache Spark

    Project mention: Spark vs Pandas | reddit.com/r/dataengineering | 2021-02-18

    If you like excessive use of square brackets... I mean pandas, you might want to check out Koalas. Koalas is supposed to provide a pandas DataFrame API implementation on top of Spark.

  • missingno

    Missing data visualization module for Python.

    Project mention: For all the python/pandas users out there I just released a bunch of UI updates to the free visualizer, D-Tale | reddit.com/r/algotrading | 2021-04-12

    Analysis of "missing" data using the missingno package is now available in a sliding side panel. Enlarge or download PNG files for matrix/bar/heatmap/dendrogram charts generated using missingno.

  • dtale

    Visualizer for pandas data structures

    Project mention: Show HN: D-Tale, easy to use pandas GUI | news.ycombinator.com | 2021-11-01

  • XlsxWriter

    A Python module for creating Excel XLSX files.

    Project mention: What are you working on? - October 2021 | reddit.com/r/IOPsychology | 2021-10-01

    Bam! https://xlsxwriter.readthedocs.io

  • arctic

    High performance datastore for time series and tick data

    Project mention: arctic: NEW Data - star count:2555.0 | reddit.com/r/algoprojects | 2022-01-21

  • PandasGUI

    A GUI for Pandas DataFrames

    Project mention: Low-code GUI tools for PySpark? | reddit.com/r/apachespark | 2021-12-09

    Similar to the several pandas low-code GUI tools such as [bamboolib](https://bamboolib.8080labs.com) or [PandasGUI](https://github.com/adamerose/PandasGUI), is there something available for PySpark?

  • AWS Data Wrangler

    Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

    Project mention: Automate some wrangling and data visualization in Python | reddit.com/r/aws | 2022-01-03

  • xarray

    N-D labeled arrays and datasets in Python

    Project mention: How we found and helped fix 24 bugs in 24 hours (in Tensorflow, Sentry, V8, PyTorch, Hue, and more) | dev.to | 2022-01-05

    Pydata's xarray

  • mars

    Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.

  • pandas-datareader

    Extract data from a wide range of Internet sources into a pandas DataFrame.

    Project mention: Best quantitative tools/repos/apis for Sentiment & Social Media analysis of individual Stock/Crypto tickers | reddit.com/r/algotrading | 2021-07-03

    Also Yahoo continually takes steps to discourage programmatic access (the most recent attempt is happening right now: https://github.com/pydata/pandas-datareader/issues/868).

  • pandas-ta

    Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 130+ Indicators

    Project mention: Pandas TA? | reddit.com/r/learnpython | 2022-01-14

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2022-01-23.
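
The mention quoted under the Pandas entry suggests exporting from Excel to CSV and loading the result with read_csv. A minimal sketch of that step follows. The inline CSV text and the column names `name` and `score` are made up for illustration and stand in for a real exported file; in practice you would likely pass a file path instead.

```python
import io

import pandas as pd

# Hypothetical stand-in for a file exported from Excel as CSV.
csv_text = "name,score\nalice,90\nbob,85\n"

# read_csv accepts a file path or a file-like object; StringIO keeps
# this sketch self-contained so it runs without any file on disk.
df = pd.read_csv(io.StringIO(csv_text))

print(df.shape)            # (2, 2): two data rows, two columns
print(df["score"].mean())  # 87.5
```

The first row of the CSV is treated as the header by default, so the columns can be selected by name right after loading.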

Index

What are some of the best open-source Pandas projects in Python? This list will help you:

 #   Project                          Stars
 1   Pandas                           32,439
 2   data-science-ipython-notebooks   22,360
 3   tqdm                             20,919
 4   datasets                         11,855
 5   Dask                             9,424
 6   seaborn                          9,094
 7   modin                            6,704
 8   yfinance                         6,487
 9   visidata                         4,738
 10  alpha_vantage                    3,581
 11  lux                              3,273
 12  orange                           3,237
 13  koalas                           3,061
 14  missingno                        3,045
 15  dtale                            2,936
 16  XlsxWriter                       2,800
 17  arctic                           2,561
 18  PandasGUI                        2,528
 19  AWS Data Wrangler                2,457
 20  xarray                           2,382
 21  mars                             2,336
 22  pandas-datareader                2,211
 23  pandas-ta                        2,059