Pandas
NumPy
Our great sponsors
Pandas  NumPy  

307  225  
36,692  22,581  
1.0%  1.2%  
10.0  9.9  
5 days ago  1 day ago  
Python  Python  
BSD 3clause "New" or "Revised" License  BSD 3clause "New" or "Revised" License 
Stars  the number of stars that a project has on GitHub. Growth  month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Pandas

How to query pandas DataFrames with SQL
Pandas is a goto tool for tabular data management, processing, and analysis in Python, but sometimes you may want to go from pandas to SQL.

What are the best Python libraries to learn for beginners?
panda’s

Replacing Pandas with Polars. A Practical Guide
> The big thing pandas has going for it is that it's already been through this field testing. All the bugs have been ironed out by the hundreds of thousands of users.
At this very moment pandas github repo has 1563 open issues labeled with a bug tag [0]. So much for "all the bugs have been ironed out".
[0] https://github.com/pandasdev/pandas/issues?q=is%3Aopen+is%3...

Joining the Open Source Development Course
Python is the main programming language I use nowadays. In particular numpy and pandas are of course extremely useful. I also use biopython package  a collection of software tools for biological computation written in Python by an international group of researchers and developers.

Pandas VS Rath  a user suggested alternative
2 projects  12 Jan 2023

Twitter Data Pipeline with Apache Airflow + MinIO (S3 compatible Object Storage)
Below is the python Task that transforms the tweets list into a Pandas dataframe, then dumps it in our MinIO Object Storage as a CSV file:

Hanukkah of Data 2022  Puzzle 2
It was rewarding to dig into SQLite a bit while solving this puzzle, so I figured this would be a good opportunity to learn a bit more about pandas too! So how would I adapt this working SQL solution to pandas?
 ETL using pandas

Tor Network Statistics & Performance [OC]
All the data has been extracted from the official Tor Metrics website, and using Python with the Pandas library I've cleaned the data. Finally, the visualizations have been made with Tableau.

How to take inputs from an ascii file in Python
If you did that you could use a builtin library like csv to read and parse the file or you could use a 3rd party library like Pandas. Alternatively, you could store your file as json:
NumPy

What are the best Python libraries to learn for beginners?
NumPy: Scientific computing library and I know this one is the most popular especially in Data Science.

Joining the Open Source Development Course
Python is the main programming language I use nowadays. In particular numpy and pandas are of course extremely useful. I also use biopython package  a collection of software tools for biological computation written in Python by an international group of researchers and developers.

DO YOU YAML?
If you want to start a project from scratch, I prefer to start with a very basic virtual environment and add the packages I need as I go along. My basic framework usually consists of: Python NumPy Pandas MatplotLib & sometimes Seaborn

Tips for parsing inputs in Python?
Since you mention 2D arrays, did you consider using NumPy? It has tons of useful functions for array operations.

Microfeatures I'd like to see in more languages
> If you look at something like numpy functions, so many of them share the exact same parameter definitions. What if you could write def log(standardexpparams) instead of having to write them out every single time?
They're not actually written out every time, the issue is mostly documentary (and it would be nice if Python or Sphinx ever had a good solution). And numpy actually has a bunch of generators for that e.g. https://github.com/numpy/numpy/blob/45bc13e6d922690eea43b9d8... handles filling in the common bits of documentation for the ufuncs.

IT a trh práce
Python se pouziva vsude a nema specializaci. Asi jedina oblast kde vylozene vynika je Data Science a AI.

The Bitter Truth: Python 3.11 vs Cython vs C++ Performance for Simulations
Not any more. They rewrote the Fortran parts: https://github.com/numpy/numpy

Extending Python with Rust
Definitely. However, the equivalent for the array multiplication in numpy looks like this:
https://github.com/numpy/numpy/blob/22e683d84f2584a6f9a57b2c... + https://github.com/numpy/numpy/blob/22e683d84f2584a6f9a57b2c...
To know what they do, you need the source for INPLACE_GIVE_UP_IF_NEEDED (https://github.com/numpy/numpy/blob/b222eb66c79b8eccba39f46f...) and PyArray_GenericInplaceBinaryFunction (I don't even know where that's coming from, it's not defined in Numpy, maybe it's part of the Python interface?).
In the end, both are unreadable in their own way. I personally prefer the Rust version above to the macrofied C version that's in Numpy but that's a matter of taste. I'd also trust the safe Rust implementations more than the C implementations because of the memory management guarantees Rust provides, though I suppose for simple operations like multiplication it'll be easier to make the program safe enough in C.

I used Python to control a custom stopmotion animation drawing machine
The frames' vector data is generated using a sketch made with my vsketch framework (it involves loading GeoJSON data, processing it with Shapely, and projecting it in 3D with NumPy).

NumPy 1.24.0 has been released
worth a look https://github.com/numpy/numpy/issues/14412
What are some alternatives?
SymPy  A computer algebra system written in pure Python
Cubes  Lightweight Python OLAP framework for multidimensional data analysis
orange  🍊 :bar_chart: :bulb: Orange: Interactive data analysis
tensorflow  An Open Source Machine Learning Framework for Everyone
Airflow  Apache Airflow  A platform to programmatically author, schedule, and monitor workflows
pyexcel  Single API for reading, manipulating and writing data in csv, ods, xls, xlsx and xlsm files
Keras  Deep Learning for humans
Dask  Parallel computing with task scheduling
Pytorch  Tensors and Dynamic neural networks in Python with strong GPU acceleration
blaze  NumPy and Pandas interface to Big Data
Scrapy  Scrapy, a fast highlevel web crawling & scraping framework for Python.