Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work. Learn more →
Top 4 Python Arrow Projects
🏹 Better dates & times for Python (by arrow-py)Project mention: Useful Python Modules for us | reddit.com/r/savedForMS | 2022-12-04
pdbpp: Improved pdb boltons: assorted python addtions twisted: event driven networking framework sorcery: Dark magic in python, things know where+how they are being called, helps reducing boilerplate sh: Better alternative for subprocess module, much more pythonic taskipy: npm run scipt_name like functionality snoop: pdb lite, record+replay function steps birdseye: graphical debugger remote-pdb: easy pdb from inside containers typer: wrapper around click for simpler code for CLIs arrow: Always TZ aware datetimes, plus more features more-itertools: more functions for iterators pydantic: data validation + dataclasses loguru: better logging notifiers: sending notifications from python
VisiData interface for databasesProject mention: Is there a CLI interface to browse SQL databases? | reddit.com/r/commandline | 2022-08-10
I would especially keep an eye on the `vdsql` project, a VisiData plugin that uses `Ibis` to harness the power of SQL: https://github.com/visidata/vdsql =) It just had its first release.
Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.
🦖 An in-process SQL query federator, for analysis of distributed datasetsProject mention: Pure Python Distributed SQL Engine | news.ycombinator.com | 2022-12-30
Thanks for sharing.
I have a SQL Engine in Python too (https://github.com/mabel-dev/opteryx). I focused my initial effort on supporting SQL statements and making the usage feel like a database - that probably reflects the problem I had in front of me when I set out - only handling handfuls of gigabytes in a batch environment for ETLs with a group of new-to-data-engineering engineers. Have recently started looking more at real-time performance, such as distributing work. Am interesting in how you've approached.
A Python wrapper around calamine (by ToucanToco)Project mention: Meet FastExcel, the fastest excel reader wrote in rust with a python binding. | reddit.com/r/rust | 2022-11-23
Python Arrow related posts
A Polars exploration into Kedro
6 projects | dev.to | 17 May 2023
New Pandas-for-Haskell data frame library: Name suggestions
3 projects | reddit.com/r/haskell | 17 Aug 2022
How to Deploy ML Models Using Gravity AI and Meadowrun
4 projects | dev.to | 17 Aug 2022
Apache Hop 2.0
4 projects | news.ycombinator.com | 8 Jun 2022
How to use Spark and Pandas to prepare big data
3 projects | dev.to | 10 May 2022
Awkward: Nested, jagged, differentiable, mixed type, GPU-enabled, JIT'd NumPy
5 projects | news.ycombinator.com | 16 Dec 2021
Test Parquet float16 Support in Pandas
3 projects | dev.to | 14 Dec 2021
A note from our sponsor - Sonar
www.sonarsource.com | 31 May 2023
What are some of the best open-source Arrow projects in Python? This list will help you: