Top 23 Python CSV Projects
-
Project mention: I wrote this iCalendar (.ics) command-line utility to turn common calendar exports into more broadly compatible CSV files. | /r/commandline | 2023-03-24
CSV utilities (still haven't picked a favorite one...): https://github.com/harelba/q https://github.com/BurntSushi/xsv https://github.com/wireservice/csvkit https://github.com/johnkerl/miller
-
Project mention: Seeking Help to Preserve Rare WWII Database on Windows 98 | news.ycombinator.com | 2023-09-18
Looks like it's an Access database. Perhaps convert it to SQLite and publish with something like https://datasette.io/?
I think the problem is, thread author doesn't know how to rip an ISO of the CD or move the database out; looks like they are getting help already though.
-
[4] "Is it possible to "flatten" structured data (like JSON?)": https://github.com/saulpw/visidata/discussions/1605
-
csvkit
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
Project mention: I wrote this iCalendar (.ics) command-line utility to turn common calendar exports into more broadly compatible CSV files. | /r/commandline | 2023-03-24
CSV utilities (still haven't picked a favorite one...): https://github.com/harelba/q https://github.com/BurntSushi/xsv https://github.com/wireservice/csvkit https://github.com/johnkerl/miller
-
django-import-export
Django application and library for importing and exporting data with admin integration.
django-import-export provides a sophisticated framework for importing data. Good if you need to do this on a regular basis and need to do some work on the data before writing to the database.
-
ethereum-etl
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
-
pygraphistry
PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Project mention: How to pass any first-round interview (even in a terrible talent market) | news.ycombinator.com | 2023-07-05
I appreciate the good faith attempt:
https://github.com/graphistry/pygraphistry
And yes, we currently get used by data scientists and devs on problems like supply chain analysis, misinformation, cybersecurity, human trafficking. Seeing 100x+ more data than d3 and having a full env there makes their investigations easier. Our original tech helped lead to what is now Apache Arrow (we wrote the JS tier) and Nvidia RAPIDS (we wrote the precursor in js/opencl, and worked with Nvidia to restart for pydata), and are now focusing on the Nvidia Morpheus & graph AI sides for end-to-end GPU pipelines with our bigger customers (cyber, ...). To make this kind of tech easier for analysts, who are traditionally stuck with Splunk/Kibana/etc style UIs for investigations, we have been launching louie.ai with various customers.
Hopefully now it makes sense why we don't go far with candidates who can't have conversations on these things.
-
datamodel-code-generator
Pydantic model and dataclasses.dataclass generator for easy conversion of JSON, OpenAPI, JSON Schema, and YAML data sources.
Project mention: tRPC – Move Fast and Break Nothing. End-to-end typesafe APIs made easy | news.ycombinator.com | 2023-08-12
Like generating pydantic models or dataclasses for an OpenAPI schema? I haven't needed to go in that direction myself, but this[0] looks promising!
Apologies if I've misunderstood your comment
-
python-benedict
dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, ini, json, pickle, plist, query-string, toml, xls, xml, yaml), s3 support and many utilities.
Project mention: What are the best Python libraries to learn for beginners? | /r/learnpython | 2023-01-30
python-benedict: Dictionary manipulation library
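To illustrate what "keypath support" means in general, here is a minimal standard-library sketch. It does not use python-benedict and is not its API; `get_by_keypath` and the sample `config` dictionary are hypothetical names made up for this example.

```python
from functools import reduce


def get_by_keypath(data, keypath, separator="."):
    """Follow a dotted keypath such as "a.b.c" through nested dicts.

    Hypothetical helper for illustration only; raises KeyError if any
    segment of the path is missing.
    """
    return reduce(lambda node, key: node[key], keypath.split(separator), data)


# Hypothetical nested configuration used only for this example.
config = {"server": {"http": {"port": 8080}}}
port = get_by_keypath(config, "server.http.port")
```

A single string replaces a chain of `config["server"]["http"]["port"]` lookups, which is the convenience a keypath-aware dict subclass is meant to offer.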
-
CleverCSV
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
There’s things like this, but I consider the existence of messy, non standard CSV files (backed by a decade of experience dealing with the problem) a strong reason to not use the format ever.
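For context on what dialect detection involves, the sketch below uses the standard library's `csv.Sniffer`, the builtin facility that CleverCSV says it improves on. It does not call CleverCSV itself; the sample text and the `read_with_detected_dialect` helper are assumptions made for illustration.

```python
import csv
import io

# Hypothetical sample: semicolon-delimited text with one quoted field
# that itself contains the delimiter.
SAMPLE = 'name;city;note\nAna;Lisbon;"likes; semicolons"\nBo;Oslo;plain\n'


def read_with_detected_dialect(text):
    """Guess the dialect with csv.Sniffer, then parse the text with it."""
    dialect = csv.Sniffer().sniff(text, delimiters=",;\t|")
    return list(csv.reader(io.StringIO(text), dialect))


rows = read_with_detected_dialect(SAMPLE)
```

The sniffer works from heuristics on a text sample, so it can guess wrong on messy files, which is the gap tools like CleverCSV aim to close.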
-
Project mention: Dividing Brazil along the 22°S parallel by population | /r/brasil | 2022-11-27
I quickly hacked together the 2021 estimate table (https://www.ibge.gov.br/estatisticas/sociais/populacao/9103-estimativas-de-populacao.html?=&t=resultados) using the latitudes from this dataset: https://github.com/kelvins/Municipios-Brasileiros
-
https://github.com/mariostoev/finviz may be helpful to you
-
extract_otp_secrets
Extract one time password (OTP) secrets from QR codes exported by two-factor authentication (2FA) apps such as "Google Authenticator". The exported QR codes from authentication apps can be captured by camera, read from images, or read from text files. The secrets can be exported to JSON or CSV, or printed as QR codes to console.
Project mention: Google Online Security Blog: Google Authenticator now supports Google Account synchronization | /r/Android | 2023-04-24
I used https://github.com/scito/extract_otp_secrets to export the keys from the Google Authenticator and imported them manually into Bitwarden.
-
pytablewriter
pytablewriter is a Python library to write a table in various formats: AsciiDoc / CSV / Elasticsearch / HTML / JavaScript / JSON / LaTeX / LDJSON / LTSV / Markdown / MediaWiki / NumPy / Excel / Pandas / Python / reStructuredText / SQLite / TOML / TSV.
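As a rough idea of what writing one table to a text format looks like, here is a standard-library sketch that renders rows as a Markdown table. It is not pytablewriter's API; `to_markdown_table` is a hypothetical helper, and the two sample rows reuse star counts from this page's index.

```python
def to_markdown_table(headers, rows):
    """Render headers and rows as a pipe-delimited Markdown table.

    Hypothetical helper for illustration; cells are converted with str()
    and no escaping of "|" inside cells is attempted.
    """
    lines = [
        "| " + " | ".join(headers) + " |",
        "| " + " | ".join("---" for _ in headers) + " |",
    ]
    for row in rows:
        lines.append("| " + " | ".join(str(cell) for cell in row) + " |")
    return "\n".join(lines)


table = to_markdown_table(
    ["Project", "Stars"],
    [["csvkit", 5575], ["CleverCSV", 1119]],
)
print(table)
```

A dedicated library earns its keep once the same rows have to go out as CSV, HTML, LaTeX, and so on, each with its own quoting and escaping rules.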
-
rainbow_csv
🌈Rainbow CSV - Vim plugin: Highlight columns in CSV and TSV files and run queries in SQL-like language
Probably not an exact fit, but this plugin came to mind: rainbow_csv
-
pysonDB
A Simple, ☁️ Lightweight, 💪 Efficient JSON-based database for 🐍 Python. PysonDB-V2 has been released ⬇️
Have a tl;dr on how it differs from or improves on pysonDB?
Python CSV related posts
- Fx – Terminal JSON Viewer
- Why my favourite API is a zipfile on the European Central Bank's website
- The Awk Programming Language, Second Edition
- Question about Merge from CSV
- Excelize
- Looking for two plugins for Log Analysis
- Import or load a json into a database
-
A note from our sponsor - Mergify
blog.mergify.com | 22 Sep 2023
Index
What are some of the best open-source CSV projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | q | 9,937 |
2 | datasette | 8,272 |
3 | visidata | 6,804 |
4 | csvkit | 5,575 |
5 | django-import-export | 2,750 |
6 | ethereum-etl | 2,658 |
7 | pygraphistry | 1,906 |
8 | datamodel-code-generator | 1,834 |
9 | JobFunnel | 1,659 |
10 | DataProfiler | 1,253 |
11 | python-benedict | 1,152 |
12 | pyexcel | 1,137 |
13 | CleverCSV | 1,119 |
14 | Municipios-Brasileiros | 1,008 |
15 | finviz | 931 |
16 | rows | 848 |
17 | csvs-to-sqlite | 779 |
18 | extract_otp_secrets | 768 |
19 | pytablewriter | 568 |
20 | rainbow_csv | 526 |
21 | sterraxcyl | 407 |
22 | test-lists | 376 |
23 | pysonDB | 344 |