SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 CSV Open-Source Projects
-
SheetJS js-xlsx
📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs
-
Scout Monitoring
Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
-
-
Papa Parse
Fast and powerful CSV (delimited text) parser that gracefully handles large files and malformed input
-
-
pandas-ai
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
-
gotenberg
A developer-friendly API for converting numerous document formats into PDF files, and more!
-
countries-states-cities-database
🌍 Discover our global repository of countries, states, and cities! 🏙️ Get comprehensive data in JSON, SQL, PSQL, XML, YAML, and CSV formats. Access ISO2, ISO3 codes, country code, capital, native language, timezones (for countries), and more. #countries #states #cities
-
-
winmerge
WinMerge is an Open Source differencing and merging tool for Windows. WinMerge can compare both folders and files, presenting differences in a visual text format that is easy to understand and handle.
-
csvkit
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
-
-
octosql
OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
ExcelJS and XLSX (SheetJS) are great libraries to work with XLSX files. The former I've found a bit easier to work with but less efficient in general.
At the click of a button, the data is downloaded and parsed into an array of objects with the help of papa-parse library.
Project mention: PandasAI – Open-Source AI Agents for Data Analysis | news.ycombinator.com | 2024-06-13
What I often use to just get the full key paths is yq (https://github.com/mikefarah/yq), piping into grep when necessary
yq -o=props
Project mention: Show HN: TextQuery – Query and Visualize Your CSV Data in Minutes | news.ycombinator.com | 2024-04-02I realize it's not really that comparable since these tools don't support SQL, but a more fully functioned CLI tool is - https://github.com/BurntSushi/xsv
They are both fairly good
Project mention: Ask HN: High quality Python scripts or small libraries to learn from | news.ycombinator.com | 2024-04-19Simon Willison's github would be a great place to get started imo -
https://github.com/simonw/datasette
Project mention: Data Science at the Command Line, 2nd Edition (2021) | news.ycombinator.com | 2024-05-06I'd like to call out one of my favorite pieces of software from the past 10 years: VisiData [1] has completely changed the way I do ad-hoc data processing, and is now my go-to for pretty much all use cases that I previously used spreadsheets for, and about half of those I previously used databases for.
It's a TUI application, not strictly CLI, but scriptable, and I figure anyone building pipelines using tools like jq, q, awk, grep, etc. to process tabular data will find it extremely useful.
----
[1]: https://visidata.org
Use a server-side headless browser such as puppeteer to convert the HTML to PDF. This is the most reliable free option, but requires a server. If you need to use it in production, we recommend you use Gotenberg.
Project mention: Show HN: DB to map cities to countries and states | news.ycombinator.com | 2024-03-12
Project mention: Why Is Common Lisp Not the Most Popular Programming Language? | news.ycombinator.com | 2024-02-14No, it's difficult to read, and understand. It's a parenthesis circus, example -
https://github.com/dimitri/pgloader/blob/master/src/sources/...
Project mention: Wazero: Zero dependency WebAssembly runtime written in Go | news.ycombinator.com | 2023-07-01Never got it to anything close to a finished state, instead moving on to doing the same prototype in llvm and then cranelift.
That said, here's some of the wazero-based code on a branch - https://github.com/cube2222/octosql/tree/wasm-experiment/was...
It really is just a very very basic prototype.
Project mention: 20 Top C# Frameworks and Libraries on GitHub for Building Powerful Applications | dev.to | 2024-05-2313. CsvHelper
CSV discussion
CSV related posts
-
CSV Lint plug-in for Notepad++ with highlighting, validation and datatypes
-
Ask HN: How would you chunk a large Excel file?
-
Building Agentic RAG with Rust, OpenAI & Qdrant
-
Show HN: Django-import-export v4 is out
-
Eli Bendersky: Faster XML Stream Processing in Go
-
Data Science at the Command Line, 2nd Edition (2021)
-
Plotille: Plot in the terminal using Braille dots
-
A note from our sponsor - SaaSHub
www.saashub.com | 17 Jun 2024
Index
What are some of the best open-source CSV projects? This list will help you:
Project | Stars | |
---|---|---|
1 | SheetJS js-xlsx | 34,665 |
2 | ExcelJS | 13,138 |
3 | Papa Parse | 12,232 |
4 | Laravel-Excel | 12,096 |
5 | pandas-ai | 11,558 |
6 | yq | 11,101 |
7 | xsv | 10,156 |
8 | q | 10,139 |
9 | datasette | 9,026 |
10 | miller | 8,668 |
11 | visidata | 7,502 |
12 | gotenberg | 7,159 |
13 | structured-text-tools | 6,895 |
14 | countries-states-cities-database | 6,734 |
15 | tabula | 6,580 |
16 | winmerge | 5,951 |
17 | countries | 5,912 |
18 | csvkit | 5,875 |
19 | ServiceStack | 5,362 |
20 | pgloader | 5,146 |
21 | octosql | 4,722 |
22 | CsvHelper | 4,595 |
23 | PlotJuggler | 4,174 |