Top 23 CSV Open-Source Projects
-
-
Papa Parse
Fast and powerful CSV (delimited text) parser that gracefully handles large files and malformed input
-
-
Project mention: What are the most useful VSCode extensions you know which could be reimplemented in Emacs? | reddit.com/r/emacs | 2021-03-31
I would love if someone ported this to emacs, but perhaps in the meantime check out q (https://github.com/harelba/q), a command line tool that offers similar query abilities.
-
It looks like exceljs is able to work directly with buffers: https://github.com/exceljs/exceljs#reading-xlsx
-
Project mention: If you want to run SQL queries on CSV files from the command line without installing/opening any DBMS software, use CSVKIT | reddit.com/r/SQL | 2021-04-14
I've also had success using this tool to do sql like operations on csv files! https://github.com/BurntSushi/xsv
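The idea discussed in these mentions, running SQL over a CSV file without setting up a database server, can be sketched with nothing but the Python standard library. This is only an illustration of the general approach, not how q, csvkit, or xsv are implemented; the `query_csv` helper, the `data` table name, and the sample rows are all hypothetical.

```python
import csv
import io
import sqlite3


def query_csv(csv_text, sql, table="data"):
    """Load CSV text into an in-memory SQLite table and run one SQL query.

    Hypothetical helper for illustration; every column is stored as text.
    """
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, body = rows[0], rows[1:]
    conn = sqlite3.connect(":memory:")
    # Quote column names so headers containing spaces still work.
    columns = ", ".join('"{}"'.format(name.replace('"', '""')) for name in header)
    placeholders = ", ".join("?" for _ in header)
    conn.execute('CREATE TABLE "{}" ({})'.format(table, columns))
    conn.executemany(
        'INSERT INTO "{}" VALUES ({})'.format(table, placeholders), body
    )
    result = conn.execute(sql).fetchall()
    conn.close()
    return result


# Made-up sample data standing in for a CSV file on disk.
SAMPLE = "name,city\nAda,London\nLinus,Helsinki\nGrace,London\n"

if __name__ == "__main__":
    print(query_csv(SAMPLE, "SELECT city, COUNT(*) FROM data GROUP BY city ORDER BY city"))
```

Because the table lives in memory, nothing is written to disk; for large files a dedicated tool such as the ones mentioned above is likely a better fit.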
-
Project mention: Querying an XML, JSON, CSV, or RDF database | reddit.com/r/ItalyInformatica | 2021-03-31
For an extensive annotated list, see https://github.com/dbohdan/structured-text-tools
-
Project mention: SQLite the only database you will ever need in most cases | news.ycombinator.com | 2021-04-15
-
-
csvkit
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
Project mention: If you want to run SQL queries on CSV files from the command line without installing/opening any DBMS software, use CSVKIT | reddit.com/r/learnpython | 2021-04-14
-
Project mention: `uq is a simple, user-friendly alternative to `sort | uniq`. | reddit.com/r/commandline | 2021-04-15
Run vd (VisiData) on the file, press Shift+F, and you instantly get unique lines sorted by number of uses. Like sort | uniq -c | sort -n in one go.
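For readers unfamiliar with the shell pipeline named above, here is a rough Python sketch of what `sort | uniq -c | sort -n` computes: each distinct line with its number of occurrences, least frequent first. The function name is hypothetical, and the tie-breaking order is an assumption, since the shell version depends on locale.

```python
from collections import Counter


def count_unique_lines(lines):
    """Return (line, count) pairs sorted by count, least frequent first."""
    counts = Counter(line.rstrip("\n") for line in lines)
    # Sort by count, then by the line text so ties come out in a stable order.
    return sorted(counts.items(), key=lambda item: (item[1], item[0]))


if __name__ == "__main__":
    sample = ["b\n", "a\n", "b\n", "c\n", "b\n", "a\n"]
    for line, count in count_unique_lines(sample):
        print(count, line)
```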
-
Project mention: How to test CSV in Drupal without losing your mind | reddit.com/r/drupal | 2021-03-18
To add to this, this library is really good for parsing CSV files: https://github.com/box/spout
-
Project mention: How do I import data from a text file. BTW, I’m a noob. | reddit.com/r/PostgreSQL | 2021-04-22
-
-
miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Project mention: Querying an XML, JSON, CSV, or RDF database | reddit.com/r/ItalyInformatica | 2021-03-31
-
octosql
OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.
Project mention: Minimally Invasive (and More Accurate) Analytics: GoAccess and Athena/SQL | news.ycombinator.com | 2021-02-16
With OctoSQL[0], as I wanted to see how people are using it, I literally just set up an http endpoint which received a JSON request on each CLI invocation (you can see the data sent in the code, it's open source) and appended it to an on-disk JSON file.
Then I used... OctoSQL to analyze it!
Nit: The project may seem dead for a few months, but I'm just in the midst of a rewrite (on a branch) which gets rid of wrong decisions and makes it easier to embed in existing applications.
-
Project mention: Ask HN: Software for Time Series Data Manipulation | news.ycombinator.com | 2021-02-17
Based on your comment, I assume you mean a GUI program for interactive exploration and manipulation, rather than a library?
This was posted on HN a few months ago and came to mind: https://github.com/facontidavide/PlotJuggler
In a past job, we used the commercial app nCode Glyphworks for graphical / interactive signal processing from field tests with hundreds of sensors.
-
django-import-export
Django application and library for importing and exporting data with admin integration.
Doc: https://django-import-export.readthedocs.io/en/latest/ Source: https://github.com/django-import-export/django-import-export
-
-
Project mention: Cereal Monitor Opensource Dropin Replacement For | reddit.com/r/arduino | 2021-02-06
you folks might wanna check SerialStudio out https://github.com/Serial-Studio/Serial-Studio
-
Project mention: Late Night Random Discussion Thread - February 05, 2021 at 09:00PM | reddit.com/r/unitedstatesofindia | 2021-02-05
JobFunnel (repo for scraping)
-
ISO-3166-Countries-with-Regional-Codes
ISO 3166-1 country lists merged with their UN Geoscheme regional codes in ready-to-use JSON, XML, CSV data sets
Project mention: Looking for recommendations for an API to lookup 3-letter country ISO codes with approximate matching | reddit.com/r/learnprogramming | 2021-03-25
You can download the country code data from https://github.com/lukes/ISO-3166-Countries-with-Regional-Codes
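As a rough sketch of the approximate lookup asked about in that thread, a downloaded CSV of country names and 3-letter codes could be searched with Python's standard `difflib` module. The column names `name` and `alpha-3`, the tiny inline sample, and both helper functions are assumptions for illustration; check the actual header of the file you download.

```python
import csv
import difflib
import io

# Tiny inline sample; the column names "name" and "alpha-3" are assumed here.
SAMPLE_CSV = """name,alpha-3
France,FRA
Germany,DEU
Italy,ITA
Japan,JPN
"""


def load_codes(csv_text):
    """Map lower-cased country names to their 3-letter codes."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {row["name"].lower(): row["alpha-3"] for row in reader}


def lookup_alpha3(query, codes, cutoff=0.6):
    """Return the code of the closest-matching name, or None if nothing is close."""
    matches = difflib.get_close_matches(query.lower(), list(codes), n=1, cutoff=cutoff)
    return codes[matches[0]] if matches else None


if __name__ == "__main__":
    codes = load_codes(SAMPLE_CSV)
    print(lookup_alpha3("Germny", codes))
```

Raising `cutoff` makes the match stricter, which reduces false positives at the cost of rejecting more misspellings.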
-
Universal Data Tool
Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.
-
This one comes to mind: https://github.com/ben-strasser/fast-cpp-csv-parser
Index
What are some of the best open-source CSV projects? This list will help you:
Rank | Project | Stars |
---|---|---|
1 | SheetJS js-xlsx | 25,264 |
2 | Papa Parse | 9,610 |
3 | q | 8,110 |
4 | ExcelJS | 7,410 |
5 | xsv | 7,026 |
6 | structured-text-tools | 5,690 |
7 | datasette | 5,003 |
8 | ServiceStack | 4,899 |
9 | csvkit | 4,548 |
10 | visidata | 3,739 |
11 | Spout | 3,469 |
12 | pgloader | 3,353 |
13 | CSV | 2,738 |
14 | miller | 2,710 |
15 | octosql | 2,402 |
16 | PlotJuggler | 2,188 |
17 | django-import-export | 2,090 |
18 | AdaptiveTableLayout | 1,763 |
19 | Serial-Studio | 1,629 |
20 | JobFunnel | 1,464 |
21 | ISO-3166-Countries-with-Regional-Codes | 1,393 |
22 | Universal Data Tool | 1,388 |
23 | Fast C++ CSV Parser | 1,249 |