tablib
feather
Our great sponsors
tablib | feather | |
---|---|---|
2 | 3 | |
4,524 | 2,708 | |
0.9% | - | |
7.0 | 0.0 | |
20 days ago | over 2 years ago | |
Python | JavaScript | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tablib
-
Is this possible with Python?
other than Pandas, you can also use tablib. I personally find tablib to be slightly easier but it doesn't have as many features. But for what you need, tablib might be best
-
Fun with File Formats
There are two problems leading to the decision of only accepting public domain info: licensing and provenance.
"Licensing" is hard. The "Open Specifications Promise" [1], which covers a bunch of Microsoft-designed file formats, is merely a covenant not to sue.
"Provenance" is tricky. For example, much of the knowledge of the Apple iWork formats were derived by reverse-engineering the source programs and extracting protobuf definitions. Many open source projects have freely copied from each other, making detailed analysis tricky [2].
[1] https://en.wikipedia.org/wiki/Microsoft_Open_Specification_P...
[2] https://github.com/jazzband/tablib/issues/114
feather
- Best resources for learning R, with Python (pandas, sklearn, scipy, numpy) background?
- Fun with File Formats
-
Vineyard: An open-source in-memory data manager
It'd be interesting to know how this compares with alternative solutions.
I might not understand the benefit proposition correctly, and I'm not specifically into Python for data work, but I immediately thought of things like feather[1], fst[2], disk.frame[3] and even DuckDB[4].
Some of these are on disk rather than in memory, but I'd still be interested in performance and use case comparisons.
[1] https://github.com/wesm/feather
[2] https://www.fstpackage.org/fst/
[3] https://diskframe.com/
[4] https://duckdb.org/
What are some alternatives?
pymorphy2 - Morphological analyzer / inflection engine for Russian and Ukrainian languages.
libvineyard - vineyard (v6d): an in-memory immutable data manager. [Moved to: https://github.com/alibaba/v6d]
Kaitai Struct - Kaitai Struct: declarative language to generate binary data parsers in C++ / C# / Go / Java / JavaScript / Lua / Nim / Perl / PHP / Python / Ruby
tika-docker - Convenience Docker images for Apache Tika Server
DistorteD - Ruby multimedia toolkit with deep Jekyll integration ๐งช
file - Read-only mirror of file CVS repository, updated every half hour. NOTE: do not make pull requests here, nor comment any commits, submit them usual way to bug tracker or to the mailing list. Maintainer(s) are not tracking this git mirror.
SheetJS js-xlsx - ๐ SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs
fuzzywuzzy - Fuzzy String Matching in Python