TableIO.jl
arrow-julia
Metric | TableIO.jl | arrow-julia
---|---|---
Mentions | 1 | 4
Stars | 13 | 277
Growth | - | 1.8%
Activity | 0.0 | 6.2
Last commit | over 1 year ago | 16 days ago
Language | Julia | Julia
License | MIT License | GNU General Public License v3.0 or later
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
TableIO.jl
- Reading HDF5 Files
You can use https://github.com/lungben/TableIO.jl
arrow-julia
- Julia 1.8 has been released
For some examples of people porting existing C++ or Fortran libraries to Julia, you should check out https://github.com/JuliaLinearAlgebra/Octavian.jl, https://github.com/dgleich/GenericArpack.jl, and https://github.com/apache/arrow-julia (just off the top of my head). These are all ports of C++ or Fortran libraries that match (or exceed) the performance of the original, and Arrow.jl in particular is faster, more general, and 10x less code.
- How to adapt Arrow.Table columns (naturally per record batch basis) into CuArrays for GPU processing?
- Reading HDF5 Files
I guess the currently preferred format is not Feather but Arrow: https://github.com/JuliaData/Arrow.jl
- Apache Arrow 3.0.0 Release
Excited to see this release's official inclusion of the pure Julia Arrow implementation [1]!
It's so cool to be able to mmap Arrow memory and natively manipulate it from within Julia with virtually no performance overhead. Since the Julia compiler can specialize on the layout of Arrow-backed types at runtime (just as it can with any other type), the notion of needing to build/work with a separate "compiler for fast UDFs" is rendered obsolete.
It feels pretty magical when two tools like this compose so well without either being designed with the other in mind - a testament to the thoughtful design of both :) mad props to Jacob Quinn for spearheading the effort to revive/restart Arrow.jl and get the package into this release.
[1] https://github.com/JuliaData/Arrow.jl
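A minimal sketch of the workflow the comment above describes, assuming the Arrow.jl package is installed. The table contents, column names, file name, and the `udf` function are hypothetical and chosen only for illustration. It writes a small table to an Arrow file, opens it again with `Arrow.Table` (which memory-maps the file when given a path), and applies an ordinary Julia function to an Arrow-backed column.

```julia
using Arrow

# Hypothetical example data; the column names are illustrative only.
tbl = (id = [1, 2, 3], score = [0.5, 1.5, 2.5])

# Write the table to a temporary Arrow file.
path = joinpath(mktempdir(), "example.arrow")
Arrow.write(path, tbl)

# Opening by file path memory-maps the data rather than copying it.
at = Arrow.Table(path)

# An ordinary Julia function standing in for a "UDF" (hypothetical).
udf(x) = x^2 + 1

# Columns behave like regular AbstractVectors, so generic Julia code applies.
total = sum(udf, at.score)
ids = collect(at.id)
```

No separate UDF compiler is involved here: `udf` is plain Julia, and `sum` is the same generic function used for any other vector.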
What are some alternatives?
vscode-data-preview - Data Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
perspective - A data visualization and analytics component, especially well-suited for large and/or streaming datasets.
prql-query - Query and transform data with PRQL
Apache Arrow - Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
pgloader - Migrate to PostgreSQL in a single command!
arquero - Query processing and transformation of array-backed data tables.
qsv - CSVs sliced, diced & analyzed.
ClickHouse - ClickHouse® is a free analytics DBMS for big data
Stonks.jl - Julia library for standardizing financial data retrieval and storage from multiple APIs.
vega-loader-arrow - Data loader for the Apache Arrow format.
go-py-arrow-bridge - Bridge between Go and Python to facilitate zero-copy using Apache Arrow
cylon - Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.