polars
rust-csv
 | polars | rust-csv
---|---|---
Mentions | 144 | 6
Stars | 26,043 | 1,601
Growth | 6.1% | -
Activity | 10.0 | 4.8
Last commit | 4 days ago | 14 days ago
Language | Rust | Rust
License | MIT License | The Unlicense
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
polars
-
Why Python's Integer Division Floors (2010)
This is because 0.1 is in actuality the floating-point value 0.1000000000000000055511151231257827021181583404541015625, and thus 1 divided by it is ever so slightly smaller than 10. Nevertheless, fpround(1 / fpround(1 / 10)) = 10 exactly.
I found out about this recently because in Polars I defined a // b for floats to be (a / b).floor(), which does return 10 for this computation. Since Python's correctly-rounded division is rather expensive, I chose to stick to this (more context: https://github.com/pola-rs/polars/issues/14596#issuecomment-...).
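The discrepancy described in this excerpt can be reproduced with plain Python floats. The snippet below is only an illustrative sketch using the standard library; it does not call Polars, and `math.floor(1 / 0.1)` merely stands in for the `(a / b).floor()` definition mentioned above.

```python
import math
from decimal import Decimal

# Decimal(float) shows the exact binary value stored for the literal 0.1.
exact_tenth = Decimal(0.1)

true_division = 1 / 0.1                  # quotient rounded to the nearest double
python_floor_div = 1 // 0.1              # Python's built-in floor division
floor_of_division = math.floor(1 / 0.1)  # stand-in for (a / b).floor()

print(exact_tenth)
print(true_division, python_floor_div, floor_of_division)
```

Python's `//` returns 9.0 here because the exact quotient is just below 10, while flooring the already-rounded quotient returns 10.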
-
Polars
https://github.com/pola-rs/polars/releases/tag/py-0.19.0
-
Stuff I Learned during Hanukkah of Data 2023
That turned out to be related to pola-rs/polars#11912, and this linked comment provided a deceptively simple solution - use PARSE_DECLTYPES when creating the connection:
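The original snippet from that post is not reproduced here. As a rough sketch of what passing `PARSE_DECLTYPES` to the standard-library `sqlite3` module does, the example below uses a hypothetical `orders` table and an explicitly registered converter for the declared type `date`; both are assumptions made for illustration, not the code from the linked comment.

```python
import datetime
import sqlite3

# Hypothetical converter: turn values from columns declared as "date"
# into datetime.date objects. Converters receive the raw bytes.
sqlite3.register_converter(
    "date", lambda raw: datetime.date.fromisoformat(raw.decode())
)

# PARSE_DECLTYPES makes sqlite3 look up a converter by each column's declared type.
conn = sqlite3.connect(":memory:", detect_types=sqlite3.PARSE_DECLTYPES)
conn.execute("CREATE TABLE orders (id INTEGER, ordered date)")  # hypothetical table
conn.execute("INSERT INTO orders VALUES (?, ?)", (1, "2023-12-08"))

row = conn.execute("SELECT id, ordered FROM orders").fetchone()
print(row)
conn.close()
```

Without `detect_types`, the `ordered` column would come back as a plain string; with it, the row carries a `datetime.date`.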
- Polars 0.20 Released
- Second language
- Polars: Dataframes powered by a multithreaded query engine, written in Rust
- Summing columns in remote Parquet files using DuckDB
- Polars 0.34 is released. (A query engine focussing on DataFrame front ends)
rust-csv
-
A question for all those that use Python
Serde for most of your input and output formats, with the serde-yaml and csv crates for format backends.
-
Specific csv file manipulation
If you want to do it in Rust, then you could combine the https://github.com/mjc-gh/rev_lines and https://github.com/BurntSushi/rust-csv crates.
-
How to convert xslx to csv using Rust?
csv for writing to CSV
-
anyone using rust in production? what do you do?
Pair that with Serde for serialization/deserialization (JSON, TOML, YAML, CSV/TSV, XML, URL query strings, etc.), Figment for configuration, and ignore for filesystem traversal with blacklist support, and Rust is a real joy for writing CLI utilities.
-
Deserializing a CSV file with serde to an internally tagged enum doesn't seem to work
I had a similar issue and learned that internally tagged enums are not (and apparently can't be) supported: GitHub issue.
-
Data Manipulation: Pandas vs Rust
Yep, I'll try to have a look at the nesting PR https://github.com/BurntSushi/rust-csv/pull/197 tonight, don't want to be a bitch, and not helping ahah :)
What are some alternatives?
vaex - Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
nom - Rust parser combinator framework
modin - Modin: Scale your Pandas workflows by changing a single line of code
rust-peg - Parsing Expression Grammar (PEG) parser generator for Rust
arrow-datafusion - Apache DataFusion SQL Query Engine
zero - A Rust library for zero-allocation parsing of binary data.
DataFrames.jl - In-memory tabular data in Julia
pest - The Elegant Parser
datatable - A Python package for manipulating 2-dimensional tabular data structures
grex - A command-line tool and Rust library with Python bindings for generating regular expressions from user-provided test cases
Apache Arrow - Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
lalrpop - LR(1) parser generator for Rust