How do you sort a CSV file with several million rows?

This page summarizes the projects mentioned and recommended in the original post on /r/dataengineering

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • DataGristle

    Tough and flexible tools for data analysis, transformation, validation and movement.

  • DataGristle: this one contains some more unusual csv utilities, and what's in master includes the ability to sort by field names rather than offsets: https://github.com/kenfar/DataGristle

  • xsv

    A fast CSV command line toolkit written in Rust.

  • xsv: super-fast rust csv toolkit. I haven't tested it, but the documentation doesn't mention escapechars, double-quoting, or handling of delimiters or newlines embedded within quoted fields. So, this is the fastest but maybe not the safest option: https://github.com/BurntSushi/xsv

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Anyone else feel like they are using Pandas as a crutch?

    1 project | /r/dataengineering | 5 Mar 2023
  • xsv

    1 project | /r/ITProTuesday | 3 Mar 2023
  • Using Commandline To Process CSV files

    1 project | /r/programming | 14 Dec 2022
  • How do I delete lines in a CSV using Sed based on condition?

    2 projects | /r/commandline | 26 Jul 2022
  • Write a program in Rust to read a CSV file and create two output CSV files – one file with odd rows and the other file with even rows from the input file

    1 project | /r/rust | 17 Jun 2022