CSVLint VS csv_log_cleaner

Compare CSVLint vs csv_log_cleaner and see what are their differences.

CSVLint

CSV Lint plug-in for Notepad++ for syntax highlighting, csv validation, automatic column and datatype detecting, fixed width datasets, change datetime format, decimal separator, sort data, count unique values, convert to xml, json, sql etc. A plugin for data cleaning and working with messy data files. (by BdR76)

csv_log_cleaner

Clean CSV files to conform to a type schema by streaming them through small memory buffers using multiple threads and logging data loss. (by ambidextrous)
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
CSVLint csv_log_cleaner
44 2
134 2
- -
7.6 6.3
about 1 month ago about 2 months ago
C# Rust
GNU General Public License v3.0 only MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

CSVLint

Posts with mentions or reviews of CSVLint. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-11.

csv_log_cleaner

Posts with mentions or reviews of csv_log_cleaner. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-04-14.
  • How do you guys handle pandas and its sh*tty data type inference
    3 projects | /r/Python | 14 Apr 2023
    Sounds like it could be more of a data cleansing problem you're facing than a data inference one. Even a single non-numerical value in a million rows of numbers will necessarily mess up type inference for the whole column. I work with a lot of CSVs and that's one of the issues we have to spend a huge amount of time dealing with. I even ended up writing this open source tool to handle the cleansing: https://github.com/ambidextrous/csv_log_cleaner
  • Hey Rustaceans! Got a question? Ask here! (39/2022)!
    11 projects | /r/rust | 26 Sep 2022
    Hi. I'm new to Rust. I've written up a little opensource tool to clean CSV files as a practical learning exercise that will help me with my job: https://github.com/ambidextrous/csv_cleaner Where would be a good place to post it for code review?

What are some alternatives?

When comparing CSVLint and csv_log_cleaner you can also consider the following projects:

OpenRefine - OpenRefine is a free, open source power tool for working with messy data and improving it

unescape-rs - "Unescapes" strings with escape sequences written with literal characters and converts it into a properly escaped one.

datasetmultitool - CSV lint tool to validate csv files. It is a helper utility to process csv textfiles and check for data errors. It can check text width, validate and reformat date and datetime values, change point or comma decimal separator, remove thousand separator and change column order.

doku - fn(Code) -> Docs

CsvQuery - Plugin for Notepad++ that treats CSV files as (read only) SQL tables

mimalloc - mimalloc is a compact general purpose allocator with excellent performance.

Customer-Analysis-Tableau - This repository contains the data source and the tableau workbook used in my YouTube video: https://www.youtube.com/watch?v=_qReGTOrKTk

dtype_diet - Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM

NppPluginLexerExample - Notepad++ Plug-in Lexer and Folder example using the C# template

Peroxide - Rust numeric library with R, MATLAB & Python syntax

Pandas - Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

esp8266-hal - A experimental hardware abstraction layer for the esp8266 written in Rust.