Our great sponsors
-
Pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
-
CSVLint
CSV Lint plug-in for Notepad++ for syntax highlighting, csv validation, automatic column and datatype detecting, fixed width datasets, change datetime format, decimal separator, sort data, count unique values, convert to xml, json, sql etc. A plugin for data cleaning and working with messy data files.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Does the raw data consist of CSV files? If so I just want to mention that there is a CSV Lint plug-in for Notepad++, which can do some basic error checking, and generate a Python or R script based on the column metadata. The generated script just reads the CSV file into a dataframe, and does need further coding, it's just a starting point so to speak.
Related posts
- Show HN: Use an "eraser" to clean data on flight without breaking your workflow
- Help Us Build Our Roadmap – Pydantic
- Show HN: Data Painter – different way to interact with data in Jupyter notebook
- Mastering Pandas read_csv() with Examples - A Tutorial by Codes With Pankaj
- What Would Go in Your Dream Documentation Solution?