DataProfiler Alternatives

Similar projects and alternatives to DataProfiler

Tailwind CSS (TypeScript)
Mentions: 1,279 | Stars: 78,370 | Activity: 9.4
A utility-first CSS framework for rapid UI development.

scrcpy (C)
Mentions: 983 | Stars: 101,841 | Activity: 9.4
Display and control your Android device

PyTorch (Python)
Mentions: 336 | Stars: 77,783 | Activity: 10.0
Tensors and Dynamic neural networks in Python with strong GPU acceleration

jq (C)
Mentions: 306 | Stars: 25,063 | Activity: 0.0
Discontinued. Command-line JSON processor [Moved to: https://github.com/jqlang/jq] (by stedolan)

nushell (Rust)
Mentions: 212 | Stars: 29,864 | Activity: 9.9
A new type of shell

superset (TypeScript)
Mentions: 137 | Stars: 58,737 | Activity: 9.9
Apache Superset is a Data Visualization and Data Exploration Platform

jax (Python)
Mentions: 82 | Stars: 27,936 | Activity: 10.0
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

miller (Go)
Mentions: 63 | Stars: 8,553 | Activity: 9.1
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON

ydata-profiling (Python)
Mentions: 43 | Stars: 12,022 | Activity: 8.5
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

fuckitjs (JavaScript)
Mentions: 44 | Stars: 4,066 | Activity: 0.0
The Original Javascript Error Steamroller

Dask (Python)
Mentions: 32 | Stars: 11,982 | Activity: 9.7
Parallel computing with task scheduling

FuckIt.py (Python)
Mentions: 24 | Stars: 5,001 | Activity: 0.0
The Python error steamroller.

vnlog (Perl)
Mentions: 24 | Stars: 158 | Activity: 6.7
Process labelled tabular ASCII data using normal UNIX tools

cudf (C++)
Mentions: 23 | Stars: 7,274 | Activity: 9.9
cuDF - GPU DataFrame Library

Flux.jl (Julia)
Mentions: 22 | Stars: 4,391 | Activity: 8.7
Relax! Flux is the ML library that doesn't make you tensor

lightly (Python)
Mentions: 16 | Stars: 2,741 | Activity: 9.0
A python library for self-supervised learning on images.

datatable (C++)
Mentions: 9 | Stars: 1,788 | Activity: 6.1
A Python package for manipulating 2-dimensional tabular data structures

pyWhat (Python)
Mentions: 16 | Stars: 6,352 | Activity: 0.0
🐸 Identify anything. pyWhat easily lets you identify emails, IP addresses, and more. Feed it a .pcap file or some text and it'll tell you what it is! 🧙‍♀️

usaddress (Python)
Mentions: 5 | Stars: 1,488 | Activity: 0.0
A python library for parsing unstructured United States address strings into address components

nio (Nim)
Mentions: 7 | Stars: 32 | Activity: 6.3
Low Overhead Numerical/Native IO library & tools (by c-blake)

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better DataProfiler alternative or higher similarity.

DataProfiler reviews and mentions

Posts with mentions or reviews of DataProfiler. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-07.

LongRoPE: Extending LLM Context Window Beyond 2M Tokens
1 project | news.ycombinator.com | 22 Feb 2024

It's been possible to skip tokenization for a long time, my team and I did it here - https://github.com/capitalone/DataProfiler
For what it's worth, we were actually working with LSTMs with nearly a billion params back around 2016-2017. Transformers made it far more effective to train and execute, but ultimately LSTMs are able to achieve similar results, though they are slower and require more training data.

Data Profiler – What's in your data?
1 project | news.ycombinator.com | 8 Jun 2023

Data Profiler 0.9.0 -- offering a massive improvement to memory usage during profiling of large datasets
1 project | /r/LanguageTechnology | 7 Jun 2023
1 project | /r/coolgithubprojects | 7 Jun 2023
2 projects | /r/dataengineering | 7 Jun 2023

Great call out -- would you be willing to write up an issue for that on the repo? Thank you! https://github.com/capitalone/DataProfiler/issues/new/choose

1 project | /r/Python | 7 Jun 2023

FLiPN-FLaNK Stack Weekly for 20 March 2023
15 projects | dev.to | 19 Mar 2023

Release 0.8.3 · capitalone/DataProfiler
1 project | /r/LanguageTechnology | 14 Nov 2022
1 project | /r/opensource | 14 Nov 2022
1 project | /r/coolgithubprojects | 14 Nov 2022

Stats

Basic DataProfiler repo stats

Stars: 1,362
Activity: 6.3
Last Commit: 1 day ago

capitalone/DataProfiler is an open source project licensed under the Apache License 2.0, which is an OSI-approved license.

The primary programming language of DataProfiler is Python.
