SaaSHub helps you find the best software and product alternatives Learn more →
DataProfiler Alternatives
Similar projects and alternatives to DataProfiler
-
Tailwind CSS
A utility-first CSS framework for rapid UI development.
-
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
-
jq
Discontinued Command-line JSON processor [Moved to: https://github.com/jqlang/jq] (by stedolan)
-
-
superset
Apache Superset is a Data Visualization and Data Exploration Platform
-
jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
-
ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
-
-
-
-
-
-
-
-
datatable
A Python package for manipulating 2-dimensional tabular data structures
-
pyWhat
🐸 Identify anything. pyWhat easily lets you identify emails, IP addresses, and more. Feed it a .pcap file or some text and it'll tell you what it is! 🧙♀️
-
usaddress
:us: a python library for parsing unstructured United States address strings into address components
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
DataProfiler reviews and mentions
-
LongRoPE: Extending LLM Context Window Beyond 2M Tokens
It's been possible to skip tokenization for a long time, my team and I did it here - https://github.com/capitalone/DataProfiler
For what it's worth, we actually were working with LSTMs with nearly a billion params back in 2016-2017 area. Transformers made it far more effective to train and execute, but ultimately LSTMs are able to achieve similar results, though slow & require more training data.
- Data Profiler – What's in your data?
-
Data Profiler 0.9.0 -- offering a massive improvement to memory usage during profiling of large datasets
Great call out -- would you be willing to write up an issue for that on the repo? Thank you! https://github.com/capitalone/DataProfiler/issues/new/choose
- FLiPN-FLaNK Stack Weekly for 20 March 2023
- Release 0.8.3 · capitalone/DataProfiler
-
A note from our sponsor - SaaSHub
www.saashub.com | 18 Apr 2024
Stats
capitalone/DataProfiler is an open source project licensed under Apache License 2.0 which is an OSI approved license.
The primary programming language of DataProfiler is Python.