Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
RasgoQL Alternatives
Similar projects and alternatives to RasgoQL
-
fugue
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
-
pygwalker
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Data-Science-For-Beginners
10 Weeks, 20 Lessons, Data Science for All!
-
dbt-core
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
-
tempo
API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation (by databrickslabs)
-
-
ploomber
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
100-pandas-puzzles
100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
-
ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
-
feature-engineering-tutorials
Data Science Feature Engineering and Selection Tutorials
-
RasgoQL reviews and mentions
-
Dbt Vs python scripts
I built an open source package to bridge the gap between python and dbt, would love your feedback if you have a chance to check it out: https://github.com/rasgointelligence/RasgoQL
-
How to balance multiple time series data?
I’ve actually solved a similar problem several times in a variety of settings. I’ve had success with boosted trees and feature engineering on the sensor readings over time. I treat each reading as an observation and set the target to be the value I want to forecast (e.g. one hour ahead, the sum over the next day, the value at the same time the next day). There was a recent paper that compared boosted trees to deep learning techniques and found the boosted trees performed really well. Next, I perform feature engineering to aggregate the data up to the current time. These features will include the current value, lagged values over multiple observations for that sensor, more complicated features from moving statistics over different time scales, etc. I actually wrote a blog about creating these features using the open-source package RasgoQL and have similar types of features shared in the open-source repository here. I have also had success creating these sorts of historical features using the tsfresh package. Finally, when evaluating the forecast, use a time based split so earlier data is used to train the model and later data to evaluate the model.
-
[P] Open data transformations in Python, no SQL required
You can check it out here: https://github.com/rasgointelligence/RasgoQL
-
A note from our sponsor - InfluxDB
www.influxdata.com | 29 Mar 2024
Stats
rasgointelligence/RasgoQL is an open source project licensed under GNU Affero General Public License v3.0 which is an OSI approved license.
The primary programming language of RasgoQL is Jupyter Notebook.