|1 day ago||7 days ago|
|MIT License||Apache License 2.0|
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
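As a rough illustration of how such a relative score could be computed, here is a minimal sketch. It is not the site's actual formula: the exponential decay, the 30-day half-life, the percentile mapping, and the function names `weighted_commit_score` and `activity` are all assumptions made for this example.

```python
from datetime import date


def weighted_commit_score(commit_dates, today, half_life_days=30.0):
    """Sum per-commit weights; a commit's weight halves every half_life_days.

    The decay shape and the half-life are assumptions for illustration only.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity(scores, name):
    """Map one project's score to a 0-10 scale by its rank among all projects.

    `scores` is a dict of project name -> weighted commit score (hypothetical).
    The result is 10 times the fraction of tracked projects scoring strictly lower.
    """
    target = scores[name]
    below = sum(1 for s in scores.values() if s < target)
    return 10.0 * below / len(scores)
```

Under these assumptions, a project that outscores 9 of 10 tracked projects gets an activity of 9.0, which matches the "top 10%" reading above.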
MDS Newsletter #12
1 project | reddit.com/r/ModernDataStack | 15 Dec 2021
2/ Featured tools this week - Transform and RudderStack
How To Event Stream From Your Gatsby Website Using Open Source RudderStack
3 projects | dev.to | 8 Dec 2021
RudderStack is an open-source Customer Data Pipeline that allows you to track and send real-time events from your web, mobile, and server-side sources to your entire customer data stack. Our primary repository - rudder-server - is open-sourced on GitHub.
Customer Data Pipelines Play a Key Role in Data Privacy
2 projects | dev.to | 1 Dec 2021
This post will explain how your customer data pipeline can help improve your data privacy and how to ensure your data privacy with RudderStack.
Open Source Analytics Stack: Bringing Control, Flexibility, and Data-Privacy to Your Analytics
15 projects | dev.to | 25 Nov 2021
However, limitations of traditional CDPs, especially around connecting to best-of-breed customer tooling and exposing data for use across an organization, have driven a new generation of non-CDPs. Solutions like Snowplow's (website, GitHub) data delivery platform and RudderStack's (website, GitHub) customer data platform for developers ingest data from a multitude of sources, apply in-stream transformations, and route data to your data warehouse, like Snowplow, or your warehouse plus your preferred customer tooling destinations for activation, like RudderStack.
RudderStack + Blendo: Better Together
1 project | dev.to | 25 Nov 2021
I learned many lessons from this journey - lessons that deserve a post of their own - but there's one lesson that I learned early on that stands out. In this blog, I talk about why we merged Blendo with RudderStack, building the team and working together to build a great product.
The Open Source Story - Open Sourcing RudderStack Blog and Docs
5 projects | dev.to | 18 Nov 2021
In fact, developers have already started contributing to our documentation. Recently, Benedikt from the Userlist team created the docs for the Userlist destination for RudderStack (see the pull request here). They also built the Userlist integration, submitted a pull request, and it is now live on our platform! This is the beauty of open source!
How to plan and implement a customer data tracking strategy for your Micro-SaaS
1 project | reddit.com/r/ShopifyAppDev | 16 Nov 2021
TLDR: general steps/starting point for setting up an app with RudderStack (or Segment) to track customer events
Developing a Custom Plugin using Flutter
5 projects | dev.to | 11 Nov 2021
As a part of our SDK roadmap at RudderStack, we wanted to develop a Flutter SDK. Our existing SDKs include features such as storing event details and persisting user details on the database, and much more. However, these features are already implemented in our Android and iOS SDKs.
Visualize Stripe Payments Data in Postgres using SQL
2 projects | dev.to | 2 Nov 2021
To load Stripe data into Postgres, you can use platforms such as Stitch Data and RudderStack. In this guide, we will use Stitch Data because it is a cheap and fast solution.
Dogfooding at RudderStack: Tracking Plans Part 1
1 project | dev.to | 21 Oct 2021
With your Tracking Plans in place, you can use the existing Data Governance APIs to evaluate your inbound events, payload samples, and metadata and compare them against your plans. You can also use the RudderTyper tool we're releasing alongside Tracking Plans. RudderTyper is a tool for generating strongly-typed RudderStack analytics library wrappers based on your published tracking plan specs, meaning your data will conform to your defined schema upon capture.
Visualization using Pyspark Dataframe
2 projects | reddit.com/r/dataengineering | 14 May 2022
Exactly! Use Spark only to prepare data for dashboards. Then you can use any visualisation tool, such as https://superset.apache.org/, which is free.
Apache Superset and Azure - multi-container application deployment
1 project | dev.to | 10 May 2022
I also like to do some data analysis on the side and recently ran across Apache Superset which describes itself as a "modern data exploration and data visualization platform". Coincidentally, Superset has a lot of Python code and can be deployed in containers (nine of them at current count!)
Stack for a small company
1 project | reddit.com/r/BusinessIntelligence | 19 Apr 2022
- Apache Superset / Preset: https://superset.apache.org (hosted version with free tier of 5 people at www.preset.io/product)
With subscription-based software and streaming services continually on the rise, will we see stronger support for FOSS and open source software?
3 projects | reddit.com/r/opensource | 18 Apr 2022
- Open source software projects that actually reward designers and product folks and include them in the development process: https://www.blender.org/ is my favorite example here! I'll also shamelessly plug the open source project I'm involved with (https://superset.apache.org/).
Building dashboards over a semantic layer with Superset and Cube
3 projects | dev.to | 14 Apr 2022
Within minutes, you can find conversations on Superset’s Slack or GitHub revolving around identifying broken dashboards and finding ways to reduce the impact on users.
Apache Superset vs Tableau
1 project | reddit.com/r/datascience | 30 Mar 2022
Supercharging Apache Superset Getting Started with Superset
python3-devel and python3-wheel without OS repos.
1 project | reddit.com/r/learnpython | 11 Mar 2022
I'm trying to install Apache Superset "from scratch" on AlmaLinux, but it requires a newer version of Python than is in the repos.
Recommended BI / Dashboard apps cheaper than Looker/Tableau/etc.
1 project | reddit.com/r/BusinessIntelligence | 8 Mar 2022
- Apache Superset: https://superset.apache.org/ (open source), https://preset.io/pricing/ (cloud hosting for $20 per user per month or less)
Reference Data Stack for Data-Driven Startups
8 projects | dev.to | 3 Mar 2022
To analyze the data stored in Snowflake and Postgres, we use Metabase. We chose Metabase because of its open source offering and easy-to-use interface. Other open source tools like Lightdash and Superset exist, which we may add to the stack as our data team grows.
Analytics Stacks for Startups
8 projects | dev.to | 21 Feb 2022
(Dashboards in Metabase and in Apache Superset)
What are some alternatives?
jupyter-dash - Develop Dash apps in the Jupyter Notebook and JupyterLab
streamlit - Streamlit — The fastest way to build data apps in Python
Metabase - The simplest, fastest way to get business intelligence and analytics to everyone in your company
react-admin - A frontend Framework for building B2B applications running in the browser on top of REST/GraphQL APIs, using ES6, React and Material Design
django-project-template - The Django project template I use, for installation with django-admin.
Baserow - Baserow is an open source online database tool and Airtable alternative. Create your own database without technical experience. Our user friendly no-code tool gives you the powers of a developer without leaving your browser.
lightdash - An open source alternative to Looker built using dbt. Made for analysts ❤️
Apache Hive - Apache Hive
airbyte - Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
dagster - An orchestration platform for the development, production, and observation of data assets.
appsmith - Low code project to build admin panels, internal tools, and dashboards. Integrates with 15+ databases and any API.
nifi - Apache NiFi