Apache Superset

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • superset

    Apache Superset is a Data Visualization and Data Exploration Platform

  • Had a very good experience with Superset.

    Superset allowed us to replace Tableau and not looking back

    Took me a while figure out how to embed it into my app using Superset Embedded SDK.

    Superset Embedded SDK - "Embedded SDK allows you to embed dashboards from Superset into your own app, using your app's authentication. Embedding is done by inserting an iframe, containing a Superset page, into the host application."

    https://github.com/apache/superset/tree/master/superset-embe...

    Superset is based on very high quality and well maintained chart library eChart

    https://echarts.apache.org/examples/en/#chart-type-linesG

    Community Roadmap

    https://github.com/apache/superset/projects?query=is%3Aopen

    Huge respect to Preset.io and its team for contributing to the project and keep it in a great shape

    https://preset.io/blog/

    Superset source code is very easy to read and understand, and as a result it's possible to implement some advanced caching techniques reduce the load on charts.

    No BI is perfect.

    Watching Superset for years gives me confidence the project will work as supposed down the road, and eventually some of its packages can be reusable for all kind of visualizations and data hacking.

  • evidence

    Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown

  • Full fledged BI tools like Superset and Metabase are amazing for their intended use cases.

    But they may be an overkill if your primary use case is to infrequently build semi-interactive reports for non-technical end-users and your use cases are are mostly covered by standard graphs & tables. Esp. so if you are familiar with SQL and have access to the underlying data source. Two nifty utilities I have found to be very useful for latter kind of use cases are SQLPage and Evidence.

    They make it very convenient to whip out some SQL and convert that to a neat professional looking web ui that can be forwarded to an end user. In case of Evidence it is a statically generated site, and in case of SQLPage it is a web app that connects to a live database.

    SQLPage: https://sql.ophir.dev/

    Evidence: https://evidence.dev

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • superset-operator

    Operator for Apache Superset for Stackable Data Platform

  • We've built a Kubernetes Operator for Apache Superset at Stackable: https://github.com/stackabletech/superset-operator/

    It's part of our Open Source Data Platform and it's one of the few open source BI tools out there and there are not a lot of alternatives in this space. We generally like it.

  • obsidian-dataview

    A data index and query language over Markdown files, for https://obsidian.md/.

  • https://github.com/blacksmithgu/obsidian-dataview

    This whole ideas to have data, visualisations and knowledge base in one private offline place is very appealing

  • openobserve

    πŸš€ 10x easier, πŸš€ 140x lower storage cost, πŸš€ high performance, πŸš€ petabyte scale - Elasticsearch/Splunk/Datadog alternative for πŸš€ (logs, metrics, traces, RUM, Error tracking, Session replay).

  • eCharts is awesome. We moved from plotly after using it for several months to echarts at https://github.com/openobserve/openobserve and are super happy.

  • seaborn

    Statistical data visualization in Python

  • If you are doing data analysis I don't think any of the 3 pieces of software you mentioned are going to be that helpful.

    I see these products as tools for data visualization and reporting i.e. presenting prepared datasets to users in a visually appealing way. They aren't as well suited for serious analytics.

    I can't comment on Superset or Tableau but I am familiar with Power BI (it has been rolled out across my org), the type of statistics you can do with it are fairly rudimentary. If you need to do any thing beyond summarizing (counts, averages, min, max etc). It is not particularly easy.

    For data analysis I use SAS or R. This software allows you do things like multivariate regression, timeseries forecasting, PCA, Cluster analysis etc. There is also plotting capability.

    Both these products are kind of old school, I've been using them since early 2000's, the "new school" seems to be Python. Pretty much all the recent data science people in my organization use Python. Particularly Pandas and libraries like Seaborn (https://seaborn.pydata.org/).

    The "power" users of Power BI in my organization tend to be finance/HR people for use cases like drill down into cost figures or Interactively presenting KPI's and other headline figures to management things like that.

  • cube.js

    πŸ“Š Cube β€” The Semantic Layer for Building Data Applications

  • We use https://cube.dev/ as intermediate layer between data warehouse database and Superset (and other "terminal" apps for BI like report generators). You define your schema (metrics, dimensions, joins, calculated metrics etc) in cube and then access them by any tool that can connect to SQL db

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • SQLpage

    SQL-only webapp builder, empowering data analysts to build websites and applications quickly

  • Full fledged BI tools like Superset and Metabase are amazing for their intended use cases.

    But they may be an overkill if your primary use case is to infrequently build semi-interactive reports for non-technical end-users and your use cases are are mostly covered by standard graphs & tables. Esp. so if you are familiar with SQL and have access to the underlying data source. Two nifty utilities I have found to be very useful for latter kind of use cases are SQLPage and Evidence.

    They make it very convenient to whip out some SQL and convert that to a neat professional looking web ui that can be forwarded to an end user. In case of Evidence it is a statically generated site, and in case of SQLPage it is a web app that connects to a live database.

    SQLPage: https://sql.ophir.dev/

    Evidence: https://evidence.dev

  • shillelagh

    Making it easy to query APIs via SQL

  • It should be possible (have not tried myself):

    https://preset.io/blog/accessing-apis-with-superset/

    "Shillelagh (ΚƒΙͺˈleΙͺlΙͺ) is a Python library and CLI that allows you to query many resources (APIs, files, in memory objects) using SQL. It's both user and developer friendly, making it trivial to access resources and easy to add support for new ones"

    https://github.com/betodealmeida/shillelagh

  • lightdash

    Self-serve BI to 10x your data team ⚑️

  • > YAML, pivoting being done in the frontend, no symmetric aggregates

    (one of the maintainers of Lightdash) You touched on some of our most interesting problems here! Would be especially interested to hear about what you liked / didn't like about symmetric aggregates in Looker and how you find dev with YAML. If you have an idea of how you'd like these to look in Lightdash, the team would be really open to making that a reality.

    For pivoting in the backend, this is coming! Issue here: https://github.com/lightdash/lightdash/issues/2907

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts