Top 4 Scala Analytic Projects
The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCPProject mention: Announcing Hightouch Audiences: Enabling Marketers to Self-Serve their Data | dev.to | 2021-08-31
However Hightouch does not help with event collection: you can still use a CDP or solutions like Snowplow for that.
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads. (by delta-io)Project mention: SnowFlake vs DataBricks lakehouse or both together | reddit.com/r/datascience | 2021-08-31
There's also been huge strides in data lake tech, data lakes now support ACID transactions through delta, this brings cool stuff like rolling back through a transaction log. Whenever delta live tables (DLT) comes out of preview you can also use this to track your data lineage in your lake itself.
Scout APM: A developer's best friend. Try free for 14-days. Scout APM uses tracing logic that ties bottlenecks to source code so you know the exact line of code causing performance issues and can get back to building a great product faster.
Apache Kyuubi is a distributed multi-tenant JDBC server for large-scale data processing and analytics, built on top of Apache SparkProject mention: Release Kyuubi-v1.1.0 | reddit.com/r/apachespark | 2021-03-12
An encrypted data analytics platformProject mention: How to Run Spark SQL on Encrypted Data | dev.to | 2021-08-10
Introducing Opaque SQL, an open-source platform for securely running Spark SQL queries on encrypted data. Built by top systems and security researchers at UC Berkeley, the platform uses hardware enclaves to securely execute queries on private data in an untrusted environment.
What are some of the best open-source Analytic projects in Scala? This list will help you:
Are you hiring? Post a new remote job listing for free.