seatunnel
skywalking
seatunnel | skywalking | |
---|---|---|
31 | 25 | |
7,524 | 23,410 | |
4.0% | 0.6% | |
9.8 | 9.5 | |
1 day ago | 3 days ago | |
Java | Java | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
seatunnel
- SeaTunnel – super high-performance, distributed data integration tool
- Apache SeaTunnel: Next-generation high-performance, distributed integration tool
- FLaNK Weekly 31 December 2023
-
Five Apache projects you probably didn't know about
Apache SeaTunnel is a data integration platform that offers the three pillars of data pipelines: sources, transforms, and sinks. It offers an abstract API over three possible engines: the Zeta engine from SeaTunnel or a wrapper around Apache Spark or Apache Flink. Be careful, as each engine comes with its own set of features.
-
SymmetricDS: Open-Source, cross platform database replication software
looks that way. there is an other project that does similar things Apache SeaTunnel: https://seatunnel.apache.org/
- Breakthrough in the book search field! Use Apache SeaTunnel to improve the efficiency of book title similarity search
-
Questions Regarding design DW
https://seatunnel.apache.org/ Might be an overkill though...
-
SeaTunnel Zeta engine, the first choice for massive data synchronization, is officially released!
See the specific Change log: https://github.com/apache/incubator-seatunnel/releases/tag/2.3.0
-
The Ultimate Beginner’s Guide to Open Source Contribution
Apache SeaTunnel (Incubating) SeaTunnel is a very easy-to-use ultra-high-performance distributed data integration platform that supports real-time synchronization of massive data. It can synchronize tens of billions of data stably and efficiently every day, and has been used in the production of nearly 100 companies. Official website https://seatunnel.apache.org/ GitHub projects https://github.com/apache/incubator-seatunnel
- Major Release! SeaTunnel 2.3.0-beta supports the self-innovate SeaTunnel Engine and more connectors!
skywalking
-
Release Radar · May 2024 Edition: Major updates from the open source community
Are you working on microservices, cloud native, and container-based architectures? Then you need to check out Apache Skywalking. It's an Application Performance Monitoring (APM) system, that provides monitoring, tracing, and diagnosing capabilities for distributed systems in Cloud Native architectures. This latest update has hundreds of changes including support for Java 21 runtime, new functions and parameters, the addition of Golang as a supported language for AMQP, Kafka, RocketMQ, and Pulsar, support for multiple labels in metrics, and tonnes more. Check out all changes in the very comprehensive release notes. All the Apache Skywalking metrics are available via Grafana.
- Show HN: OneUptime – open-source Datadog Alternative
-
Enhancing API Observability Series (Part 3): Tracing
When choosing distributed tracing tools, considerations include your technology stack, business requirements, and monitoring complexity. Zipkin, SkyWalking, and OpenTelemetry are popular distributed tracing solutions, each with its unique features.
-
Five Apache projects you probably didn't know about
Apache SkyWalking is an APM tool, focusing on microservices, Cloud Native apps, and Kuernetes architectures. It builds its architecture on four kinds of components:
- Show HN: Monitor your webapp with minimal setup
-
It's time to let go, Apache Software Foundation
Trying to play devil's advocate here.
> It needs at least a stable set of users, but maintaining a set of users is essentially managing the set of people onboarding and the set of people migrating off.
I could say that I don't care very much about how much users a piece of software has, only that it has enough information on how to use it and enough maintainers to patch any security vulnerabilities and do occasional releases with updated dependencies, as well as address any serious issues or bugs.
For example, Apache Skywalking is an APM solution that most people haven't even heard of (in contrast to something like Sentry), yet it fits those qualities and I see few to no issues with it: https://skywalking.apache.org/
> If you're shrinking then a competitor is providing better options, or your problem space has shifted.
Again, as a user, I might not care that Sentry or another piece of software is better in any number of ways than Apache Skywalking. Similarly, I might not care that something like PostgreSQL is more correct or has a large market share (at least on HN) in comparison to something like MariaDB/MySQL.
If a piece of software meets the needs of my project and won't effectively rot with time, then it's quite possibly good enough as it is, even if it's not the market leader. For my small project's APM needs Apache Skywalking is enough. For my CRUD database needs, something like MariaDB/MySQL will be okay until the time Sun burns out (or PostgreSQL if I'm feeling fancy, but even that's not one of the modern and hip solutions).
Ergo, those better options only become relevant once they're closer to being must haves than nice to haves. Same as how Docker Swarm might be enough for many, even if Kubernetes basically won in the "container wars" and has a way more active community. Swarm will only stop being an option for me once it hits EOL, at least for certain projects where simplicity is appreciated.
Then again, a counterpoint to my own argument here could be the story of LibreOffice and OpenOffice, where the latter was basically donated (instead of the rights to the name being given to the folks behind LibreOffice) and is now in decline while LibreOffice is flourishing - but at the same time they were so close to one another feature wise, that maybe it's not a good point, same as with Gogs and Gitea.
-
JDK 21 Release Notes
> Where's Java primarily used these days?
I've seen a lot of enterprise-y webdev projects use it for back end stuff (Dropwizard, Spring Boot, Vert.X, Quarkus) and in rare cases even front end (like Vaadin or JSF/PrimeFaces). The IDEs are pretty great, especially the ones by JetBrains, the tooling is pretty mature and boring, the performance is really good (memory usage aside) and the language itself is... okay.
Curiously, I wanted to run my own server for OIDC/OAuth2 authn/authz and to have common features like registration, password resets and social login available to me out of the box, for which I chose Keycloak: https://www.keycloak.org/
Surprise surprise, it's running Java under the hood. I wanted to integrate some of my services with their admin API, seems like the Java library is also updated pretty frequently: https://mvnrepository.com/artifact/org.keycloak/keycloak-adm... whereas ones I found for .NET feel like they're stagnating more: https://www.nuget.org/packages?q=keycloak (probably not a dealbreaker, though)
Then, I wanted to run an APM stack with Apache Skywalking (simpler to self-host than Sentry), which also turns out to be a Java app under the hood: https://skywalking.apache.org/
Also you occasionally see like bank auth libraries or e-signing libraries be offered in Java as well first and foremost, at least in my country (maybe PHP sometimes): https://www.eparaksts.lv/en/for_developers/Java_libraries and their app for getting certificates from the government issued eID cards also runs off of Java.
So while Java isn't exactly "hot" tech, it's used all over the place: even in some game engines, like jMonkeyEngine, or in infrastructure code where something like Go might actually be more comfortable to use.
-
OpenTelemetry in 2023
> What should people use?
I recall Apache Skywalking being pretty good, especially for smaller/medium scale projects: https://skywalking.apache.org/
The architecture is simple, the performance is adequate, it doesn't make you spend days configuring it and it even supports various different data stores: https://skywalking.apache.org/docs/main/v9.0.0/en/setup/back...
The problems with it are that it isn't super popular (although has agents for most popular stacks), the docs could be slightly better and I recall them also working on a new UI so there is a little bit of churn: https://skywalking.apache.org/downloads/
Still better versus some of the other options when you need something that just works instead of spending a lot of time configuring something (even when that something might be superior in regards to the features): https://github.com/getsentry/self-hosted/blob/master/docker-...
Sentry is just the first thing that comes to mind (OpenTelemetry also isn't simpler due to how much it tries to do), but compare its complexity to Skywalking: https://github.com/apache/skywalking/blob/master/docker/dock...
I wish there was more self-hosted software like that out there, enough to address certain concerns in a simple way on day 1 and leave branching out to more complex options like OpenTelemetry once you have a separate team for that and the cash is rolling in.
- Apache Skywalking Application performance monitor tool for distributed systems
- Improving Observability of Go Services
What are some alternatives?
airbyte - The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
prometheus - The Prometheus monitoring system and time series database.
kestra - Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
jaeger - CNCF Jaeger, a Distributed Tracing Platform
Leetcode - Solutions to LeetCode problems; updated daily. Subscribe to my YouTube channel for more.
signoz - SigNoz is an open-source observability platform native to OpenTelemetry with logs, traces and metrics in a single application. An open-source alternative to DataDog, NewRelic, etc. 🔥 🖥. 👉 Open source Application Performance Monitoring (APM) & Observability tool
hudi - Upserts, Deletes And Incremental Processing on Big Data.
Pinpoint - APM, (Application Performance Management) tool for large-scale distributed systems.
com.openai.unity - A Non-Official OpenAI Rest Client for Unity (UPM)
zipkin - Zipkin is a distributed tracing system
Apache Hive - Apache Hive
Grafana - The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.