Top 7 C++ Distributed System Projects
-
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
By the way, most of the time XGBoost works just as well for projects, would not recommend applying deep learning to every single problem you come across, it's something Stanford CS really likes to showcase when it's well known (1) that sometimes "smaller"/less complex models can perform just as well or have their own interpretive advantages and (2) it is well known within ML and DS communities that deep learning does not perform as well with tabular datasets and using deep learning as a default to every problem is just poor practice. However, if you do (god forbid) get language, speech/audio, vision/imaging, or even time series models then deep learning as a baseline is not the worst idea.
-
-
Scout APM
Less time debugging, more time building. Scout APM allows you to find and fix performance issues with no hassle. Now with error monitoring and external services monitoring, Scout is a developer's best friend when it comes to application development.
-
service-fabric
Service Fabric is a distributed systems platform for packaging, deploying, and managing stateless and stateful distributed applications and containers at large scale.
Proprietary in what way?
-
Project mention: cloud storage "merged" on multiple VPSes | reddit.com/r/linuxquestions | 2021-08-24
Have a look at https://github.com/lizardfs/lizardfs perhaps is what you want
-
Project mention: [P] Bridging Dask and Tensorflow for distributed machine learniing with Vineyard | reddit.com/r/MachineLearning | 2021-09-08
We propose vineyard, https://github.com/v6d-io/v6d to address such challenges, which, provides efficient zero-copy data sharing between different compute engines, without extra cost of copying and serialization, compared other similar solutions.
-
Project mention: Show HN: Turn any data into a fast analytical API | news.ycombinator.com | 2022-04-10
we use our in-house baked engine - open sourced here https://github.com/varchar-io/nebula
Yeah, Tinybird has lots of similarities, I will do more research on it, thanks for the reference.
-
-
SonarLint
Deliver Cleaner and Safer Code - Right in Your IDE of Choice!. SonarLint is a free and open source IDE extension that identifies and catches bugs and vulnerabilities as you code, directly in the IDE. Install from your favorite IDE marketplace today.
C++ Distributed Systems related posts
- CS Internship Questions
- Any SW recommendation to index any kind of file in a External Drive?
- The European Central Bank says it will begin regulating crypto-coins, from the point of view that they are largely scams and Ponzi schemes.
- Show HN: Turn any data into a fast analytical API
- Blockchain Database: Meet BigchainDB: A Complete Guide
- Show HN: Visualize your streaming data in real-time
- OOM with ML Models (SKlearn, XGBoost, etc), workaround/tips for large datasets?
Index
What are some of the best open-source Distributed System projects in C++? This list will help you:
Project | Stars | |
---|---|---|
1 | xgboost | 22,631 |
2 | pixie | 3,318 |
3 | service-fabric | 2,905 |
4 | lizardfs | 861 |
5 | v6d | 583 |
6 | nebula | 119 |
7 | zef | 8 |
Are you hiring? Post a new remote job listing for free.