SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 C++ Python Projects
-
Project mention: TensorFlow-metal on Apple Mac is junk for training | news.ycombinator.com | 2024-01-16
-
Project mention: FlatBuffers – an efficient cross platform serialization library for many langs | news.ycombinator.com | 2023-09-18
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
PaddlePaddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
Click to Learn more...
-
-
Kodi Home Theater Software
Kodi is an award-winning free and open source home theater/media center software and entertainment hub for digital media. With its beautiful interface and powerful skinning engine, it's available for Android, BSD, Linux, macOS, iOS, tvOS and Windows.
Project mention: Favorites no longer showing since upgraded to Kodi 21.0 Beta with Confluence skin | /r/kodi | 2023-12-10 -
MMKV
An efficient, small mobile key-value storage framework developed by WeChat. Works on Android, iOS, macOS, Windows, and POSIX.
react-native-mmkv is a wrapper around MMKV that allows you to easily implement secure storage in your app. It is arguably the fastest key-value storage for React Native apps
-
LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Project mention: SIRUS.jl: Interpretable Machine Learning via Rule Extraction | /r/Julia | 2023-06-29SIRUS.jl is a pure Julia implementation of the SIRUS algorithm by Bénard et al. (2021). The algorithm is a rule-based machine learning model meaning that it is fully interpretable. The algorithm does this by firstly fitting a random forests and then converting this forest to rules. Furthermore, the algorithm is stable and achieves a predictive performance that is comparable to LightGBM, a state-of-the-art gradient boosting model created by Microsoft. Interpretability, stability, and predictive performance are described in more detail below.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
I'm investigating using C++ to build a REST server, and would love to know of people's experiences with Crow-- or whether they would recommend something else as a "medium-level" abstraction C++ web server. As background, I started off experimenting with Python/FastAPI, which is great, but there is too much friction to translate from pybind11-exported C++ objects to the format that FastAPI expects, and, of course, there are inherent performance limitations using Python, which could impact scaling up if the project were to be successful.
-
Microsoft also has similar courses on IoT, and Data Science. I found the IoT one really nice [0], and it covers a lot of ground.
-
-
Project mention: Modern Image Processing Algorithms Implementation in C | news.ycombinator.com | 2023-06-06
-
annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
The focus on the top 10 in vector search is a product of wanting to prove value over keyword search. Keyword search is going to miss some conceptual matches. You can try to work around that with tokenization and complex queries with all variations but it's not easy.
Vector search isn't all that new a concept. For example, the annoy library (https://github.com/spotify/annoy) has been around since 2014. It was one of the first open source approximate nearest neighbor libraries. Recommendations have always been a good use case for vector similarity.
Recommendations are a natural extension of search and transformers models made building the vectors for natural language possible. To prove the worth of vector search over keyword search, the focus was always on showing how the top N matches include results not possible with keyword search.
In 2023, there has been a shift towards acknowledging keyword search also has value and that a combination of vector + keyword search (aka hybrid search) operates in the sweet spot. Once again this is validated through the same benchmarks which focus on the top 10.
On top of all this, there is also the reality that the vector database space is very crowded and some want to use their performance benchmarks for marketing.
Disclaimer: I am the author of txtai (https://github.com/neuml/txtai), an open source embeddings database
-
DearPyGui
Dear PyGui: A fast and powerful Graphical User Interface Toolkit for Python with minimal dependencies
For native GUI, DearPyGui[0] as modern as you can.
For browser web-based GUI, you can use nicegui[1]
-
Project mention: Why is remote desktop slow when host monitor is off unless HDMI cable is used? | news.ycombinator.com | 2024-04-10
RDP as a regular or quick solution is actually really decent in this respect.
-
Project mention: Does anyone else agree that the links to the latest development version of Open3D don't work? | /r/cscareerquestions | 2023-07-10
I was going to file a bug about another issue, but I have to download the development version. This is why I want this solved quickly. None of the links seem to work: https://github.com/isl-org/Open3D/issues/6259
-
assimp
The official Open-Asset-Importer-Library Repository. Loads 40+ 3D-file-formats into one unified and clean data structure.
Project mention: The Asset-Importer-Lib Minor Release Version 5.3.0 is out | /r/GraphicsProgramming | 2023-09-26 -
perspective
A data visualization and analytics component, especially well-suited for large and/or streaming datasets.
Project mention: The Design Philosophy of Great Tables (Software Package) | news.ycombinator.com | 2024-04-04Why do you want to render to canvas?
Perspective seems to be the most performant html table. It is more focused on extremely fast updates than styling, although it looks good.
Glide is a newcomer that also renders to canvas.
-
esphome
ESPHome is a system to control your ESP8266/ESP32 by simple yet powerful configuration files and control them remotely through Home Automation systems.
You might want to take a look at https://esphome.io/ for an easy integration of an ESP32/8266 into home Assistant.
-
The interesting thing about Polars is that it does not try to be a drop-in replacement to pandas, like Dask, cuDF, or Modin, and instead has its own expressive API. Despite being a young project, it quickly got popular thanks to its easy installation process and its “lightning fast” performance.
-
Project mention: Wechsel von Windows auf Linux - zu viele Programme Windows-only? | /r/de_EDV | 2023-06-30
-
Project mention: Minimal implementation of Mamba, the new LLM architecture, in 1 file of PyTorch | news.ycombinator.com | 2023-12-20
>"everyone" seems to know Mamba. I never heard of Mamba
Only the "everybody who knows what mamba is" are the ones upvoting and commenting. Think of all the people who ignore it. For me, Mamba is the faster version of Conda [1], and that's why I clicked on the article.
-
Yet another TEDIOUS BATTLE: Python vs. C++/C stack.
This project gained popularity due to the HIGH DEMAND for running large models with 1B+ parameters, like `llama`. Python dominates the interface and training ecosystem, but prior to llama.cpp, non-ML professionals showed little interest in a fast C++ interface library. While existing solutions like tensorflow-serving [1] in C++ were sufficiently fast with GPU support, llama.cpp took the initiative to optimize for CPU and trim unnecessary code, essentially code-golfing and sacrificing some algorithm correctness for improved performance, which isn't favored by "ML research".
NOTE: In my opinion, a true pioneer was DarkNet, which implemented the YOLO model series and significantly outperformed others [2]. Same trick basically like llama.cpp
-
DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Another option is DALI https://github.com/NVIDIA/DALI For my project while training EfficientNet2, it was a game changer. But it a way harder to implement in code than TorchVision or Kornia.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
C++ Python related posts
- How moving from Pandas to Polars made me write better code without writing better code
- Bug in std:shared_mutex on Windows
- Ask HN: How do you find employment opportunities in 2024?
- Lessons from leetcode: 347 Top K Frequent Elements
- RapidFuzz: Rapid fuzzy string matching in Python
- AMD Funded a Drop-In CUDA Implementation Built on ROCm: It's Open-Source
- Why learn Data structures and Algorithm
-
A note from our sponsor - SaaSHub
www.saashub.com | 18 Apr 2024
Index
What are some of the best open-source Python projects in C++? This list will help you:
Project | Stars | |
---|---|---|
1 | tensorflow | 182,173 |
2 | FlatBuffers | 21,978 |
3 | PaddlePaddle | 21,570 |
4 | CNTK | 17,435 |
5 | Kodi Home Theater Software | 17,357 |
6 | MMKV | 16,791 |
7 | LightGBM | 16,025 |
8 | pybind11 | 14,708 |
9 | IoT-For-Beginners | 14,659 |
10 | codon | 13,809 |
11 | Dlib | 12,991 |
12 | annoy | 12,662 |
13 | DearPyGui | 12,184 |
14 | Sunshine | 12,150 |
15 | Open3D | 10,436 |
16 | assimp | 10,213 |
17 | perspective | 7,493 |
18 | esphome | 7,439 |
19 | cudf | 7,257 |
20 | albert | 7,062 |
21 | mamba | 6,219 |
22 | serving | 6,078 |
23 | DALI | 4,902 |