tensorflow-directml
f-stack
tensorflow-directml | f-stack | |
---|---|---|
5 | 3 | |
450 | 3,726 | |
0.0% | 0.6% | |
0.0 | 7.5 | |
over 1 year ago | 13 days ago | |
C++ | C | |
Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tensorflow-directml
-
Is the NVidia+MSFT+Olive thing just overblown hype?
and the fork of tensorflow is available here https://github.com/microsoft/tensorflow-directml
-
Share your AMD Vlad Automatic Optimizations
This might be outdated, but it was all I could find
- Is the Intel IRIS XE Graphics good for machine learning?
-
To all C++ professionals, can you state what field you're working in? Is it a niche?
Accelerating convolutional neural networks (ONNX and TensorFlow models) on GPU's (Nvidia/AMD/Intel/Qualcomm...). Since ML is pretty popular with dozens of frameworks out there all competing, it's not niche :b.
-
Is there any chance AMD have Tensorflow Equivalent?
There is also https://github.com/microsoft/tensorflow-directml which should work on AMD. But I haven't used it since forever so I'm not sure of its current state.
f-stack
-
Coroutine made DPDK dev easy
So, we try to use Photon coroutine lib to simplify the development of DPDK applications with the new concurrency model, and provide more functionalities, such as lock, timer and file I/O. First of all, we need to choose a userspace network protocol stack. After investigation, we have chosen Tencent's open source F-Stack project, which has ported the entire FreeBSD 11.0 network protocol stack on top of DPDK. It also has made some code cuts, providing a set of POSIX APIs, such as socket, epoll, kqueue, etc. Of course, its epoll is also simulated by kqueue, since it is essentially FreeBSD.
-
Production Twitter on One Machine: 100Gbps NICs and NVMe Are Fast
I agree most HTTP server benchmarks are highly misleading in that way, and mention in my post how disappointed I am at the lack of good benchmarks. I also agree that typical HTTP servers would fall over at much lower new connection loads.
I'm talking about a hypothetical HTTPS server that used optimized kernel-bypass networking. Here's a kernel-bypass HTTP server benchmarked doing 50k new connections per core second while re-using nginx code: https://github.com/F-Stack/f-stack. But I don't know of anyone who's done something similar with HTTPS support.
-
To all C++ professionals, can you state what field you're working in? Is it a niche?
Software for Internet Service Providers. The current project is based on DPDK, on top of it we use modified version of F-stack and then our application logic. There is some application logic "under" the F-stack too.
What are some alternatives?
ROCm - AMD ROCmâ„¢ Software - GitHub Home [Moved to: https://github.com/ROCm/ROCm]
PhotonLibOS - Probably the fastest coroutine lib in the world!
onnxruntime - ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
onnx - Open standard for machine learning interoperability
automatic - SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models
quant - QUIC implementation for POSIX and IoT platforms
twitterperf - Prototyping the performance of various components of a theoretical faster Twitter
ghz - Simple gRPC benchmarking and load testing tool