compile-time-perf
pcm
compile-time-perf | pcm | |
---|---|---|
3 | 4 | |
73 | 2,566 | |
- | 2.0% | |
0.0 | 9.6 | |
about 3 years ago | 4 days ago | |
CMake | C++ | |
MIT License | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
compile-time-perf
- jrmadsen/compile-time-perf -- High-level compilation overhead metrics
-
Make LLVM fast again
Looks like you've got your own setup with perf, time, etc. but might find this useful: https://github.com/jrmadsen/compile-time-perf
pcm
-
Zen 5's Leaked Slides
I don't know how often it's a problem, but I work for a company doing software video encoding, and we always fill up all the dimm slots on servers to have as much bandwidth as possible, even if we have only really use maybe 1/4 of the RAM.
I'm not sure any of the standard Linux tools can show you memory bandwidth usage easily (maybe perf), I know we use Intel PCM (https://github.com/intel/pcm) and AMDuProfPCM (https://www.amd.com/en/developer/uprof.html)
-
CoreFreq, a CPU monitoring software frequencies, ratios, C-states
The Intel PCM suite does what you seem to be describing w/o a kernel module, and it uses nicely abstracted MSR accesses so that its portable to any OS.
https://github.com/intel/pcm
-
How close are GPUs to utilizing PCIE gen 4?
Measure with pcm-pci / pcm-iio https://github.com/intel/pcm
-
Will you actually be able to utilize the connectivity offered with high-end AM5 boards?
For my purpose, I want 1. Enough, but not excessive lanes for GPU. For upcoming PCIe5 enabled GPUs (which may not even happen just yet), we don't know if they can saturate 16x. Would be nice if someone could run https://github.com/intel/pcm pcm-iio and measure. If it turns out that PCIe5 enabled GPUs aren't using more than 8x bandwidth, then X670E 8/8/4 mode makes sense, if you can saturate the remaining lanes with something else.
What are some alternatives?
timemory - Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
8-bit-computer-emulator - Emulator of my 8-bit-computer
ninja2wctr - Calculates Wall Clock Time Responsibility for each output from .ninja_log
openFPGALoader - Universal utility for programming FPGA
scouter - Scouter is an open source APM (Application Performance Management) tool.
c2clat - A tool to measure CPU core to core latency
omnitrace - Omnitrace: Application Profiling, Tracing, and Analysis
cr - cr.h: A Simple C Hot Reload Header-only Library
StaticTypeInfo - 🏀 Up your type-game. A small C++ library for compile-time type names and type indices.
optimus-manager-qt - An interface for Optimus Manager that allows to switch GPUs on Optimus laptops.
lighthouse - Automated auditing, performance metrics, and best practices for the web.
CoreFreq - CoreFreq : CPU monitoring and tuning software designed for 64-bit processors.