timemory
Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework for creating tools: it is designed to provide a unifying interface for recording various performance measurements, alongside data logging and interfaces to other tools.
Anybody who's done it before on Windows could probably get a prototype ready in an hour or two, because it already supports not using fork: there's a timem-mpi executable built from the same source that uses MPI_Comm_spawn_multiple instead of fork, since OpenMPI will segfault when you fork inside a rank.
It would be nice if it could also calculate the "wall clock time responsibility" metric. It's a useful measure of how much a task actually contributes to the total build time, given how much of the work runs in parallel. I implemented it for ninja builds a while ago: https://github.com/martinus/ninja2wctr
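To make the metric concrete, here is a minimal sketch of one way it could be computed. It assumes the common definition, in which each slice of wall clock time is split evenly among the tasks running during that slice; that definition is an assumption here, not taken from ninja2wctr's source. The function name and the `(name, start, end)` input format are hypothetical.

```python
from collections import defaultdict


def wall_clock_time_responsibility(tasks):
    """Return {name: responsibility} for (name, start, end) tuples.

    Assumed definition: every interval of wall clock time is divided
    evenly among the tasks that are running during that interval.
    """
    tasks = list(tasks)
    # Every start or end time is a point where the set of running tasks may change.
    boundaries = sorted({t for _, start, end in tasks for t in (start, end)})
    responsibility = defaultdict(float)
    for left, right in zip(boundaries, boundaries[1:]):
        # Tasks that cover the whole slice [left, right].
        running = [name for name, start, end in tasks
                   if start <= left and end >= right]
        for name in running:
            responsibility[name] += (right - left) / len(running)
    return dict(responsibility)


# Hypothetical build: "a" runs for the whole build, "b" and "c" run back to back.
example = [("a", 0, 4), ("b", 0, 2), ("c", 2, 4)]
result = wall_clock_time_responsibility(example)
```

Under this definition the per-task values add up to the time during which at least one task was running, so a long task that overlaps with many others scores lower than an equally long task that runs alone.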