array
async-profiler
Our great sponsors
array | async-profiler | |
---|---|---|
4 | 8 | |
188 | 5,883 | |
- | - | |
6.9 | 8.4 | |
4 months ago | about 1 year ago | |
C++ | C++ | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
array
-
Benchmarking 20 programming languages on N-queens and matrix multiplication
I should have mentioned somewhere, I disabled threading for OpenBLAS, so it is comparing one thread to one thread. Parallelism would be easy to add, but I tend to want the thread parallelism outside code like this anyways.
As for the inner loop not being well optimized... the disassembly looks like the same basic thing as OpenBLAS. There's disassembly in the comments of that file to show what code it generates, I'd love to know what you think is lacking! The only difference between the one I linked and this is prefetching and outer loop ordering: https://github.com/dsharlet/array/blob/master/examples/linea...
-
A basic introduction to NumPy's einsum
If you are looking for something like this in C++, here's my attempt at implementing it: https://github.com/dsharlet/array#einstein-reductions
It doesn't do any automatic optimization of the loops like some of the projects linked in this thread, but, it provides all the tools needed for humans to express the code in a way that a good compiler can turn it into really good code.
async-profiler
-
Migrating a Spring Boot application to Quarkus
Using the Async Profiler we were able to build flamegraphs for the first and second queries to picture the differences in path length of the two transactions execution.
-
Making Code Faster
> The other two languages I’ve used mostly in recent decades are Java and Ruby and the profiler situation is for both those languages is kind of shitty. I had to pay real money to get the Java profiler I used at AWS and while it worked, it was klunky, not fun to use.
These days, async profiler (https://github.com/jvm-profiling-tools/async-profiler) is much better than the Go tooling for performance. It is a joy to use and features a top-like view for the hottest methods. It works for locks, allocations and CPU time. It also integrates with JMH.
-
Why would a Java prime sieve run at only half its speed _some_ of the times?
Also, running it under a profiler (I recommend async-profiler[1]) should give you a good idea of where the slowdown occurs which might help you pin it down further.
-
Is there a way to know if my java game is slow on other computers?
Profile it. async-profiler is really great. Alternatively you can check out VisualVM/JProfiler/YourKit
- Best performance monitoring tools?
-
Rust Option 30x more efficient to return than Java Optional
async-profiler is really great at analyzing allocations, give it a shot!
-
Ask Java: what are some JFR-based tools that you enjoy?
JFR to Flame Graph Converter
-
Utility script for generating flamegraphs from JFR logs without dependencies.
Async Profiler converter tool does support JFR to Flame Graph, JFR to FlameScope, collapsed stacks to Flame Graph -https://github.com/jvm-profiling-tools/async-profiler#download
What are some alternatives?
optimizing-the-memory-layout-of-std-tuple - Optimizing the memory layout of std::tuple
container-jfr - Secure JDK Flight Recorder management for containerized JVMs
NumPy - The fundamental package for scientific computing with Python.
junit-jfr - a JUnit 5 extension that generates JFR events
cadabra2 - A field-theory motivated approach to computer algebra.
jmh - https://openjdk.org/projects/code-tools/jmh
alphafold2 - To eventually become an unofficial Pytorch implementation / replication of Alphafold2, as details of the architecture get released
jfr-libraries - a list of libraries that generate JFR events
Einsum.jl - Einstein summation notation in Julia
Arthas - Alibaba Java Diagnostic Tool Arthas/Alibaba Java诊断利器Arthas
c-examples - Example C code
opentelemetry-java-instrumentation - OpenTelemetry auto-instrumentation and instrumentation libraries for Java