kernel_tuner
bitbar
Our great sponsors
kernel_tuner | bitbar | |
---|---|---|
4 | 50 | |
243 | 17,325 | |
9.9% | - | |
9.1 | 3.3 | |
4 days ago | 12 days ago | |
Python | Go | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
kernel_tuner
-
Ask HN: What apps have you created for your own use?
I've created Kernel Tuner (https://github.com/KernelTuner/kernel_tuner) as a small software development tool, because I was writing a lot of CUDA and OpenCL kernels at the time. I didn't want to manually figure out what best thread block dimensions and work division among threads were on every GPU over and over again.
The tool evolved quite a bit since the first versions. I'm also using it for testing GPU code, teaching, and it has become one of the main drivers behind a lot of the research that I do.
-
PhD'ers, what are you working on? What CS topics excite you?
We have an open science policy, so anyone can use our framework yourself to optimize stuff, if you want! The original paper is linked at the bottom of the GitHub page.
-
How to Optimize a CUDA Matmul Kernel for CuBLAS-Like Performance: A Worklog
This is a great post for people who are new to optimizing GPU code.
It is interesting to see that the author got this far without interchanging the innermost loop over k to the outermost loop, as is done in CUTLASS (https://github.com/NVIDIA/cutlass).
As you can see in this blog post the code ends up with a lot of compile-time constants (e.g. BLOCKSIZE, BM, BN, BK, TM, TN) one way to optimize this code further is to use an auto-tuner to find the optimal value for all of these parameters for your GPU and problem size, for example Kernel Tuner (https://github.com/KernelTuner/kernel_tuner)
- Kernel Tuner
bitbar
-
Home Lab Guide
While no broken out per plug, APC UPS network management cards provide total power output data (current, voltage, frequency, power) via SNMP, which you can log using a wide variety of tools.
And even without external tools, historical power usage logs are available via the APC Web UI.
While I don't currently log anything externally, I use an xbar[1] script[2] to display UPS output current in my Mac menu bar.
[1] https://xbarapp.com
[2] https://jasomill.at/apc-nmc-status.5s.sh
- Ask HN: What apps have you created for your own use?
-
Show current playing sample rate of DAC in MacOS top menu
5. Download and install xbar https://github.com/matryer/xbar
-
How can I have signal on the menu bar of macos
Maybe with https://xbarapp.com/ ?
-
Menu bar - different menu lists for different Macs
Maybe with xhttps://xbarapp.com/bar? You can also take a look here.
- Mac app to display JSON data in menu bar?
-
App that shows bin status in menu bar
Maybe you can use AnyBar https://github.com/tonsky/AnyBar/ or xbar https://xbarapp.com/
-
App LIST!!!
xbar (Free) xbar is free and open source it allows you to put anything in your macOS menu bar, although the learning cure is high, I would put this in development too..
- What are the not-so-obvious tools that you don't want to miss?
- My first idea that I want to write in Go
What are some alternatives?
halutmatmul - Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator
SwiftBar - Powerful macOS menu bar customization tool
pyopencl - OpenCL integration for Python, plus shiny features
Dozer - Hide menu bar icons on macOS
tf-quant-finance - High-performance TensorFlow library for quantitative finance.
skhd - Simple hotkey daemon for macOS
arrayfire-python - Python bindings for ArrayFire: A general purpose GPU library.
argos - Create GNOME Shell extensions in seconds
scikit-cuda - Python interface to GPU-powered libraries
SketchyBar - A highly customizable macOS status bar replacement
BlendLuxCore - Blender Integration for LuxCore
macOCR - Get any text on your screen into your clipboard.