toast
cpython
toast | cpython | |
---|---|---|
3 | 1 | |
43 | 5 | |
- | - | |
1.5 | 10.0 | |
9 days ago | over 5 years ago | |
C++ | Python | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
toast
-
How Many Lines of C It Takes to Execute a and B in Python?
I have a real life example in this commit: https://github.com/hpc4cmb/toast/pull/380/commits/a38d1d6dbc...
Replacing 2 lines of python code (with tens of glue code in Numba) with hundreds lines of C++ with glue code.
-
C++ is making me depressed / CUDA question
If you just want to do a matrix multiplication with CUDA (and not inside some CUDA code), you should use cuBLAS rather than CUTLASS (here is some wrapper code I wrote and the corresponding helper functions if your difficulty is using the library rather than linking it / building), it is a fairly straightforward BLAS replacement (it can be a pain to install but that is life with C++/nvidia).
- A new programming language for high-performance computers
cpython
-
How Many Lines of C It Takes to Execute a and B in Python?
A template repo can be found here https://github.com/JacksonKearl/cpython, but it does not implement an ideal malloc as I am not sure if it is still being used as an assignment.
The repo states that even this dummy implementation:
> has a 60% faster startup as compared to base CPython, and in some test cases has marginally better runtime performance as well.
What are some alternatives?
anydsl - Meta project to quickly build dependencies
kill-sticky - Bookmarklet to remove sticky elements and restore scrolling to web pages!
Rust-CUDA - Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.
duckduckgo-locales - Translation files for <a href="https://duckduckgo.com"> </a>
atl - A Tensor Language
rustc-hash - Custom hash algorithm used by rustc (plus hashmap/set aliases): fast, deterministic, not secure
nalgebra - Linear algebra library for Rust.
Halide - a language for fast, portable data-parallel computation
verified-scheduling
phobos-next - Various generic reusable D code.
exo - Exocompilation for productive programming of hardware accelerators
CUDA.jl - CUDA programming in Julia.