Top 3 Nim OpenMP Projects
- Arraymancer: A fast, ergonomic and portable tensor library in Nim with a deep learning focus, targeting CPU, GPU and embedded devices via OpenMP, CUDA and OpenCL backends
- weave: A state-of-the-art multithreading runtime, message-passing based, fast, scalable, ultra-low overhead (by mratsim)
- laser: The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers (by mratsim)
It is a small DSL written using macros at https://github.com/mratsim/Arraymancer/blob/master/src/array....
Nim has pretty great metaprogramming capabilities, and Arraymancer employs some cool features, like emitting CUDA kernels on the fly from standard templates depending on the backend!
Project mention: The GIL can now be disabled in Python's main branch | news.ycombinator.com | 2024-03-11
It depends.
You need 2-3 accumulators to saturate instruction-level parallelism in a parallel sum reduction. But the compiler won't generate them on its own: it only does so when the operation is associative, i.e. (a+b)+c = a+(b+c), which holds for integers but not for floats.
The escape hatch is -ffast-math.
I have extensive benches on this here: https://github.com/mratsim/laser/blob/master/benchmarks%2Ffp...
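To make the accumulator point concrete, here is a minimal C sketch (my own illustration, not code from the benchmarks linked above): the naive loop forms one serial dependency chain of float additions, while a manually unrolled version with two accumulators exposes two independent chains. Because the unrolling reorders float additions, a compiler will only perform this transformation itself under `-ffast-math` (or `-fassociative-math`).

```c
#include <stddef.h>

/* Naive sum: a single serial dependency chain, one add completes
 * before the next can start. */
float sum_naive(const float *x, size_t n) {
    float acc = 0.0f;
    for (size_t i = 0; i < n; i++)
        acc += x[i];
    return acc;
}

/* Two accumulators: the two chains have no dependency on each other,
 * so their adds can execute in parallel on separate execution ports.
 * This reorders the float additions, which changes rounding, hence
 * the compiler won't do it for floats without -ffast-math. */
float sum_2acc(const float *x, size_t n) {
    float acc0 = 0.0f, acc1 = 0.0f;
    size_t i = 0;
    for (; i + 1 < n; i += 2) {
        acc0 += x[i];
        acc1 += x[i + 1];
    }
    if (i < n)            /* handle a leftover element when n is odd */
        acc0 += x[i];
    return acc0 + acc1;   /* the final combine is where reassociation shows */
}
```

Both functions compute the same value for exactly representable inputs; on large arrays of arbitrary floats the results can differ slightly in the last bits, which is exactly why the transformation is gated behind `-ffast-math`.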
Index
What are some of the best open-source OpenMP projects in Nim? This list will help you:
| | Project | Stars |
|---|---|---|
| 1 | Arraymancer | 1,309 |
| 2 | weave | 524 |
| 3 | laser | 261 |