LoopVectorization.jl VS nogil

Compare LoopVectorization.jl vs nogil and see what are their differences.

nogil

Multithreaded Python without the GIL (by colesbury)
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
LoopVectorization.jl nogil
10 31
722 2,854
0.6% -
7.0 5.7
5 days ago 2 months ago
Julia Python
MIT License GNU General Public License v3.0 or later
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

LoopVectorization.jl

Posts with mentions or reviews of LoopVectorization.jl. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-02.
  • Mojo – a new programming language for all AI developers
    7 projects | news.ycombinator.com | 2 May 2023
    It is a little disappointing that they're setting the bar against vanilla Python in their comparisons. While I'm sure they have put massive engineering effort into their ML compiler, the demos they showed of matmul are not that impressive in an absolute sense; with the analogous Julia code, making use of [LoopVectorization.jl](https://github.com/JuliaSIMD/LoopVectorization.jl) to automatically choose good defaults for vectorization, etc...

    ```

  • Knight’s Landing: Atom with AVX-512
    1 project | news.ycombinator.com | 10 Dec 2022
  • Python 3.11 is 25% faster than 3.10 on average
    13 projects | news.ycombinator.com | 6 Jul 2022
    > My mistake in retrospect was using small arrays as part of a struct, which being immutable got replaced at each time step with a new struct requiring new arrays to be allocated and initialized. I would not have done that in c++, but julia puts my brain in matlab mode.

    I see. Yes, it's an interesting design space where Julia makes both heap and stack allocations easy enough, so sometimes you just reach for the heap like in MATLAB mode. Hopefully Prem and Shuhei's work lands soon enough to stack allocate small non-escaping arrays so that user's done need to think about this.

    > Alignment I'd assumed, but padding the struct instead of the tuple did nothing, so probably extra work to clear a piece of an simd load. Any insight on why avx availability didn't help would be appreciated. I did verify some avx instructions were in the asm it generated, so it knew, it just didn't use.

    The major differences at this point seem to come down to GCC (g++) vs LLVM and proofs of aliasing. LLVM's auto-vectorizer isn't that great, and it seems to be able to prove 2 arrays are not aliasing less reliably. For the first part, some people have just improved the loop analysis code from the Julia side (https://github.com/JuliaSIMD/LoopVectorization.jl), forcing SIMD onto LLVM can help it make the right choices. But for the second part you do need to do `@simd ivdep for ...` (or use LoopVectorization.jl) to match some C++ examples. This is hopefully one of the things that the JET.jl and other new analysis passes can help with, along with the new effects system (see https://github.com/JuliaLang/julia/pull/43852, this is a pretty huge new compiler feature in v1.8, but right now it's manually specified and will take time before things like https://github.com/JuliaLang/julia/pull/44822 land and start to make it more pervasive). When that's all together, LLVM will have more ammo for proving things more effectively (pun intended).

  • Vectorize function calls
    2 projects | /r/Julia | 25 Apr 2022
    This looks nice too. Seems to be maintained and it even has a vmap-function. What more can one ask for ;) https://github.com/JuliaSIMD/LoopVectorization.jl
  • Implementing dedispersion in Julia.
    4 projects | /r/Julia | 16 Mar 2022
    Have you checked out https://github.com/JuliaSIMD/LoopVectorization.jl ? It may be useful for your specific use case
  • We Use Julia, 10 Years Later
    10 projects | news.ycombinator.com | 14 Feb 2022
    And the "how" behind Octavian.jl is basically LoopVectorization.jl [1], which helps make optimal use of your CPU's SIMD instructions.

    Currently there can some nontrivial compilation latency with this approach, but since LV ultimately emits custom LLVM it's actually perfectly compatible with StaticCompiler.jl [2] following Mason's rewrite, so stay tuned on that front.

    [1] https://github.com/JuliaSIMD/LoopVectorization.jl

    [2] https://github.com/tshort/StaticCompiler.jl

  • Why Lisp? (2015)
    21 projects | news.ycombinator.com | 26 Oct 2021
    Yes, and sorry if I also came off as combative here, it was not my intention either. I've used some Common Lisp before I got into Julia (though I never got super proficient with it) and I think it's an excellent language and it's too bad it doesn't get more attention.

    I just wanted to share what I think is cool about julia from a metaprogramming point of view, which I think is actually its greatest strength.

    > here is a hypothetical question that can be asked: would a julia programmer be more powerful if llvm was written in julia? i think the answer is clear that they would be

    Sure, I'd agree it'd be great if LLVM was written in julia. However, I also don't think it's a very high priority because there are all sorts of ways to basically slap LLVM's hands out of the way and say "no, I'll just do this part myself."

    E.g. consider LoopVectorization.jl [1] which is doing some very advanced program transformations that would normally be done at the LLVM (or lower) level. This package is written in pure Julia and is all about bypassing LLVM's pipelines and creating hyper efficient microkernels that are competitive with the handwritten assembly in BLAS systems.

    To your point, yes Chris' life likely would have been easier here if LLVM was written in julia, but also he managed to create this with a lot less man-power in a lot less time than anything like it that I know of, and it's screaming fast so I don't think it was such a huge impediment for him that LLVM wasn't implemented in julia.

    [1] https://github.com/JuliaSIMD/LoopVectorization.jl

  • A Project of One’s Own
    2 projects | news.ycombinator.com | 8 Jun 2021
    He still holds a few land speed records he set with motorcycles he designed and built.

    But I had no real hobbies or passions of my own, other than playing card games.

    It wasn't until my twenties, after I already graduated college with degrees I wasn't interested in and my dad's health failed, that I first tried programming. A decade earlier, my dad was attending the local Linux meetings when away from his machine shop.

    Programming, and especially performance optimization/loop vectorization are now my passion and consume most of my free time (https://github.com/JuliaSIMD/LoopVectorization.jl).

    Hearing all the stories about people starting and getting hooked when they were 11 makes me feel like I lost a dozen years of my life. I had every opportunity, but just didn't take them. If I had children, I would worry for them.

  • When porting numpy code to Julia, is it worth it to keep the code vectorized?
    1 project | /r/Julia | 7 Jun 2021
    Julia will often do SIMD under the hood with either a for loop or a broadcasted version, so you generally shouldn't have to worry about it. But for more advanced cases you can look at https://github.com/JuliaSIMD/LoopVectorization.jl
  • Julia 1.6 Highlights
    9 projects | news.ycombinator.com | 25 Mar 2021
    Very often benchmarks include compilation time of julia, which might be slow. Sometimes they rightfully do so, but often it's really apples and oranges when benchmarking vs C/C++/Rust/Fortran. Julia 1.6 shows compilation time in the `@time f()` macro, but Julia programmers typically use @btime from the BenchmarkTools package to get better timings (e.g. median runtime over n function calls).

    I think it's more interesting to see what people do with the language instead of focusing on microbenchmarks. There's for instance this great package https://github.com/JuliaSIMD/LoopVectorization.jl which exports a simple macro `@avx` which you can stick to loops to vectorize them in ways better than the compiler (=LLVM). It's quite remarkable you can implement this in the language as a package as opposed to having LLVM improve or the julia compiler team figure this out.

    See the docs which kinda read like blog posts: https://juliasimd.github.io/LoopVectorization.jl/stable/

nogil

Posts with mentions or reviews of nogil. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-15.
  • Proof-of-Concept Multithreaded Python Without the GIL
    1 project | news.ycombinator.com | 2 Feb 2024
  • Our Plan for Python 3.13
    10 projects | news.ycombinator.com | 15 Jun 2023
    This might be a dumb question, but why would removing the GIL break FFI? Is it just that existing no-GIL implementations/proposals have discarded/ignored it, or is there a fundamental requirement, e.g. C programs unavoidably interact directly with the GIL? I know that the C-API is only stable between minor releases [0] compiled in the same manner [1], so it's not like the ecosystem is dependent upon it never changing.

    I cannot seem to find much discussion about this. I have found a no-GIL interpreter that works with numpy, scikit, etc. [2][3] so it doesn't seem to be a hard limit. (That said, it was not stated if that particular no-GIL implementation requires specially built versions of C-API libs or if it's a drop-in replacement.)

    [0]: https://docs.python.org/3/c-api/stable.html#c-api-stability

    [1]: https://docs.python.org/3/c-api/stable.html#platform-conside...

    [2]: https://github.com/colesbury/nogil

    [3]: https://discuss.python.org/t/pep-703-making-the-global-inter...

  • Real Multithreading Is Coming to Python
    3 projects | news.ycombinator.com | 15 May 2023
    https://github.com/colesbury/nogil does manage to get rid of the GIL, but it's not certain to make it into Python core. The main problem is the amount of existing libraries that depend on the existence of the GIL without realizing it - breaking those would be extremely disruptive.
  • [D] The hype around Mojo lang
    2 projects | /r/MachineLearning | 5 May 2023
    CPython is also investigating the removal of the GIL (PEP703, nogil). I think requiring the GIL is a wider thing that libraries will need to address anyway. But also, for the same reason as above I'd be surprised if the Modular team thought that saying "you can run all your python code unchanged" was a good idea if there was a secret "except for code that uses numpy" muttered under the breath.
  • PEP 684 was accepted – Per-interpreter GIL in Python 3.12
    2 projects | news.ycombinator.com | 8 Apr 2023
  • PEP 703 – Making the Global Interpreter Lock Optional in CPython
    1 project | /r/Python | 10 Jan 2023
  • Python 3.11.0 final is now available
    11 projects | news.ycombinator.com | 25 Oct 2022
    I'm worried about the speedup

    My understanding is that it's based on the most recent attempt to remove the GIL by Sam Gross

    https://github.com/colesbury/nogil

    In addition to some ways to try to not have nogil have as much overhead he added a lot of unrelated speed improvements so that python without the gil would still be faster not slower in single thread mode. They seem to have merged those performance patches first that means if they add his Gil removal patches in say python 3.12 it will still be substantially slower then 3.11 although faster then 3.10. I hope that doesn't stop them from removing the gil (at least by default)

  • Removed the GIL back in 1996 from Python 1.4, primarily to create a re-entrant Python interpreter.
    1 project | /r/programming | 21 Sep 2022
  • I Tried Removing Python's GIL Back in 1996
    1 project | news.ycombinator.com | 19 Sep 2022
  • Faster CPython 3.12 Plan
    5 projects | news.ycombinator.com | 19 Sep 2022
    Looks like it's still active to me:

    https://github.com/colesbury/nogil/

What are some alternatives?

When comparing LoopVectorization.jl and nogil you can also consider the following projects:

CUDA.jl - CUDA programming in Julia.

hpy - HPy: a better API for Python

julia - The Julia Programming Language

mypyc - Compile type annotated Python to fast C extensions

StaticCompiler.jl - Compiles Julia code to a standalone library (experimental)

numpy - The fundamental package for scientific computing with Python.

cl-cuda - Cl-cuda is a library to use NVIDIA CUDA in Common Lisp programs.

Pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration

julia-vim - Vim support for Julia.

python-feedstock - A conda-smithy repository for python.

cmu-infix - Updated infix.cl of the CMU AI repository, originally written by Mark Kantrowitz

sbcl - Mirror of Steel Bank Common Lisp (SBCL)'s official repository