Blas

Open-source projects categorized as Blas

Top 23 Bla Open-Source Projects

  • OpenBLAS

    OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

  • Project mention: LLaMA Now Goes Faster on CPUs | news.ycombinator.com | 2024-03-31

    The Fortran implementation is just a reference implementation. The goal of reference BLAS [0] is to provide relatively simple and easy to understand implementations which demonstrate the interface and are intended to give correct results to test against. Perhaps an exceptional Fortran compiler which doesn't yet exist could generate code which rivals hand (or automatically) tuned optimized BLAS libraries like OpenBLAS [1], MKL [2], ATLAS [3], and those based on BLIS [4], but in practice this is not observed.

    Justine observed that the threading model for LLaMA makes it impractical to integrate one of these optimized BLAS libraries, so she wrote her own hand-tuned implementations following the same principles they use.

    [0] https://en.wikipedia.org/wiki/Basic_Linear_Algebra_Subprogra...

    [1] https://github.com/OpenMathLib/OpenBLAS

    [2] https://www.intel.com/content/www/us/en/developer/tools/onea...

    [3] https://en.wikipedia.org/wiki/Automatically_Tuned_Linear_Alg...

    [4]https://en.wikipedia.org/wiki/BLIS_(software)

  • blis

    BLAS-like Library Instantiation Software Framework

  • Project mention: Faer-rs: Linear algebra foundation for the Rust programming language | news.ycombinator.com | 2024-04-24

    BLIS is an interesting new direction in that regard: https://github.com/flame/blis

    >The BLAS-like Library Instantiation Software (BLIS) framework is a new infrastructure for rapidly instantiating Basic Linear Algebra Subprograms (BLAS) functionality. Its fundamental innovation is that virtually all computation within level-2 (matrix-vector) and level-3 (matrix-matrix) BLAS operations can be expressed and optimized in terms of very simple kernels.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • lapack

    LAPACK development repository

  • Project mention: Linus Torvalds adds arbitrary tabs to kernel code | news.ycombinator.com | 2024-04-17

    Indentation by steps of 3 spaces is common in old Fortran 77 code. This is LAPACK for example. Scroll down to about line 400 to see actual code, not comments.

    https://github.com/Reference-LAPACK/lapack/blob/master/SRC/c...

  • llama2.c

    Llama 2 Everywhere (L2E) (by trholding)

  • Project mention: What would an LLM OS look like? | news.ycombinator.com | 2024-03-14

    Nice article. We did a demo for booting to LLM and also as Kernel Module: https://github.com/trholding/llama2.c The whole things was funny and buggy, but since then we have been developing in stealth, even trying to raise VC capital. Our goal is to make computers like a buddy to whom you can talk to and explain things and get work done, kinda like a Jarvis. The way we interact with computers haven't changed for decades, its time to disrupt that to get more productivity. I also believe with this approach one can avoid installing different applications, when the computer (models) emulate activities done through applications. For example, cutting and pasting a dog from a dog photo onto a banner for a dog racing competition would not require you to be a graphics artist nor use tools like photshop / gimp. You could tell the computer and it would use segment anything to cut the dog, use Text and SD for banner text and bg paste the dog, seek your approval, search for the fastest, best and cheapest banner printing service and submit it. 10 years ago this could have been sci-fi, but now it is a possibility. Just need to connect the dots, package and polish it to make it a good product.

  • CLBlast

    Tuned OpenCL BLAS

  • Project mention: Hosting Your Own AI Chatbot on Android Devices | dev.to | 2024-04-06

    git clone https://github.com/CNugteren/CLBlast.git cd CLBlast cmake . cmake --build . --config Release mkdir install cmake --install . --prefix ~/CLBlast/install cp libclblast.so* $PREFIX/lib cp ./include/clblast.h ../llama.cpp

  • scikit-cuda

    Python interface to GPU-powered libraries

  • vectorious

    Linear algebra in TypeScript.

  • Project mention: Vectorious: BLAS-accelerated linear algebra for TypeScript | /r/hypeurls | 2023-06-02
  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • SimSIMD

    Up to 200x Faster Inner Products and Vector Similarity — for Python, JavaScript, Rust, and C, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE 📐

  • Project mention: Deep Learning in JavaScript | news.ycombinator.com | 2024-03-28
  • oneMKL

    oneAPI Math Kernel Library (oneMKL) Interfaces

  • oj! Algorithms

    oj! Algorithms

  • rocBLAS

    Next generation BLAS implementation for ROCm platform

  • Project mention: Nvidia DGX GH200: The First 100 Terabyte GPU Memory System | news.ycombinator.com | 2023-05-31

    The same is also true for https://github.com/ROCmSoftwarePlatform/rocBLAS and https://github.com/ROCmSoftwarePlatform/hipBLASLt although the build stack, distribution— leaves a lot to be desired, and otherwise quite unstable.

  • blasjs

    Pure Javascript manually written :ok_hand: implementation of BLAS, Many numerical software applications use BLAS computations, including Armadillo, LAPACK, LINPACK, GNU Octave, Mathematica, MATLAB, NumPy, R, and Julia.

  • Project mention: Blasjs – Pure TypeScript rewrite of BLAS | news.ycombinator.com | 2023-06-13
  • kokkos-kernels

    Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels

  • blasfeo

    Basic linear algebra subroutines for embedded optimization

  • laser

    The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers (by mratsim)

  • Project mention: From slow to SIMD: A Go optimization story | news.ycombinator.com | 2024-01-23

    It depends.

    You need 2~3 accumulators to saturate instruction-level parallelism with a parallel sum reduction. But the compiler won't do it because it only creates those when the operation is associative, i.e. (a+b)+c = a+(b+c), which is true for integers but not for floats.

    There is an escape hatch in -ffast-math.

    I have extensive benches on this here: https://github.com/mratsim/laser/blob/master/benchmarks%2Ffp...

  • monolish

    monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture

  • xtensor-blas

    BLAS extension to xtensor

  • sparse

    Sparse matrix formats for linear algebra supporting scientific and machine learning applications

  • hipBLAS

    ROCm BLAS marshalling library

  • rust-blas

    BLAS bindings for Rust

  • intel-mkl-src

    Redistribute Intel MKL as a crate

  • blas

    Wrappers for BLAS (Fortran) (by blas-lapack-rs)

  • hipfort

    Fortran interfaces for ROCm libraries

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Blas related posts

Index

What are some of the best open-source Bla projects? This list will help you:

Project Stars
1 OpenBLAS 5,952
2 blis 2,091
3 lapack 1,396
4 llama2.c 1,382
5 CLBlast 995
6 scikit-cuda 967
7 vectorious 911
8 SimSIMD 715
9 oneMKL 565
10 oj! Algorithms 453
11 rocBLAS 315
12 blasjs 288
13 kokkos-kernels 276
14 blasfeo 276
15 laser 261
16 monolish 189
17 xtensor-blas 154
18 sparse 153
19 hipBLAS 97
20 rust-blas 82
21 intel-mkl-src 74
22 blas 72
23 hipfort 65

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com