StochasticAD.jl vs RecursiveFactorization

Compare StochasticAD.jl vs RecursiveFactorization and see what their differences are.

StochasticAD.jl

Research package for automatic differentiation of programs containing discrete randomness. (by gaurav-arya)
                 StochasticAD.jl    RecursiveFactorization
Mentions         3                  3
Stars            181                -
Growth           -                  -
Activity         8.7                -
Latest Commit    19 days ago        -
Language         Julia              -
License          MIT License        -
The number of mentions indicates the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

StochasticAD.jl

Posts with mentions or reviews of StochasticAD.jl. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-22.
  • Yann Lecun: ML would have advanced if other lang had been adopted versus Python
    9 projects | news.ycombinator.com | 22 Feb 2023
    This disregards how those ecosystems were developed, though. The point is that Python has been quite inhibitory to the development of this ecosystem. There are many corpses of automatic differentiation libraries (starting from autograd and tangent, then Theano, and finally TensorFlow and PyTorch) and many corpses of JIT compilers and accelerators (Cython, Numba, PyPy, TensorFlow XLA, now PyTorch 2's JIT, etc.).

    What has been found over the last decade is that a large part of that is due to the design of the languages. Jan Vitek, for example, has a great talk describing how difficult it is to write a JIT compiler for R due to certain design choices in the language (https://www.youtube.com/watch?v=VdD0nHbcyk4, or the more detailed version https://www.youtube.com/watch?v=HStF1RJOyxI). There are certain language constructs that void lots of optimizations and then have to be worked around, which is why Python JITs choose subsets of the language, avoiding the parts that are hard or impossible to optimize. Each one picks a different domain-specific subset (Numba vs JAX vs others), choosing something that works well for ML versus for more generic code.

    With all of that, it's perfectly reasonable to point out that there have been languages designed to avoid these compilation difficulties, which have resulted in a single (JIT) compiler for the whole language. And by extension, that has made building machine learning and autodiff libraries something other than a Google- or Meta-scale project (for example, PyTorch involves building GPU code bindings and a specialized JIT, not something very accessible). Julia is a language to point to here, but I think well-designed static languages like Rust also deserve a mention. How much further would we have gone if every new ML project didn't build a new compiler and a new automatic differentiation engine? What if the development was more modular and people could just work on the one thing they cared about?

    As a nice example, for last NeurIPS we put out a paper on automatic differentiation of discrete stochastic models, i.e. extending AD to automatically handle cases like agent-based models. The code is open source (https://github.com/gaurav-arya/StochasticAD.jl), and you can see it's almost all written by a (talented) undergraduate over a span of about 6 months. It requires JIT compilation because it works on a lot of things that are not solely big matrix-multiplication GPU kernels, but Julia provides that. And multiple dispatch gives GPU support. Done. The closest thing in PyTorch, storchastic, gets exponential scaling instead of StochasticAD's linear scaling, and isn't quite compatible with a lot of what's required for ML, so it benchmarks as thousands of times slower than the simple Julia code. Of course, when Meta needs it they can and will put the minds of 5-10 top PhDs on it to build it out into a feature of PyTorch over 2 years and have a nice release. But at the end of the day we really need to ask: is that how it should be?
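As a rough illustration of the user-facing API described above, here is a minimal sketch adapted from the package README, assuming the exported `derivative_estimate` function (this is not the paper's actual experiment setup):

```julia
# Minimal sketch: unbiased differentiation of a program with discrete randomness,
# using StochasticAD.jl's exported `derivative_estimate` (as shown in its README).
using StochasticAD, Distributions, Statistics

# Toy discrete-stochastic program: count of Bernoulli(p) successes out of 10.
# E[f(p)] = 10p, so the exact derivative with respect to p is 10.
f(p) = sum(rand(Bernoulli(p)) for _ in 1:10)

p = 0.3
# Each call returns an unbiased single-sample derivative estimate; average many of them.
samples = [derivative_estimate(f, p) for _ in 1:10_000]
println("estimated d/dp E[f(p)] ≈ ", mean(samples), " (exact: 10)")
```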

  • [P] Stochastic Differentiable Programming: Unbiased Automatic Differentiation for Discrete Stochastic Programs (such as particle filters, agent-based models, and more!)
    3 projects | /r/MachineLearning | 18 Oct 2022
    Found relevant code at https://github.com/gaurav-arya/StochasticAD.jl + all code implementations here

RecursiveFactorization

Posts with mentions or reviews of RecursiveFactorization. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-01.
  • Can Fortran survive another 15 years?
    7 projects | news.ycombinator.com | 1 May 2023
    What about the other benchmarks on the same site? https://docs.sciml.ai/SciMLBenchmarksOutput/stable/Bio/BCR/ BCR takes about a hundred seconds and is pretty indicative of systems biological models, coming from 1122 ODEs with 24388 terms that describe a stiff chemical reaction network modeling the BCR signaling network from Barua et al. Or the discrete diffusion models https://docs.sciml.ai/SciMLBenchmarksOutput/stable/Jumps/Dif... which are the justification behind the claims in https://www.biorxiv.org/content/10.1101/2022.07.30.502135v1 that the O(1) scaling methods scale better than O(log n) scaling for large enough models? I mean.

    > If you use special routines (BLAS/LAPACK, ...), use them everywhere as the respective community does.

    It tests with and without BLAS/LAPACK (which isn't always helpful, which of course you'd see from the benchmarks if you read them). One of the key differences, though, is that there are some pure-Julia tools like https://github.com/JuliaLinearAlgebra/RecursiveFactorization... which outperform the respective OpenBLAS/MKL equivalent in many scenarios, and that's one noted factor in the performance boost (and is not trivial to wrap into the interface of the other solvers, so it's not done); a rough sketch of that comparison follows after this comment. There are other benchmarks showing that it's not apples to apples and is instead conservative in many cases, for example https://github.com/SciML/SciPyDiffEq.jl#measuring-overhead, which shows that calling SciPy through Julia with its JIT optimizations gives lower overhead than direct SciPy+Numba, so we use the lower-overhead numbers in https://docs.sciml.ai/SciMLBenchmarksOutput/stable/MultiLang....

    > you must compile/write whole programs in each of the respective languages to enable full compiler/interpreter optimizations

    You do realize that a .so has lower overhead to call from a JIT-compiled language than from a statically compiled language like C, because you can optimize away some of the bindings at runtime, right? https://github.com/dyu/ffi-overhead is a measurement of that, and you see LuaJIT and Julia come out faster than C and Fortran there (a minimal sketch of such a call appears after this comment). This shouldn't be surprising because it's pretty clear how that works?

    I mean yes, someone can always ask for more benchmarks, but now we have a site that's auto updating tons and tons of ODE benchmarks with ODE systems ranging from size 2 to the thousands, with as many things as we can wrap in as many scenarios as we can wrap. And we don't even "win" all of our benchmarks because unlike for you, these benchmarks aren't for winning but for tracking development (somehow for Hacker News folks they ignore the utility part and go straight to language wars...).

    If you have a concrete change you think can improve the benchmarks, then please share it at https://github.com/SciML/SciMLBenchmarks.jl. We'll be happy to make and maintain another.
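As referenced above, here is a rough sketch of the kind of LU comparison being made between RecursiveFactorization.jl's pure-Julia factorization and the LAPACK-backed one. This is not the SciMLBenchmarks harness; the matrix size is only an illustration of the regime where the pure-Julia routine is reported to do well.

```julia
# Rough comparison sketch: RecursiveFactorization.jl's pure-Julia recursive LU
# vs. the BLAS/LAPACK-backed LinearAlgebra.lu!. Not the SciMLBenchmarks setup.
using LinearAlgebra, RecursiveFactorization, BenchmarkTools

A = rand(200, 200)  # a modest size, illustrative only

@btime LinearAlgebra.lu!(B) setup = (B = copy($A)) evals = 1           # OpenBLAS/MKL getrf!
@btime RecursiveFactorization.lu!(B) setup = (B = copy($A)) evals = 1  # pure-Julia recursive LU
```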
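And for the FFI-overhead point, the Julia side of such a measurement looks roughly like this; the shared library name and C function below are hypothetical placeholders, not the actual ffi-overhead harness.

```julia
# Minimal sketch of calling into a C shared library from Julia via ccall,
# the pattern whose per-call overhead https://github.com/dyu/ffi-overhead measures.
# `libplus.so` and `plusone` are hypothetical; assume the library exports
#   int plusone(int x);
const libplus = "./libplus.so"

plusone(x::Integer) = ccall((:plusone, libplus), Cint, (Cint,), x)

# A hot loop of tiny cross-language calls: what gets timed is essentially
# the per-call binding overhead, which a JIT can specialize away.
function run_calls(n)
    s = 0
    for i in 1:n
        s += plusone(i % 100)
    end
    return s
end

@time run_calls(10_000_000)
```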

  • Yann Lecun: ML would have advanced if other lang had been adopted versus Python
    9 projects | news.ycombinator.com | 22 Feb 2023
  • Small Neural networks in Julia 5x faster than PyTorch
    8 projects | news.ycombinator.com | 14 Apr 2022
    Ask them to download Julia and try it, and file an issue if it is not fast enough. We try to have the latest available.

    See for example: https://github.com/JuliaLinearAlgebra/RecursiveFactorization...

What are some alternatives?

When comparing StochasticAD.jl and RecursiveFactorization you can also consider the following projects:

Agents.jl - Agent-based modeling framework in Julia

tiny-cuda-nn - Lightning fast C++/CUDA neural network framework

julia - The Julia Programming Language

diffrax - Numerical differential equation solvers in JAX. Autodifferentiable and GPU-capable. https://docs.kidger.site/diffrax/

Zygote.jl - 21st century AD

vectorflow

Octavian.jl - Multi-threaded BLAS-like library that provides pure Julia matrix multiplication

LeNetTorch - PyTorch implementation of LeNet for fitting MNIST for benchmarking.

Distributions.jl - A Julia package for probability distributions and associated functions.

KiteSimulators.jl - Simulators for kite power systems

RecursiveFactorization.jl

SciPyDiffEq.jl - Wrappers for the SciPy differential equation solvers for the SciML Scientific Machine Learning organization