awkward vs uproot5

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

awkward		uproot5
	Project
4	Mentions	2
793	Stars	218
0.6%	Growth	1.4%
9.6	Activity	9.2
6 days ago	Latest Commit	2 days ago
Python	Language	Python
BSD 3-clause "New" or "Revised" License	License	BSD 3-clause "New" or "Revised" License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

awkward

Posts with mentions or reviews of awkward. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-07-03.

Efficient Jagged Arrays
2 projects | news.ycombinator.com | 3 Jul 2023

there's a whole ecosystem in Python originally developed for high energy physics data processing: https://github.com/scikit-hep/awkward all because Numpy demands square N-dimensional array
Same technique used everywhere, here's a simple Julia pkg for the same thing: https://github.com/JuliaArrays/ArraysOfArrays.jl/blob/3a6f5b...
But Julia at least has the decency to just support ragged Vector{Vector} out of the box, and it's not that slow
The hand-picked selection of the best Python libraries released in 2021
12 projects | /r/Python | 21 Dec 2021

Awkward Array.
Awkward: Nested, jagged, differentiable, mixed type, GPU-enabled, JIT'd NumPy
5 projects | news.ycombinator.com | 16 Dec 2021

Numba's @vectorize decorator (https://numba.pydata.org/numba-doc/latest/user/vectorize.htm...) makes a ufunc, and Awkward Array knows how to implicitly map ufuncs. (It is necessary to specify the signature in the @vectorize argument; otherwise, it won't be a true ufunc and Awkward won't recognize it.)
When Numba's JIT encounters a ctypes function, it goes to the ABI source and inserts a function pointer in the LLVM IR that it's generating. Unfortunately, that means that there is function-pointer indirection on each call, and whether that matters depends on how long-running the function is. If you mean that your assembly function is 0.1 ns per call or something, then yes, that function-pointer indirection is going to be the bottleneck. If you mean that your assembly function is 1 μs per call and that's fast, given what it does, then I think it would be alright.
If you need to remove the function-pointer indirection and still run on Awkward Arrays, there are other things we can do, but they're more involved. Ping me in a GitHub Issue or Discussion on https://github.com/scikit-hep/awkward-1.0

uproot5

Posts with mentions or reviews of uproot5. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-04.

Potential of the Julia programming language for high energy physics computing
10 projects | news.ycombinator.com | 4 Dec 2023

> I wasn't proposing ROOT to be reimplemented in JS. That was what the GP attributed to me.
Sorry for assuming that. I really felt the pain of thinking of possibility of combining two things I hate so much together (JS+ROOT)
> "Laypeople" may also think that code is optimized to the last cycle in something like HEP simulations. It's made fast enough and the optimization is nowhere near the level of e.g. graphics heavy games.
I understand that in other areas there might be more sophisticated optimizations, but does not change things much inside HEP field community. And it is not optimized only for simulations but for other things too. It is not one problem optimization.
> Real-time usage like high frequency large data collection will probably never happen on the "single language". But I'd guess ROOT is not used at that level either? Also at least last time I checked, ROOT is moving to Python (probably not for the hottest loops of the simulation though).
I did not mean to indicate that ROOT is being used to handle the online processing (In HEP terms). It is usually handled via optimized C++ compiled code. My idea is that you will probably never use JS or any interpreted language (or anything other than C++ to be pessimistic) for that. ROOT at the end of the day is much closer to C++ than anything else. So learning curve wouldn't be that much if you come with some C++ knowledge initially.
> Also at least last time I checked, ROOT is moving to Python (probably not for the hottest loops of the simulation though).
I think you mean PyROOT [1]? This is the official python ROOT interface It provides a set of Python bindings to the ROOT C++ libraries, allowing Python scripts to interact directly with ROOT classes and methods as if they were native Python. But that does not represent and re-writing. It makes things easier for end users who are doing analysis though, while be efficient in terms of performance, especially for operations that are heavily optimized in ROOT.
There is also uproot [2] which is a purely Python-based reader and writer of ROOT files. It is not a part of the official ROOT project and does not depend on the ROOT libraries. Instead, uproot re-implements the I/O functionalities of ROOT in Python. However, it does not provide an interface to the full range of ROOT functionalities. It is particularly useful for integrating ROOT data into a Python-based data analysis pipeline, where libraries like NumPy, SciPy, Matplotlib, and Pandas ..etc are used.
> Off-topic: C++ interpretation like done in ROOT seems like a really bad idea.)
I will agree with you. But to be fair the purpose of ROOT is interactive data analysis but over the decades a lot of things gets added, and many experiments had their own soft forks and things started to get very messy quickly. So that there is no much inertia to fix problems and introduce improvements.
[1] https://root.cern/manual/python/
[2] https://github.com/scikit-hep/uproot5
Root with python
2 projects | /r/ParticlePhysics | 22 Jun 2021

Besides PyROOT, you can also use uproot to read ROOT files, if you want to avoid the ROOT-dependency. The current version (uproot4) does not yet support writing ROOT files, but the previous/deprecated version (uproot3) does. (Please note: uproot is not maintained by the ROOT project team).

What are some alternatives?

When comparing awkward and uproot5 you can also consider the following projects:

sqlmodel - SQL databases in Python, designed for simplicity, compatibility, and robustness.

uproot3 - ROOT I/O in pure Python and NumPy.

DearPyGui - Dear PyGui: A fast and powerful Graphical User Interface Toolkit for Python with minimal dependencies

pyhf - pure-Python HistFactory implementation with tensors and autodiff

django-ninja - 💨 Fast, Async-ready, Openapi, type hints based framework for building APIs

vaex - Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

numba-dpex - Data Parallel Extension for Numba

iminuit - Jupyter-friendly Python interface for C++ MINUIT2

skweak - skweak: A software toolkit for weak supervision applied to NLP tasks

vddfit

AugLy - A data augmentations library for audio, image, text, and video.

julia - The Julia Programming Language

awkward vs sqlmodel uproot5 vs uproot3 awkward vs DearPyGui uproot5 vs pyhf awkward vs django-ninja uproot5 vs vaex awkward vs numba-dpex uproot5 vs iminuit awkward vs skweak uproot5 vs vddfit awkward vs AugLy uproot5 vs julia

Compare awkward vs uproot5 and see what are their differences.

awkward

uproot5

awkward

uproot5

What are some alternatives?