Top 23 Python Benchmarking Projects
-
Project mention: Smoke, stress, spike, soak, and recovery: 5 essential load test profiles | news.ycombinator.com | 2024-08-15
Hi everyone!
Long time lurker, first time poster here. I'm the maintainer of Locust (https://github.com/locustio/locust), and this is the second part of my series about load testing.
Let me know what you think!
-
Project mention: Merlion: A Machine Learning Framework for Time Series Intelligence | news.ycombinator.com | 2025-02-28
-
AutoRAG
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
AutoRAG with Milvus | ADO | Self Hosting LLM | Noema Declarative AI | New NIM Blueprint for building AI Virtual Assistant | Zilliz Integrations | Using Milvus for Semantic Search | Contextual Retrieval | Meta: Quantized Light Weight Models | https://arxiv.org/pdf/2407.01219 | Cool Icons | IBM Watson AI Milvus Bot | The Hacker's Browser | Small and Mighty H2O Model | Zilliz Cloud vs Qdrant | Gravatino and Agents | OSS Summit Europe 2024 Report | RAG Strategi | MS AI Data Visualizations | Graph RAG | South Bay Meetup 15 Oct 2024 | Influx and Milvus | Multimodal Pipelines | Constrained Sampling from LLM | BAML: Cheaper, Fast and More Accurate Function Calling | Infinite World Generation with outlines txt | Ollama Client Swift | Atomic Agents | PYMUPDF4LLM | Milvus for AI Agents | Fine Tuning LLAMA 3 with ORPO | Run NVIDIA Models | LLM Training Meta Lingua | 1 Bit LLM - MS BitNet | Intro | Mastering Chunk | Storm Stanford Tool | DAMO NLP SG CaRing | LLM Reasoners
-
CARLA
CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms (by carla-recourse)
-
No interest in a more general tool?
https://github.com/sosy-lab/benchexec
-
fastero
Python timeit CLI for the 21st century! Colored output, multi-line input with syntax highlighting and autocompletion, and much more!
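For context, the description above refers to Python's standard-library `timeit` module. The following is a minimal sketch of timing a snippet with that module directly; it does not use fastero or reproduce its interface, and the helper name `best_time` and the sample statements are hypothetical choices made for illustration.

```python
import timeit


def best_time(stmt, setup="pass", number=10_000, repeat=5):
    """Return the best observed time per execution of `stmt`, in seconds.

    `best_time` is a hypothetical helper, not part of timeit or fastero.
    Taking the minimum over several repeats reduces noise from other
    processes running on the machine.
    """
    timer = timeit.Timer(stmt, setup=setup)
    totals = timer.repeat(repeat=repeat, number=number)
    return min(totals) / number


if __name__ == "__main__":
    # Compare two ways of building the same string (illustrative only).
    setup = 'parts = ["x"] * 100'
    joined = best_time('"".join(parts)', setup=setup)
    looped = best_time(
        "s = ''\nfor p in parts:\n    s += p", setup=setup
    )
    print(f"str.join:      {joined * 1e9:.0f} ns per call")
    print(f"+= in a loop:  {looped * 1e9:.0f} ns per call")
```

The absolute numbers depend on the interpreter and machine, so only the relative ordering of the two results is meaningful.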
-
Xpedite
A non-sampling profiler, purpose-built to measure and optimize the performance of low-latency/real-time C++ systems
-
python-benchmark-harness
A micro/macro benchmark framework for the Python programming language that helps with optimizing your software.
-
video-quality-metrics
Uses FFmpeg to benchmark video encoders to compare VMAF/SSIM/PSNR with different encoder settings.
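Of the three metrics named above, PSNR is the simplest to state: it is 10 * log10(MAX^2 / MSE), where MSE is the mean squared error between reference and distorted samples. The sketch below computes it over flat lists of 8-bit pixel values in pure Python. It is an illustration of the formula only; the project itself relies on FFmpeg, and the function name `psnr` and the sample values are assumptions made for this example.

```python
import math


def psnr(reference, distorted, max_value=255):
    """Peak signal-to-noise ratio in dB between two equal-length sequences.

    `max_value` is the largest possible sample value (255 for 8-bit video).
    Identical inputs have zero error, so the result is infinite.
    """
    if len(reference) != len(distorted) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((r - d) ** 2 for r, d in zip(reference, distorted)) / len(reference)
    if mse == 0:
        return math.inf
    return 10 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    original = [100, 100, 100, 100]   # hypothetical pixel values
    encoded = [110, 90, 110, 90]      # same pixels after a lossy encode
    print(f"PSNR: {psnr(original, encoded):.2f} dB")
```

Higher values mean the encoded signal is closer to the reference, which is why the metric is useful for comparing encoder settings at a fixed bitrate.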
-
GitHub Repo: github.com/qcri/LLMeBench
-
less_slow.py
Playing around "Less Slow" coding practices in Python, from numerical micro-kernels to coroutines, ranges, and polymorphic state machines
Great question! This has been top of mind for me for the last 2-3 years.
Short answer: sadly, no. I love the "usability" promise of coroutines, and even have 2-3 FOSS projects that could be rewritten entirely around C++ or Rust coroutines for better debuggability and extensibility, but my experiments show that the runtime cost of most coroutine-like abstractions is simply too high. Frankly, I'm not even sure if a better design is possible on modern hardware.
This leads me to conclude that, despite my passion for SIMD and superscalar execution, the highest-impact new assembly instructions that x86 and Arm could standardize would center on async execution and lightweight context switching... yet I haven't seen any movement in that direction.
I also wrote toy examples for various range/async/stream models in C++, Rust, and Python, with measured latencies in inline comments:
- Python:
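The kind of comparison described in the comment above can be approximated in pure Python. The sketch below is not the author's benchmark code; it is a hypothetical micro-benchmark that sums the same range through a plain loop, a generator, and an async generator, and reports a rough per-item cost. All function names and the item count `N` are assumptions made for illustration.

```python
import asyncio
import time

N = 100_000  # arbitrary item count chosen for illustration


def sum_loop(n):
    """Baseline: iterate a range directly."""
    total = 0
    for i in range(n):
        total += i
    return total


def _gen(n):
    for i in range(n):
        yield i


def sum_generator(n):
    """Same work, but every item passes through a generator frame."""
    total = 0
    for i in _gen(n):
        total += i
    return total


async def _agen(n):
    for i in range(n):
        yield i


async def _sum_async(n):
    total = 0
    async for i in _agen(n):
        total += i
    return total


def sum_async_generator(n):
    """Same work through an async generator.

    asyncio.run() also pays a one-time event-loop setup cost, which is
    included in the measurement and amortized over all n items.
    """
    return asyncio.run(_sum_async(n))


def ns_per_item(fn, n=N, repeats=5):
    """Best-of-`repeats` wall-clock cost per item, in nanoseconds."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter_ns()
        fn(n)
        best = min(best, time.perf_counter_ns() - start)
    return best / n


if __name__ == "__main__":
    for fn in (sum_loop, sum_generator, sum_async_generator):
        print(f"{fn.__name__:>20}: {ns_per_item(fn):8.1f} ns/item")
```

Results vary widely across interpreter versions and hardware, so this is useful only for seeing the relative overhead of each abstraction on one machine.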
-
CodSpeed | Founding AI Engineer | On-site (Paris) / Remote (Europe) | Full-time | https://codspeed.io
We're building software performance optimization tools to optimize and measure code performance before it is deployed to production. We avoid regressions that impact UX and help developers solve their performance issues faster. We're already live and trusted by top-tier open-source project teams such as Pydantic, Ruff, and Prisma.
We're at an exciting early stage and looking for talented engineers who share our passion for helping to enhance the performance of software used by billions, improving the software development lifecycle, and building tools we love to use ourselves.
Apply at https://codspeed.notion.site/Founding-AI-Engineer-cd1bf4fd73...
-
cloud_benchmarker
Cloud Benchmarker automates performance testing of cloud instances, offering insightful charts and tracking over time.
Python Benchmarking related posts
-
CatBench Vector Search Playground on Postgres
-
BenchExec: A Framework for Reliable Benchmarking and Resource Measurement
-
Pinpoint performance regressions with CI-Integrated differential profiling
-
Show HN: Cloud Benchmarker: See how fast your cloud instances are for real
-
Cloud Benchmarker: See how fast your cloud instances are for real!
-
Cloud Benchmarker: See how fast your cloud instances are for real! (Python Open-Source)
-
Show HN: 78% MNIST accuracy using GZIP in under 10 lines of code
-
A note from our sponsor - InfluxDB
influxdata.com | 25 Apr 2025
Index
What are some of the best open-source Benchmarking projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | locust | 26,048 |
2 | Merlion | 4,259 |
3 | AutoRAG | 3,856 |
4 | benchmark_VAE | 1,904 |
5 | pytest-benchmark | 1,300 |
6 | pyperf | 847 |
7 | pygraft | 683 |
8 | truvari | 353 |
9 | CARLA | 286 |
10 | benchexec | 255 |
11 | fastero | 237 |
12 | ferret | 212 |
13 | Xpedite | 164 |
14 | python-benchmark-harness | 157 |
15 | CompanyKG | 131 |
16 | video-quality-metrics | 129 |
17 | LLMeBench | 96 |
18 | less_slow.py | 90 |
19 | pytest-codspeed | 77 |
20 | ycecream | 55 |
21 | toml-bench | 52 |
22 | api-performance-tests | 40 |
23 | cloud_benchmarker | 35 |