The hand-picked selection of the best Python libraries and tools of 2022

This page summarizes the projects mentioned and recommended in the original post on /r/Python

Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
  • ruff

    An extremely fast Python linter and code formatter, written in Rust.

  • Ruff — a fast linter

  • python-benedict

    :blue_book: dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json, pickle, plist, query-string, toml, xls, xml, yaml), s3 support and many utilities.

  • python-benedict — a dict on steroids

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • memray

    Memray is a memory profiler for Python

  • Memray — a memory profiler

  • codon

    A high-performance, zero-overhead, extensible Python compiler using LLVM

  • Codon — a Python compiler using LLVM

  • langchain

    Discontinued ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain] (by hwchase17)

  • LangChain — building LLM-powered apps

  • fugue

    A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

  • fugue — distributed computing done easy

  • diffusers

    🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

  • Diffusers — generative AI

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • lineapy

    Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.

  • LineaPy — notebooks in production

  • whylogs

    An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈

  • whylogs — model monitoring

  • mito

    The mitosheet package, trymito.io, and other public Mito code.

  • Mito — spreadsheet inside notebooks

  • lingua-py

    The most accurate natural language detection library for Python, suitable for short text and mixed-language text

  • Hi u/dekked_, perhaps you want to add my natural language detection library Lingua to the NLP section of the long tail. It is pretty unique among the natural language detection libraries for Python because it is able to detect multiple languages in mixed-language text. It is also one of the most accurate libraries when detecting the language of short text. I would very much appreciate if you added my library to your list.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts