Python Research

Open-source Python projects categorized as Research

Top 23 Python Research Projects

  • qlib

    Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms. including supervised learning, market dynamics modeling, and RL.

    Project mention: qlib: NEW Other Models - star count:12463.0 | /r/algoprojects | 2023-09-26
  • mlfinlab

    MlFinLab helps portfolio managers and traders who want to leverage the power of machine learning by providing reproducible, interpretable, and easy to use tools.

    Project mention: mlfinlab: open source library maintained by hudson and thames though much of the content has moved to a subscription model. Idea is to implement academic research in python code and aggregate it as a package. Sources from [Journal of financial data s | /r/algoprojects | 2022-11-13
  • InfluxDB

    Collect and Analyze Billions of Data Points in Real Time. Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge.

  • software-papers

    📚 A curated list of papers for Software Engineers

    Project mention: Looking for recommendations for academic papers in the computer science domain | /r/learnprogramming | 2022-11-15 has some interesting papers on software engineering

  • acme

    A library of reinforcement learning components and agents

    Project mention: Fast and hackable frameworks for RL research | /r/reinforcementlearning | 2023-03-08

    I'm tired of having my 200m frames of Atari take 5 days to run with dopamine, so I'm looking for another framework to use. I haven't been able to find one that's fast and hackable, preferably distributed or with vectorized environments. Anybody have suggestions? seed-rl seems promising but is archived (and in TF2). sample-factory seems super fast but to the best of my knowledge doesn't work with replay buffers. I've been trying to get acme working but documentation is sparse and many of the features are broken.

  • lingvo


  • scenic

    Scenic: A Jax Library for Computer Vision Research and Beyond (by google-research)

    Project mention: Vid2Seq: A pretrained visual language model for describing multi-event videos | | 2023-03-17

    Anyone figured out how to run this against a video? has an example showing how to "train Vid2Seq on YouCook2" using "python -m scenic.projects.vid2seq.main", but I couldn't see the recipe for using it against a video to return a description.

  • habitat-lab

    A modular high-level library to train embodied AI agents across a variety of tasks and environments.

  • Mergify

    Updating dependencies is time-consuming.. Solutions like Dependabot or Renovate update but don't merge dependencies. You need to do it manually while it could be fully automated! Add a Merge Queue to your workflow and stop caring about PR management & merging. Try Mergify for free.

  • yacs

    YACS -- Yet Another Configuration System (by rbgirshick)

    Project mention: YACS – Yet Another Configuration System | | 2022-12-23
  • PyGame-Learning-Environment

    PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.

  • dreamerv2

    Mastering Atari with Discrete World Models

    Project mention: Sources of Actor Gradients | /r/reinforcementlearning | 2022-11-21

    In fact, they found that just reinforce gradients work in DM control now too: Dreamerv2 GitHub (they just needed to turn off gradients through the action path - which I guess was being passed back with straight-through estimation? I'm actually having a difficult time telling how the gradient is different on the action vs policy.log_prob(action)).

  • pybossa

    PYBOSSA is the ultimate crowdsourcing framework (aka microtasking) to analyze or enrich data that can't be processed by machines alone.

    Project mention: Open Sourced Crowdsourcing tool | /r/opensource | 2022-10-26
  • rtdl

    Research on Tabular Deep Learning (by Yura52)

  • iris

    Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%. (by eloialonso)

    Project mention: From Deep to Long Learning | | 2023-04-09

    Yea, after all these LLMs are predicting one sequence of tokens from another sequence of tokens and the tokens could be anything, it just "happens" that text has the most knowledge and the easiest to input, then there are image, sound, video, but tokens could also be learned from world experience in RL:

    Transformers are Sample-Efficient World Models:

  • rwa

    Machine Learning on Sequential Data Using a Recurrent Weighted Average

  • nn-template

    Generic template to bootstrap your PyTorch project.

  • holodeck

    High Fidelity Simulator for Reinforcement Learning and Robotics Research. (by BYU-PCCL)

  • Mava

    🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

    Project mention: Starting wth Multi Agent Reinforcement Learning | /r/reinforcementlearning | 2023-04-29

    If you want to play with models and algorithms around MARL, take a look at Mava.

  • asreview

    Active learning for systematic reviews

  • Papers-in-100-Lines-of-Code

    Implementation of papers in 100 lines of code.

    Project mention: How do I run this code from Papers in 100 lines of code? | /r/NeuralRadianceFields | 2023-09-22

    I wanted to try the some code written by Maxime Vandegar

  • stable-baselines3-contrib

    Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

    Project mention: Understanding Action Masking in RLlib | /r/reinforcementlearning | 2023-03-12

    Here's a theoretical overview and an implementation of action masking for PPO.

  • pathml

    Tools for computational pathology

    Project mention: Hilo Semanal de Consultas IT - Asesoría Técnica, Desarrollo Profesional y Aprendizaje | /r/chileIT | 2023-06-28
  • orion

    Asynchronous Distributed Hyperparameter Optimization. (by Epistimio)

  • tabular-dl-num-embeddings

    (NeurIPS 2022) On Embeddings for Numerical Features in Tabular Deep Learning

  • Sonar

    Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-09-26.

Python Research related posts


What are some of the best open-source Research projects in Python? This list will help you:

Project Stars
1 qlib 12,452
2 mlfinlab 3,559
3 software-papers 3,524
4 acme 3,192
5 lingvo 2,755
6 scenic 2,508
7 habitat-lab 1,246
8 yacs 1,160
9 PyGame-Learning-Environment 960
10 dreamerv2 788
11 pybossa 727
12 rtdl 694
13 iris 682
14 rwa 602
15 nn-template 584
16 holodeck 555
17 Mava 549
18 asreview 484
19 Papers-in-100-Lines-of-Code 376
20 stable-baselines3-contrib 334
21 pathml 313
22 orion 265
23 tabular-dl-num-embeddings 190
Write Clean Python Code. Always.
Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.