Open-source projects categorized as Python

Top 23 Python Open-Source Projects

  • GitHub repo tensorflow

    An Open Source Machine Learning Framework for Everyone

    Project mention: How to install GLIBC>=2.29 on Debian 10? | reddit.com/r/debian | 2021-04-15

    https://github.com/tensorflow/tensorflow/issues/53 looks relevant.

  • GitHub repo system-design-primer

    Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

    Project mention: How to shift careers into tech in my 30s? | reddit.com/r/cscareerquestions | 2021-04-09

    i have less experience than you but from my own research everyone suggests this for system design https://github.com/donnemartin/system-design-primer/blob/master/README.md#system-design-topics-start-here

  • GitHub repo TheAlgorithms

    All Algorithms implemented in Python

    Project mention: Gitbash and PyCharm how should I organize a repository? | reddit.com/r/learnpython | 2021-04-13

    Now, I would like to have something similar to this https://github.com/TheAlgorithms/Python - a master branch with folders containing my algorithms/files/whatever.py

  • GitHub repo awesome-python

    A curated list of awesome Python frameworks, libraries, software and resources

    Project mention: I am a proficient Python coder whose learning has plateaued. Any really useful libraries I should look into learning? Taking recommendations. | reddit.com/r/learnpython | 2021-04-16

    I suggest looking at this and this github links which group many of the most used/useful python libraries by their category of use.

  • GitHub repo thefuck

    Magnificent app which corrects your previous console command.

    Project mention: My pretty much fully automated ZSH config | reddit.com/r/commandline | 2021-04-15

    It's got XDG Conformance for various tools (which is also exposed to bash, because we can never escape bash scripts, so they might as well adhere to the rest of the system). It's got auto-detection of various Env tools (JEnv, PyEnv, RbEnv, etc), and auto-configuration of said tools. It's got installer scripts for various useful tools (the various Envs, Docker, Gcloud SDK, etc). It's even got support for interactive niceties like TheFuck, Exa, and FZF.

  • GitHub repo Django

    The Web framework for perfectionists with deadlines.

    Project mention: Curso de programación en Python organizado por estudiantes :) | reddit.com/r/argentina | 2021-04-13
  • GitHub repo Flask

    The Python micro framework for building web applications.

    Project mention: Flask 2.0 is coming, please help us test | reddit.com/r/Python | 2021-04-16

    A few users have asked about background asyncio tasks, which wasn't addressed in the docs. I've proposed this PR to help explain. In summary background tasks are not possible, except for the limited case whereby an ASGI server is used in conjunction with asgiref's WsgiToAsgi adapter.

  • GitHub repo Keras

    Deep Learning for humans

    Project mention: Curso de programación en Python organizado por estudiantes :) | reddit.com/r/argentina | 2021-04-13
  • GitHub repo httpie

    As easy as /aitch-tee-tee-pie/ 🥧 Modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more. https://twitter.com/httpie

    Project mention: Building an API with FastAPI | dev.to | 2021-04-13

    At first, I saved a sample of the trending repositories HTML to avoid sending dozens of requests to Github. I use HTTPie as HTTP client to perform requests via the terminal.

  • GitHub repo Ansible

    Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com.

    Project mention: UNH CEPS / General Advice | reddit.com/r/unh | 2021-04-14

    The job market in New England was terrible for junior devs in 2010 and there were only 6-7 CS graduates that year down from the ~130 that started in freshman year. Anyway, I graduated and looked in NH, Boston, Stamford, and NYC and couldn't find anything, but ended up finding a neat gig outside of DC working for BAE as a kernel developer for a proprietary operating system called STOP (https://www.baesystems.com/en/product/stop-os). I worked there for two years before realizing I hated the area and also the DoD's budget got slashed so we were looking at staff reductions. I reached out to a fellow 2010 UNH CS graduate and ex-IOLer working at NetApp and he hooked me up with an interview down in Raleigh for a SAN interoperability role which was basically getting paid a lot more money to do what I did at the IOL. Two years after that NetApp did a ton of layoffs and my whole group got cut in half. Some of the smarter people went to a startup called Nimble Storage, so I decided to follow the smart people and resigned from NetApp and went to Nimble. I was there for 4 years and really got my hands in helping develop the test library and transition it from Perl to Python. Sounds boring, but it was pretty interesting and was my first crack at managing projects across timezones. Nimble got acquired by HPE and after a year of watching our benefits get slashed or taken away completely I resigned to work at Nutanix and join a guy that I had worked at NetApp and Nimble with previously. I was there less than a year before I left for personal reasons and joined a cloud/kubernetes monitoring company called Sysdig that was founded by the Wireshark guy and currently has Nimble's old CEO in charge. The Sysdig office in Raleigh is basically entirely comprised of people from NetApp and Nimble, so it was like a reunion. While there I developed a test framework in Python that leveraged pytest, terraform (https://www.terraform.io/), ansible (https://www.ansible.com/) to deploy resources across multiple clouds and kubernetes environments, install Sysdig components on them, then test the products. I left Sysdig last summer and rejoined Nutanix in a different capacity and now have a bit more project ownership, though I am still in an individual contributor role (not a manager).

  • GitHub repo Pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Project mention: I am trying to install gpu support for tensorflow on ubuntu 20.04. I have a GTX 980 and I keep running into issues. Can anyone help? | reddit.com/r/learnmachinelearning | 2021-04-16

    For more info, see "install pytorch" section on https://pytorch.org/

  • GitHub repo project-based-learning

    Curated list of project-based tutorials

    Project mention: I self-learned to code at 30. And so can you | news.ycombinator.com | 2021-03-17

    I like this nice list for finding those sorts of tutorials https://github.com/tuvtran/project-based-learning

  • GitHub repo scikit-learn

    scikit-learn: machine learning in Python

    Project mention: Beginner's Question: Naive Bayes Implementation for Spam Classification | reddit.com/r/MLQuestions | 2021-04-17

    Look at the Sklearn implementation and check out some of the differences in the fit method (https://github.com/scikit-learn/scikit-learn/blob/95119c13a/sklearn/naive_bayes.py#L593)

  • GitHub repo requests

    A simple, yet elegant HTTP library.

    Project mention: Building an API with FastAPI | dev.to | 2021-04-13

    I executed the script 3 times making 20 requests on each execution. Ok now lets replace the synchronous requests library by the asynchronous aiohttp library. Furthermore, we add the async/await keywords on the right positions. Our final code will look like this:

  • GitHub repo Home Assistant

    :house_with_garden: Open source home automation that puts local control and privacy first

    Project mention: APIs to turn on TVs | reddit.com/r/learnpython | 2021-04-17
  • GitHub repo TensorFlow-Examples

    TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

    Project mention: Tensorman and RTX 30-Series GPU's | reddit.com/r/pop_os | 2021-03-19

    When I run this simple project, the log output is below. There is a 5-minute pause at 16:48. There is a second pause at the end of the script before the output of the example (final output excluded). This project runs quickly if I exclude "--gpu" and run it on the CPU.

  • GitHub repo Scrapy

    Scrapy, a fast high-level web crawling & scraping framework for Python.

    Project mention: Why is Python popular despite being accused of being slow? | reddit.com/r/programming | 2021-04-16

    I use it regularly for things like web scraping (Scrapy is a joy) and data manipulation. For instance just wrote some fairly complicated scripts for doing address matching to pair up a couple of UK datasets without a common identity field. Human-entered addresses are decidedly fuzzy so you end up with a lot of arbitrary rules and Python is just fast to develop against. I don't really care if the script takes a couple of hours to run on the full datasets (35 million addresses) as opposed to half that time in something else more of a pain to tweak around with.

  • GitHub repo Face Recognition

    The world's simplest facial recognition api for Python and the command line

    Project mention: What algorithm does the face_recognition library use? | reddit.com/r/learnmachinelearning | 2021-04-15

    What algorithm does the face_recognition library use?

  • GitHub repo superset

    Apache Superset is a Data Visualization and Data Exploration Platform

    Project mention: Open source contributions for a Data Engineer? | reddit.com/r/dataengineering | 2021-04-16

    If you want to work more on the visualization side maybe Metabase, Superset and Streamlit.

  • GitHub repo manim

    Animation engine for explanatory math videos

    Project mention: How are mathematical animations made? | reddit.com/r/animation | 2021-04-16

    3blue1brown has in fact written his own software to do these kinds of animations. It's called manim and it's open source on Github ->https://github.com/3b1b/manim

  • GitHub repo fastapi

    FastAPI framework, high performance, easy to learn, fast to code, ready for production

    Project mention: PEP 563, PEP 649 and the future of pydantic and FastAPI | news.ycombinator.com | 2021-04-15

    > 1 - FastAPI was created before the change was announced.

    No, it wasn't. You might be thinking of pydantic.

    FastAPI's first commit was December 2018: https://github.com/tiangolo/fastapi/commit/406c092a3bf65bbd4.... PEP 563 was the first item in the Python 3.7.0 release notes earlier that year.

  • GitHub repo Apache Spark

    Apache Spark - A unified analytics engine for large-scale data processing

    Project mention: A Scala rant | reddit.com/r/scala | 2021-03-31

    yep, nailed it: https://github.com/apache/spark/blob/master/pom.xml#L122

  • GitHub repo Pandas

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

    Project mention: Programming for Beginners | dev.to | 2021-04-16

    As we can observe, imperative programming in Python specifies the computational steps to get the desired output, where the declarative style of SQL describes the output. Another thing to mention is that Python does support a (functional) declarative style of programming and SQL-like interface with the use of some modules such as Pandas. Python does even extend for other features of programming such as object-oriented and functional programming, making it a general-purpose language.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2021-04-17.


What are some of the best open-source Python projects? This list will help you:

Project Stars
1 tensorflow 154,864
2 system-design-primer 126,125
3 TheAlgorithms 102,178
4 awesome-python 95,778
5 thefuck 59,786
6 Django 56,793
7 Flask 54,499
8 Keras 51,038
9 httpie 50,448
10 Ansible 47,760
11 Pytorch 47,500
12 project-based-learning 47,262
13 scikit-learn 45,396
14 requests 44,962
15 Home Assistant 41,979
16 TensorFlow-Examples 40,434
17 Scrapy 40,335
18 Face Recognition 39,394
19 superset 38,016
20 manim 32,850
21 fastapi 29,505
22 Apache Spark 29,391
23 Pandas 29,296