Top 21 Python Arxiv Projects
arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Project mention: ArXiv LaTeX Cleaner: Clean the LaTeX code of your paper to submit to ArXiv | news.ycombinator.com | 2025-01-31
arxiv-vanity
Renders papers from arXiv as responsive web pages so you don't have to squint at a PDF.
arxiv-sanity-lite
arxiv-sanity lite: tag arXiv papers of interest and get recommendations of similar papers in a nice UI, using SVMs over tf-idf feature vectors built from paper abstracts.
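The recommendation idea can be sketched in a few lines. The block below is a minimal, standard-library-only illustration and not arxiv-sanity-lite's actual code: it builds tf-idf vectors from abstracts and ranks papers by cosine similarity, which stands in here for the SVM the project describes. The function names (`tfidf_vectors`, `cosine`, `rank_similar`) and the sample abstracts are hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one {term: weight} dict per document using smoothed tf-idf."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: the number of abstracts each term appears in.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + df[term])) + 1.0)
            for term, count in counts.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_similar(docs, query_index):
    """Rank every other document by similarity to docs[query_index]."""
    vectors = tfidf_vectors(docs)
    query = vectors[query_index]
    scores = [
        (i, cosine(query, vec))
        for i, vec in enumerate(vectors)
        if i != query_index
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical abstracts, shortened to a few words each.
abstracts = [
    "diffusion models for image generation",
    "score based diffusion models for image synthesis",
    "reinforcement learning with tree search planning",
]
ranking = rank_similar(abstracts, 0)
print(ranking)
```

A tagged paper plays the role of the query here. An SVM-based variant would instead train a classifier with tagged papers as positives and score all remaining abstracts with it.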
paper2remarkable
Fetch an academic paper or web article and send it to the reMarkable tablet with a single command
summarizepaper
An AI-powered arXiv paper summarization website with a virtual assistant for answering questions.
bibcure
Bibcure takes care of boring tasks by keeping your bibfile up to date and normalized. It also lets you easily download all the papers listed in your BibTeX file.
searchthearxiv
The code powering searchthearxiv.com, a simple semantic search engine for more than 300,000 ML papers on arXiv.
About: https://searchthearxiv.com/about
Code: https://github.com/augustwester/searchthearxiv
pdf2doi
A python library/command-line tool to extract the DOI or other identifiers of a scientific paper from a pdf file.
Auto-Research
Generates a custom, detailed survey paper with topic-clustered sections and proper citations from a single query in under 30 minutes.
ocr-benchmark
If you're wondering how they prompt the models:
"Perform OCR on this image. Return only the text found in the image as a single continuous string without any newlines, additional text, or commentary. Separate words with single spaces. For any truncated, partially visible, or occluded text, include only the visible portions without attempting to complete or guess the full text. If no text is present, return empty double quotes."
Found in: https://github.com/video-db/ocr-benchmark/blob/main/prompts....
Muzero-unplugged
PyTorch implementation of MuZero Unplugged for Gym environments. The algorithm supports a wide range of action and observation spaces, both discrete and continuous.
Paper-Recommendation-System
A web interface for searching arXiv papers, built with Sentence-Transformers, Faiss, and Streamlit.
Muzero
PyTorch implementation of MuZero for Gym environments. It supports any Discrete, Box, and Box2D configuration for the action and observation spaces. (by DHDev0)
ailert
An open-source platform that aggregates AI content from 230+ sources including research papers, GitHub trends, and industry news, making AI knowledge accessible to everyone.
Project mention: Building an Open-Source AI Newsletter Engine: The Story of AiLert | dev.to | 2025-01-12
Code: https://github.com/anuj0456/ailert
Docs: https://github.com/anuj0456/ailert/blob/main/README.md
Python Arxiv related posts
- ArXiv LaTeX Cleaner: Clean the LaTeX code of your paper to submit to ArXiv
- My Struggle with Doom Scrolling
- Hardware Acceleration of LLMs: A comprehensive survey and comparison
- Show HN: FileKitty – Combine and label text files for LLM prompt contexts
- Show HN: Command Line Data Aggregation Tool for LLM Ingestion
- Show HN: Talk to any ArXiv paper just by changing the URL
- Ask HN: AI/ML papers to catch up with current state of AI?
A note from our sponsor - Nutrient
nutrient.io | 15 Mar 2025
Index
What are some of the best open-source Arxiv projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | ChatPaper | 18,769 |
2 | arxiv-latex-cleaner | 5,938 |
3 | arxiv-vanity | 1,619 |
4 | arxiv.py | 1,209 |
5 | arxiv-sanity-lite | 1,191 |
6 | resp | 403 |
7 | paper2remarkable | 356 |
8 | ArxivDigest | 348 |
9 | summarizepaper | 271 |
10 | findpapers | 248 |
11 | bibcure | 200 |
12 | searchthearxiv | 143 |
13 | pdf2doi | 114 |
14 | Auto-Research | 57 |
15 | cobib | 56 |
16 | ocr-benchmark | 31 |
17 | Muzero-unplugged | 27 |
18 | Paper-Recommendation-System | 20 |
19 | Muzero | 17 |
20 | ailert | 13 |
21 | neozot-py | 7 |