Show HN: Transform Your Codebase into a Single Markdown Doc for Feeding into AI

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  1. repomix

    📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.

  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

  3. gitingest

    Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase
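
The substitution described above can be sketched as a small Python helper. The function name `to_gitingest_url` is hypothetical, and the sketch assumes the input is a `github.com` URL; it only rewrites the host and does not contact the service.

```python
from urllib.parse import urlsplit, urlunsplit


def to_gitingest_url(github_url: str) -> str:
    """Rewrite a github.com URL so the host reads gitingest.com instead."""
    parts = urlsplit(github_url)
    host = parts.netloc.lower()
    if host not in ("github.com", "www.github.com"):
        raise ValueError(f"not a github.com URL: {github_url}")
    # Only the host changes; path, query, and fragment are kept as-is.
    return urlunsplit(parts._replace(netloc="gitingest.com"))
```

For example, `to_gitingest_url("https://github.com/tesserato/CodeWeaver")` yields the same path under the `gitingest.com` host.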

  4. llmcat

    Prepare files and directories for LLM consumption

  5. feed-llm

    A TUI-based file selection tool for feeding code to LLMs.

    This is self-promotional, but https://github.com/nahco314/feed-llm has a TUI for choosing what to give to the LLM. There are many similar tools out there, but I think this approach is relatively effective for larger codebases.

  6. ts-module-graph

    Library for constructing a module graph from a set of TypeScript files

    I really want a tool like this that can extract a function and its dependency graph (to a certain depth maybe, and/or exclude node_modules).

    I wrote this library [1] and hope to add the fine-grained "reference resolution" utility to it at some point, which could make implementing such a tool a lot simpler.

    [1]: https://github.com/aleclarson/ts-module-graph
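
The tool wished for in the comment above (a dependency graph limited to a certain depth, with node_modules excluded) can be sketched as a depth-limited breadth-first traversal. This is not the ts-module-graph API: the `graph` dictionary mapping each module to its imports and the function name `dependencies_within` are hypothetical, chosen only for illustration.

```python
from collections import deque


def dependencies_within(graph, start, max_depth, exclude="node_modules"):
    """Return modules reachable from `start` within `max_depth` import hops.

    The result maps each module path to its distance from `start`.
    """
    seen = {start: 0}
    queue = deque([start])
    while queue:
        module = queue.popleft()
        depth = seen[module]
        if depth == max_depth:
            continue  # do not expand past the depth limit
        for dep in graph.get(module, []):
            if exclude in dep.split("/"):
                continue  # skip third-party packages
            if dep not in seen:
                seen[dep] = depth + 1
                queue.append(dep)
    return seen
```

Because the traversal is breadth-first, each module is recorded at its shortest distance from the starting module.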

  7. thisismy

  8. codefetch

    Turn code into Markdown for LLMs with one simple terminal command

    There is also https://github.com/regenrek/codefetch which I personally like

  9. InfluxDB

    InfluxDB high-performance time series database. Collect, organize, and act on massive volumes of high-resolution data to power real-time intelligent systems.

  10. grimoire

    A lightweight tool that converts directory contents into structured output optimized for LLM interpretation, featuring Git-aware file ordering, secret detection/redaction, token counting, and customizable filtering. (by foresturquhart)

    I made a similar tool in Golang, https://github.com/foresturquhart/grimoire. It tries to be a bit cleverer, by prioritising files that have had many commits, respecting .gitignore files, and excluding useless content like binaries or vector images.
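
The idea of prioritising files that have had many commits can be sketched in Python as follows. This is an illustration only, not grimoire's implementation (which is written in Go): the function name `rank_by_commit_count` is hypothetical, and it assumes its input is the text printed by `git log --name-only --pretty=format:`, which lists the paths touched by each commit.

```python
from collections import Counter


def rank_by_commit_count(log_output: str) -> list[str]:
    """Order paths by how many commits touched them, most-changed first."""
    counts = Counter(
        line.strip() for line in log_output.splitlines() if line.strip()
    )
    # Sort by descending count, then by path so ties are deterministic.
    return sorted(counts, key=lambda path: (-counts[path], path))
```

Respecting .gitignore and skipping binaries, as the comment describes, would be separate filtering steps applied before or after this ranking.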

  11. yek

    A fast Rust-based tool to serialize text-based files in a repository or directory for LLM consumption

  12. code2prompt

    A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.

    From a rough glance it looks pretty similar to another tool that I've been using https://github.com/mufeedvh/code2prompt

  13. techdocs

    A Rust-based tool for generating technical documentation from codebases, with support for AI-powered README generation.

    I literally just wrote something similar in Rust called techdocs [1], which uses Claude to generate a README. It includes an API and a CLI.

    [1] https://github.com/thesurlydev/techdocs

  14. bundle-codebases

    I built a simple tool to do something similar. It's meant for a monorepo and will build each subfolder into a text file (subfolder-code.txt) that you can upload to AIs.

    https://github.com/manfrin/bundle-codebases

    I don't see much merit in things like markdown or syntax highlighting as that's just extra noise for the AI. My script tries to cut down on any extraneous data since the things I'm working on are near the context limit of consumer AIs.

    My script also ignores anything in .gitignore and will take a .codebundlerwhitelist (I hate this name and have meant to change it) to only bundle files matching patterns you specify.

  15. ggrab

    A manual context builder for pasting into LLMs

    I'll be the 10th person to add, I made something like this too! https://github.com/keizo/ggrab

  16. globcat.sh

    Unix shell script to concatenate files, with prepended file paths (to feed them to ChatGPT, and the like).

    I have something similar.

    https://github.com/jzombie/globcat.sh

    Nothing fancy, but gets the job done.
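
The approach described above, concatenating files with each file's path prepended, can be sketched in Python. This is not the globcat.sh script itself: the function name `concat_with_paths` and the `==> path <==` header format are assumptions made for illustration.

```python
from pathlib import Path


def concat_with_paths(root: str, pattern: str = "**/*") -> str:
    """Concatenate matching text files, each preceded by its relative path."""
    base = Path(root)
    chunks = []
    for path in sorted(base.glob(pattern)):
        if not path.is_file():
            continue
        try:
            text = path.read_text(encoding="utf-8")
        except UnicodeDecodeError:
            continue  # skip files that are not valid UTF-8 text
        header = f"==> {path.relative_to(base).as_posix()} <=="
        chunks.append(f"{header}\n{text}")
    return "\n".join(chunks)
```

Calling `concat_with_paths(".", "**/*.py")` would return every Python file under the current directory as one string, ready to paste into a chat window.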

  17. ai-context

    CLI tool to produce MD context files from many sources, to help interact with LLMs (ChatGPT, Llama3, Claude, etc.).

    Nice! I made something that converts codebases (local and GitHub URLs), as well as YouTube videos (transcripts) and blog posts, to Markdown.

    https://github.com/tanq16/ai-context

  18. your-source-to-prompt.html

    Quickly and securely turn your code projects into LLM prompts, all locally on your own machine!

    If you want to be able to select certain files quickly and visually, and work with private repos or just local files, try this open source tool I made:

    https://github.com/Dicklesworthstone/your-source-to-prompt.h...

  20. copcon

    Create a prompt with entire code project including file tree and content directly to clipboard for use with LLMs

    I’ve made a CLI tool that does something similar, called Copcon:

    https://github.com/kasperjunge/copcon

    Point it at a code project directory to get a file tree and content, optionally with a git diff, copied to the clipboard - ready for copy pasting into ChatGPT.

    It is very true that this only works for small projects, as you will bloat the LLM’s context with large codebases.

    My solution to this is two files you can use to steer the tool’s behavior:

    - .copconignore: For ignoring specific files and directories.

    - .copcontarget: For targeting specific files and directories (applied before .copconignore).

    These two files provide great control over what to include and exclude in the copied context.
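
The ordering described above, targets first and ignores second, can be sketched in Python. This is not Copcon's code: the function name `select_files` is hypothetical, and the sketch assumes simple shell-style wildcard patterns via `fnmatch`, which may differ from the pattern syntax the tool actually accepts.

```python
from fnmatch import fnmatch


def select_files(paths, targets, ignores):
    """Keep paths matching a target pattern, then drop ignored ones."""

    def matches(path, patterns):
        return any(fnmatch(path, pattern) for pattern in patterns)

    # An empty target list means every path is a candidate.
    targeted = [p for p in paths if not targets or matches(p, targets)]
    # Ignore patterns are applied second, so they can trim the targeted set.
    return [p for p in targeted if not matches(p, ignores)]
```

With targets `["src/*"]` and ignores `["*_test.py"]`, only non-test files under `src/` survive both passes.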

  21. typst

    A new markup-based typesetting system that is powerful and easy to learn.

    I've never used Quarto, but I might give it a go someday. I currently have a convoluted workflow for generating math-heavy documents that involves generating equations using SymPy in a notebook, accumulating them in a string, and ultimately dumping the string into a Markdown file. I would love to simplify this sooner rather than later. I'm also keeping an eye on https://typst.app/ and hoping for a sane alternative to LaTeX to emerge.

  22. Describe

    A tool for describing your codebase.

    A better alternative, which uses a .gitignore-like file to ignore specific files: https://github.com/rodlaf/describe

  23. CodeWeaver

    Weave your codebase into a single, navigable Markdown document

    Updated the project readme with links to (hopefully) all alternatives listed here. Feel free to add new ones via pull requests.

    https://github.com/tesserato/CodeWeaver

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • Gemini 2.5 Pro vs. Claude 3.7 Sonnet: Coding Comparison

    10 projects | news.ycombinator.com | 31 Mar 2025
  • Show HN: CLI tool to generate context files for LLMs

    1 project | news.ycombinator.com | 3 Feb 2025
  • Mufeedvh/code2prompt:A CLI tool to convert your codebase in a single LLM prompt

    1 project | news.ycombinator.com | 16 Jan 2025
  • Ask HN: How do you load your code base as context window in ChatGPT?

    1 project | news.ycombinator.com | 7 Jan 2025
  • Code2prompt

    1 project | news.ycombinator.com | 16 Dec 2024
