Facebook open sources Glean: a scalable code search and query engine

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  1. Glean

    System for collecting, deriving and working with facts about source code.

    Kythe has one schema, whereas with Glean each language has its own schema with arbitrary amounts of language-specific detail. You can get a language-agnostic view by defining an abstraction layer as a schema. Our current (work in progress) language-agnostic layer is called "codemarkup" https://github.com/facebookincubator/Glean/blob/main/glean/s...

    There are various ways to wire up the indexer; the right one depends very much on the language and the build system. For Flow, for example, Glean output is built into the typechecker: you run it with some flags to emit the Glean data. For C++, you need to get the compiler flags from the build system to pass to the Clang frontend. For Java, the indexer is a compiler plugin; for Python, it's built on libCST. Some indexers send their data directly to a Glean server; others generate JSON files that are sent using a separate command-line tool.

    References use different methods depending on the language. For Flow, for example, there is a fact for an import that matches up with a fact for the export in the other file. For C++, there are facts that connect declarations with definitions, and references with declarations.
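
    The C++ case described above, where facts link references to declarations and declarations to definitions, can be illustrated with a small toy model. This is only a sketch: the class names, fields, file names, and helper functions below are hypothetical and are not Glean's schema, predicates, or query API.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical fact types for illustration only; not Glean predicates.

@dataclass(frozen=True)
class Declaration:
    name: str
    file: str
    line: int

@dataclass(frozen=True)
class Definition:
    declaration: Declaration  # fact: this definition defines that declaration
    file: str
    line: int

@dataclass(frozen=True)
class Reference:
    target: Declaration  # fact: this use site refers to that declaration
    file: str
    line: int

def find_definition(ref: Reference, definitions: List[Definition]) -> Optional[Definition]:
    """Follow reference -> declaration -> definition; None if no definition fact exists."""
    for definition in definitions:
        if definition.declaration == ref.target:
            return definition
    return None

def find_references(decl: Declaration, references: List[Reference]) -> List[Reference]:
    """Return every reference fact that points at the given declaration."""
    return [r for r in references if r.target == decl]

# Made-up example data: one function declared in a header, defined in a
# source file, and used from two other files.
decl = Declaration("parse", "parser.h", 12)
defn = Definition(decl, "parser.cpp", 40)
refs = [Reference(decl, "main.cpp", 7), Reference(decl, "tool.cpp", 21)]
```

    In this toy model, "go to definition" is `find_definition(refs[0], [defn])` and "find references" is `find_references(decl, refs)`. A real fact store would answer both with indexed queries rather than linear scans.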

  3. linguist

    Language Savant. If your repository's language is being reported incorrectly, send us a pull request!

    GitHub's linguist library can be used to identify the programming language of a single file: https://github.com/github/linguist#single-file

  4. glean

    Modern cross-platform telemetry (by mozilla)

  5. livegrep

    Interactively grep source code. Source for http://livegrep.com/

    If you've not had to deal with a codebase that takes VSCode longer than a few minutes to index, then you're probably outside their initial target market. If you've not had to set up a hosted code search tool (e.g. livegrep https://github.com/livegrep/livegrep ) because there's just too much code,


Related posts

  • GitHub's Language Analysis System Is Configurable

    1 project | news.ycombinator.com | 27 Mar 2024
  • Change F# Color on GitHub

    1 project | news.ycombinator.com | 1 Feb 2024
  • TIL: Github seems to recognize ebuilds as a format. Is this a new github feature? Or has this been here since forever?

    2 projects | /r/Gentoo | 15 Jun 2023
  • Debugging fontification error while writing a new mode

    3 projects | /r/emacs | 18 Dec 2022
  • Show/set repository language

    1 project | /r/github | 16 Oct 2022
