Yahoo is making a return to search

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  • Read the Docs

    The source code that powers readthedocs.org

  • That "/*/tree" rule means that search engine crawlers are allowed to hit the README file of a repo but effectively NONE of the other files in it.

    Which means that if you keep your project documentation on GitHub in a docs/ folder it won't be indexed!

    You need to publish it to a separate site via GitHub Pages, or use https://readthedocs.org/

  • React

    The library for web and native user interfaces.

  • /*/tree is only for directory listings. File contents will be under a /blob/ path, e.g. https://github.com/facebook/react/blob/main/AUTHORS, and should be, AFAIK, indexable.

    (mandatory disclaimer: I'm a GitHub employee, not speaking on behalf of the company)
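
    The disagreement above comes down to which URL paths a "/*/tree" pattern actually covers. Below is a minimal sketch of wildcard matching for a robots.txt path rule. It assumes the convention documented by major crawlers: "*" matches any run of characters, a trailing "$" anchors the end, and everything else is a prefix match. The function name rule_matches and the /tree/main/docs path are hypothetical, and the rule text is taken from the comment rather than from GitHub's live robots.txt.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern covers the given URL path.

    Assumed semantics: '*' matches any sequence of characters, a trailing
    '$' anchors the match at the end of the path, and without '$' the
    pattern only has to match a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start of the path, which gives prefix matching.
    return re.match(regex, path) is not None


rule = "/*/tree"  # pattern quoted in the comment above

# Hypothetical directory listing inside a repo: covered by the rule.
print(rule_matches(rule, "/facebook/react/tree/main/docs"))
# File view from the reply above: not covered by this rule.
print(rule_matches(rule, "/facebook/react/blob/main/AUTHORS"))
# Repo landing page (where the README is rendered): not covered.
print(rule_matches(rule, "/facebook/react"))
```

    Under these assumptions only the directory-listing path is caught by the rule, which is consistent with the reply that /blob/ file pages are not excluded by it. Whether those pages end up indexed depends on the rest of the robots.txt file and on the crawler, which this sketch does not model.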

  • kafka

    A high-throughput, distributed, publish-subscribe messaging system (by a0x8o)

  • Would you mind providing details like the search query and link to the page you expect to be found?

    To test your hypothesis, I did a basic search for exact matches on "we do not synchronize on the update of the broker node" and Google returned 2 search results in 240ms:

    - https://github.com/a0x8o/kafka/blob/master/core/src/main/sca...

    - https://jar-download.com/artifacts/org.apache.kafka/kafka_2....

    Both contain exactly the source code from GitHub that I was looking for. You'll notice that the first result is actually a0x8o's fork of apache/kafka. Google states that some entries very similar to the 2 already displayed were omitted, and I'm able to remove that filter. With that filter removed, I can see the same document indexed from apache/kafka on GitHub.

    There's nothing I can do or promise directly, but I can assure you that Google takes the quality of our search results very seriously. If you believe we're not delivering quality results, I strongly encourage you to click that "Send Feedback" link at the bottom of your results so that our teams can act upon your feedback.

    Disclosure: I work on Search at Google.

    Disclaimer: The words, views, and opinions expressed in this post are my own. They do not represent my employer in any capacity.

  • community

    Public feedback discussions for: GitHub Mobile, GitHub Discussions, GitHub Codespaces, GitHub Sponsors, GitHub Issues and more! (by community)

  • I asked about this on the support forum a while ago and never got a satisfactory response: https://github.com/community/community/discussions/20958

  • Puts Debuggerer

    Ruby library for improved puts debugging, automatically displaying bonus useful information such as source line number and source code.

  • > Google states that some entries very similar to the 2 already displayed were omitted, and I'm able to remove that filter.

    I've definitely seen that sort of thing before, but there is no such link there at the moment, at least not when searching from my iPhone, whether or not I'm in desktop mode. I just see a large error box that says "It looks like there aren't many great matches for your search" followed by the link to the a0x8o fork.

    By the way, the a0x8o result highlights a serious problem with search results: the GitHub URL is strangely modified. Instead of showing the full URL, or even a prefix leading up to it, Google is selecting parts of the URL, showing "https://github.com > src > transaction" on mobile and "https://github.com > kafka > coordinator > transaction" when I request the desktop site. In neither case is it obvious that the content isn't the canonical source from Apache. I've noticed this middle-out truncation for GitHub URLs before, but I'm not sure when it started.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.
