C++ inverted-index

Open-source C++ projects categorized as inverted-index

Top 4 C++ inverted-index Projects

  • pisa

    PISA: Performant Indexes and Search for Academia

  • Project mention: A Compressed Indexable Bitset | news.ycombinator.com | 2023-07-01

    The EF core algorithm implemented in folly [3] may be a bit faster, and implementing partitioning on top of that is relatively easy.

    It would definitely compress much better than roaring bitmaps. In terms of performance, it depends on the access patterns. If very sparse (large jumps) PEF would likely be faster, if dense (visit a large fraction of the bitmap) it'd be slower.

    It is possible to squeeze a bit more compression out of PEF by introducing a chunk type for Elias-Fano of the chunk complement (for very dense chunks), but you lose the operation of skipping to a given position, which is however not needed in inverted indexes (you only need to skip past a given id, and that can be supported efficiently). That is not mentioned in the paper because at the time I thought the skip-to-position operation was a non-negotiable.

    [1] https://github.com/ot/ds2i/

    [2] https://github.com/pisa-engine/pisa

    [3] https://github.com/facebook/folly/blob/main/folly/experiment...

  • MeTA

    A Modern C++ Data Sciences Toolkit (by meta-toolkit)

  • ds2i

    A library of inverted index data structures

  • interpolative_coding

    A flexible and efficient C++ implementation of the Binary Interpolative Coding algorithm.

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).
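
The project mention quoted above distinguishes skipping to a given position from skipping past a given id, and notes that inverted indexes only need the latter. As a rough illustration of that operation, here is a minimal, unoptimized sketch of plain (unpartitioned) Elias-Fano in C++17. The class name EliasFanoSketch and its next_geq method are hypothetical names chosen for this example; they are not the API of ds2i, PISA, or folly. The sketch scans the high bits linearly, whereas real implementations would typically use select structures or forward pointers to jump ahead, and it assumes the input ids are sorted and smaller than the given universe.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>
#include <vector>

// Illustrative Elias-Fano encoding of a sorted id list (hypothetical class,
// not taken from any of the libraries listed above).
class EliasFanoSketch {
public:
    // `ids` must be sorted in non-decreasing order and every id must be < universe.
    EliasFanoSketch(const std::vector<uint64_t>& ids, uint64_t universe)
        : n_(ids.size()), low_bits_(0) {
        if (n_ == 0) return;
        assert(ids.back() < universe);
        // low_bits_ = floor(log2(universe / n)), or 0 for dense lists.
        const uint64_t ratio = universe / n_;
        while (low_bits_ < 62 && (ratio >> (low_bits_ + 1)) != 0) ++low_bits_;

        // High parts are stored in unary: element i sets bit (high_i + i).
        high_.assign(n_ + (universe >> low_bits_) + 1, false);
        for (std::size_t i = 0; i < n_; ++i) {
            high_[(ids[i] >> low_bits_) + i] = true;
            // Low parts are stored verbatim, low_bits_ bits per element.
            for (unsigned b = 0; b < low_bits_; ++b)
                low_.push_back(((ids[i] >> b) & 1u) != 0);
        }
    }

    // Smallest stored id >= x, or std::nullopt if every id is smaller.
    // This is the "skip past a given id" operation; it never needs to
    // answer "which id is at position k".
    std::optional<uint64_t> next_geq(uint64_t x) const {
        const uint64_t target_high = x >> low_bits_;
        uint64_t zeros = 0;  // zeros seen so far = high part of the current bucket
        uint64_t index = 0;  // ones seen so far = index of the next element
        for (std::size_t pos = 0; pos < high_.size() && index < n_; ++pos) {
            if (!high_[pos]) { ++zeros; continue; }
            if (zeros >= target_high) {
                const uint64_t value = (zeros << low_bits_) | low(index);
                if (value >= x) return value;
            }
            ++index;
        }
        return std::nullopt;
    }

    std::size_t size() const { return n_; }

private:
    // Reassemble the low bits of element i.
    uint64_t low(uint64_t i) const {
        uint64_t v = 0;
        for (unsigned b = 0; b < low_bits_; ++b)
            v |= static_cast<uint64_t>(low_[i * low_bits_ + b]) << b;
        return v;
    }

    std::size_t n_;
    unsigned low_bits_;
    std::vector<bool> high_;  // unary-coded high parts
    std::vector<bool> low_;   // concatenated low parts
};
```

With the ids 3, 4, 7, 13, 14, 15, 21, 43 and a universe of 64, each id is split into 3 low bits and a unary-coded high part. Calling next_geq(16) on that list skips the first two buckets entirely and only reconstructs values from the bucket whose high part is at least 16 >> 3.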

Index

What are some of the best open-source inverted-index projects in C++? This list will help you:

#  Project               Stars
1  pisa                  855
2  MeTA                  684
3  ds2i                  141
4  interpolative_coding  27
