The Dual LLM pattern for building AI assistants that can resist prompt injection

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • SurveyJS - Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • NeMo-Guardrails

    NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

  • Here's "jailbreak detection", in the NeMo-Guardrails project from Nvidia:

    https://github.com/NVIDIA/NeMo-Guardrails/blob/327da8a42d5f8...

    I.e. they ask the llm if the prompt will break the llm. (I believe that more data /some evaluation on how well this performs is intended to be released. Probably fair to call this stuff "not battle tested".)

  • transynthetical-engine

    Applied methods of analytical augmentation to build tools using large-language models.

  • I think the two-layer approach is worthwhile if only for limiting tokens!

    Here’s an example of what I mean:

    https://github.com/williamcotton/transynthetical-engine#brow...

    By keeping the main discourse between the user and the LLM from containing all of the generated code and instead just using that main “thread” to orchestrate instructions to write code it allows for more back-and-forth.

    It’s a good technique in general!

    I’m still too paranoid to execute instructions via email without a very limited set of abilities!

  • SurveyJS

    Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

    SurveyJS logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts