Mistral CEO confirms 'leak' of new open source AI model nearing GPT4 performance

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • Apollo-11

    Original Apollo 11 Guidance Computer (AGC) source code for the command and lunar modules.

  • I often like to think about https://github.com/chrislgarry/Apollo-11 as an analogy. It's public domain with available source, in the assembly language in which it was written... so it fills all the definitions of OSS!

    But the process by which that code arose, the ability to modify any line and understand its impact (heh) on a real execution environment, is dependent on a massive process that required billions of dollars and thousands of the smartest people on the planet. For all intents and purposes, without that environment, it is as reliably modifiable as an executable binary in any other context - or a set of weights, in this one!

  • virtualagc

    Virtual Apollo Guidance Computer (AGC) software

  • I don't think that's a great example.

    For instance, I can step through and even modify that code using tooling like AGC emulators like this one http://www.ibiblio.org/apollo/#gsc.tab=0

    What makes it open source is access to the same level of source access that the original developers worked in.

    That's what's missing here. Mistral's engineers do not simply open this binary in their editor to do their job.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • ollama

    Get up and running with Llama 3, Mistral, Gemma, and other large language models.

  • The authors don't seem to care about the principle of least privilege: https://github.com/ollama/ollama/issues/851#issuecomment-177...

    It makes me wonder what other security issues they might now care about.

  • vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

  • FYI, vLLM also just added experiment multi-lora support: https://github.com/vllm-project/vllm/releases/tag/v0.3.0

    Also check out the new prefix caching, I see huge potential for batch processing purposes there!

  • Awesome-LLM-Watermark

    UP-TO-DATE LLM Watermark paper. 🔥🔥🔥

  • You can either modify the model weights in a way that doesn't cause any real differences (change a few bits somewhere should be enough), or you could watermark the actual text output.

    Here's a list of research for watermarking LLMs.

    https://github.com/hzy312/Awesome-LLM-Watermark?tab=readme-o...

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts