Llama-2-Onnx Alternatives

Similar projects and alternatives to Llama-2-Onnx

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better Llama-2-Onnx alternative or higher similarity.

Llama-2-Onnx reviews and mentions

Posts with mentions or reviews of Llama-2-Onnx. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-09-12.
  • Show HN: Fine-tune your own Llama 2 to replace GPT-3.5/4
    8 projects | news.ycombinator.com | 12 Sep 2023
    System: Here's some docs, answer concisely in a sentence.

    YMMV on cost still, since it depends on the cloud vendor, and my intuition and viewpoint agree with yours: GPT-3.5 is priced low enough that there isn't a case where it makes sense to use another model.

    It strikes me now that it's _very_ likely, and not just our intuition, that OpenAI's $/GPU hour is <= any other vendor's.

    The next big step will come from formalizing the stuff rolling around the local LLM community. For months now it has been mostly one-off $X.c stunts that run on desktop, while the vast majority of the _actual_ usage and progress comes from porn-y stuff, as with all nascent tech.

    Microsoft has LLaMa-2 ONNX available on GitHub[1]. There are budding but very small projects in different languages to wrap ONNX. Once there's a genuine cross-platform[2] ONNX wrapper that makes running LLaMa-2 easy, there will be a step change. It'll be "free"[3] to run your fine-tuned model that does as well as GPT-4.

    It's not clear to me exactly when this will occur. It's "difficult" now, but only because the _actual usage_ in the local LLM community doesn't have a reason to invest in ONNX, and it's extremely intimidating to figure out exactly how to get LLaMa-2 running in ONNX. Microsoft kinda threw it up on GitHub and moved on; the sample code even still needs a PyTorch model. I see at least one very small company on HuggingFace that _may_ have figured out full ONNX.

    [1] https://github.com/microsoft/Llama-2-Onnx

  • FLaNK Stack Weekly for 14 Aug 2023
    32 projects | dev.to | 14 Aug 2023
  • Llama 2 on ONNX runs locally
    5 projects | news.ycombinator.com | 10 Aug 2023

Stats

Basic Llama-2-Onnx repo stats
Mentions: 3
Stars: 987
Activity: 6.7
Last commit: 4 months ago
