Zephyr 141B, a Mixtral 8x22B fine-tune, is now available in Hugging Chat

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  1. text-generation-inference

    Large Language Model Text Generation Inference

    Zephyr 141B is a Mixtral 8x22B fine-tune. Here are some interesting details:

    - Base model: Mixtral 8x22B, 8 experts, 141B total params, 35B activated params

    - Fine-tuned with ORPO, a new alignment algorithm that needs no separate SFT step (hence much faster than DPO/PPO)

    - Trained on 7K open data instances (high-quality, synthetic, multi-turn)

    - License: Apache 2.0
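
To make the ORPO bullet above more concrete, here is a minimal sketch of the per-example objective as described in the ORPO paper: the usual language-modelling loss on the preferred answer plus a weighted odds-ratio penalty, which is why no separate SFT stage is needed. The function names, the scalar inputs, and the `lam` value are illustrative assumptions, not the recipe used to train Zephyr 141B.

```python
import math


def log_odds(avg_logprob: float) -> float:
    """Return log(p / (1 - p)) given log p, assuming 0 < p < 1."""
    p = math.exp(avg_logprob)
    return avg_logprob - math.log1p(-p)


def orpo_loss(logp_chosen: float, logp_rejected: float, lam: float = 0.1) -> float:
    """Sketch of the ORPO loss for one preference pair.

    logp_chosen / logp_rejected are length-normalised log-likelihoods of the
    preferred and rejected responses (hypothetical inputs for illustration).
    lam weights the odds-ratio term; 0.1 is an arbitrary illustrative default.
    """
    # SFT-style term: negative log-likelihood of the preferred response.
    nll = -logp_chosen
    # Log of the odds ratio between preferred and rejected responses.
    log_odds_ratio = log_odds(logp_chosen) - log_odds(logp_rejected)
    # -log(sigmoid(x)) written as log(1 + exp(-x)).
    odds_ratio_term = math.log1p(math.exp(-log_odds_ratio))
    return nll + lam * odds_ratio_term
```

With this form, a pair where the model already prefers the chosen response yields a lower loss than the same pair with the probabilities swapped.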

    Everything is open:

    - Final Model: https://huggingface.co/HuggingFaceH4/zephyr-orpo-141b-A35b-v...

    - Base Model: https://huggingface.co/mistral-community/Mixtral-8x22B-v0.1

    - Fine-tune data: https://huggingface.co/datasets/argilla/distilabel-capybara-...

    - Recipe/code to train the model: https://huggingface.co/datasets/argilla/distilabel-capybara-...

    - Open-source inference engine: https://github.com/huggingface/text-generation-inference

    - Open-source UI code: https://github.com/huggingface/chat-ui

    Have fun!
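
For readers who want to try the inference engine linked above, the following is a rough sketch of calling a text-generation-inference server through its `/generate` endpoint using only the Python standard library. It assumes a server is already running; the base URL `http://localhost:8080`, the helper names, and the default `max_new_tokens` are hypothetical choices for illustration.

```python
import json
import urllib.request


def build_generate_request(base_url: str, prompt: str,
                           max_new_tokens: int = 64) -> urllib.request.Request:
    """Build a POST request for the text-generation-inference /generate endpoint."""
    payload = {
        "inputs": prompt,
        "parameters": {"max_new_tokens": max_new_tokens},
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(base_url: str, prompt: str, max_new_tokens: int = 64) -> str:
    """Send the request and return the generated text (needs a running server)."""
    request = build_generate_request(base_url, prompt, max_new_tokens)
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["generated_text"]
```

For example, `generate("http://localhost:8080", "What is ORPO?")` would return the completion as a string, provided a server is listening at that assumed address.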

  2. chat-ui

    Open source codebase powering the HuggingChat app



Related posts

  • LocalPilot: Open-source GitHub Copilot on your MacBook

    6 projects | news.ycombinator.com | 19 Oct 2023
  • [P] What are the latest "out of the box solutions" for deploying the very large LLMs as API endpoints?

    3 projects | /r/MachineLearning | 23 Feb 2023
  • Hugging Face reverts the license back to Apache 2.0

    1 project | news.ycombinator.com | 8 Apr 2024
  • HuggingFace text-generation-inference is reverting to Apache 2.0 License

    2 projects | news.ycombinator.com | 8 Apr 2024
  • AI Code assistant for about 50-70 users

    4 projects | /r/LocalLLaMA | 6 Dec 2023