Zephyr 141B, a Mixtral 8x22B fine-tune, is now available in Hugging Chat

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  • text-generation-inference

    Large Language Model Text Generation Inference

  • Zephyr 141B is a Mixtral 8x22B fine-tune. Here are some interesting details:

    - Base model: Mixtral 8x22B, 8 experts, 141B total params, 35B activated params

    - Fine-tuned with ORPO (Odds Ratio Preference Optimization), a new alignment algorithm with no separate SFT step (hence much faster than DPO/PPO)

    - Trained on 7K open data instances (high-quality, synthetic, multi-turn)

    - License: Apache 2.0

    Everything is open:

    - Final Model: https://huggingface.co/HuggingFaceH4/zephyr-orpo-141b-A35b-v...

    - Base Model: https://huggingface.co/mistral-community/Mixtral-8x22B-v0.1

    - Fine-tune data: https://huggingface.co/datasets/argilla/distilabel-capybara-...

    - Recipe/code to train the model: https://huggingface.co/datasets/argilla/distilabel-capybara-...

    - Open-source inference engine: https://github.com/huggingface/text-generation-inference

    - Open-source UI code: https://github.com/huggingface/chat-ui

    Have fun!
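
For readers wondering how ORPO can skip a separate SFT stage: as described in the ORPO paper, the objective adds an odds-ratio penalty to the usual supervised loss, so a single training pass covers both. The following is a rough sketch of that per-pair loss in plain Python, not the training code used for Zephyr 141B. The function names, the scalar inputs (mean per-token log-likelihoods of the chosen and rejected responses), and the weight `lam=0.1` are illustrative assumptions.

```python
import math


def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) where p = exp(avg_logp); requires avg_logp < 0."""
    # log1p(-exp(x)) computes log(1 - p) without losing precision for small p.
    return avg_logp - math.log1p(-math.exp(avg_logp))


def softplus(x: float) -> float:
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def orpo_loss(logp_chosen: float, logp_rejected: float, lam: float = 0.1) -> float:
    """ORPO-style loss for one preference pair (illustrative sketch).

    logp_chosen / logp_rejected: length-normalized (mean per-token)
    log-likelihoods of the preferred and dispreferred responses.
    lam: weight of the odds-ratio term (hypothetical default).
    """
    # Supervised term: negative log-likelihood of the chosen response.
    sft_term = -logp_chosen
    # Odds-ratio term: -log sigmoid(log odds(chosen) - log odds(rejected)).
    log_odds_ratio = log_odds(logp_chosen) - log_odds(logp_rejected)
    or_term = softplus(-log_odds_ratio)
    return sft_term + lam * or_term
```

The loss drops when the chosen response becomes more likely than the rejected one, which is what lets the preference signal ride along with ordinary supervised fine-tuning instead of requiring a second stage.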

  • chat-ui

    Open source codebase powering the HuggingChat app

Related posts

  • LocalPilot: Open-source GitHub Copilot on your MacBook

    6 projects | news.ycombinator.com | 19 Oct 2023
  • [P] What are the latest "out of the box solutions" for deploying the very large LLMs as API endpoints?

    3 projects | /r/MachineLearning | 23 Feb 2023
  • Hugging Face reverts the license back to Apache 2.0

    1 project | news.ycombinator.com | 8 Apr 2024
  • HuggingFace text-generation-inference is reverting to Apache 2.0 License

    2 projects | news.ycombinator.com | 8 Apr 2024
  • AI Code assistant for about 50-70 users

    4 projects | /r/LocalLLaMA | 6 Dec 2023