| | nebuly | hh-rlhf |
|---|---|---|
| Mentions | 105 | 6 |
| Stars | 8,363 | 1,447 |
| Growth | 0.1% | 2.5% |
| Activity | 8.4 | 3.6 |
| Latest Commit | 7 months ago | 8 months ago |
| Language | Python | - |
| License | Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
nebuly
- Nebuly – The LLM Analytics Platform
- Ask HN: Any tools or frameworks to monitor the usage of OpenAI API keys?
- What are you building with LLMs? I'm writing an article about what people are building with LLMs
Hi everyone. I'm the creator of ChatLLaMA https://github.com/nebuly-ai/nebullvm/tree/main/apps/accelerate/chatllama, an open-source framework for training LLMs with limited resources. There's been amazing usage of LLMs lately, from chatbots that retrieve information about a company's products, to cooking assistants for traditional dishes, and much more. And you? What are you building, or would you love to build, with LLMs? Let me know and I'll share the article about your stories soon. https://qpvirevo4tz.typeform.com/to/T3PruEuE Cheers
- Show HN: ChatLLaMA – A ChatGPT style chatbot for Facebook's LLaMA
How does it differ from the original ChatLLaMA? https://github.com/nebuly-ai/nebullvm/tree/main/apps/acceler...
- 🤖🌟 Unlock the Power of Personal AI: Introducing ChatLLaMA, Your Custom Personal Assistant! 🚀💬
Was this made with the ChatLLaMA library? https://github.com/nebuly-ai/nebullvm/tree/main/apps/accelerate/chatllama
- Meta LLM LLaMA leaked, all over the internet as we speak
- Meta LLM LLAMA leaked, it's all over the internet as we speak.
- Meta LLM LLAMMA leaked, it's all over the internet as we speak.
- Plug-and-play modules to optimize the performance of your AI systems
Some of the available modules include:
Speedster: Automatically apply the best set of SOTA optimization techniques to achieve the maximum inference speed-up on your hardware. https://github.com/nebuly-ai/nebullvm/blob/main/apps/acceler...
Nos: Automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elastic quotas. https://github.com/nebuly-ai/nos
ChatLLaMA: Build a faster and cheaper ChatGPT-like training process based on LLaMA architectures. https://github.com/nebuly-ai/nebullvm/tree/main/apps/acceler...
OpenAlphaTensor: Increase the computational performance of an AI model with a custom-generated matrix-multiplication algorithm fine-tuned for your specific hardware. https://github.com/nebuly-ai/nebullvm/tree/main/apps/acceler...
Forward-Forward: The Forward-Forward algorithm is a method for training deep neural networks that replaces backpropagation's forward and backward passes with two forward passes (a minimal sketch of the idea follows this list). https://github.com/nebuly-ai/nebullvm/tree/main/apps/acceler...
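To make the Forward-Forward item concrete, here is a minimal sketch of the core idea, assuming PyTorch; the layer sizes, goodness threshold, and random stand-in data are illustrative assumptions, not Nebuly's actual implementation.

```python
# Minimal sketch of the Forward-Forward idea (Hinton, 2022), assuming PyTorch.
# Each layer is trained locally: "goodness" (sum of squared activations) should
# be high for positive (real) data and low for negative (corrupted) data.
import torch
import torch.nn as nn
import torch.nn.functional as F

class FFLayer(nn.Module):
    def __init__(self, dim_in, dim_out, threshold=2.0, lr=0.03):
        super().__init__()
        self.linear = nn.Linear(dim_in, dim_out)
        self.threshold = threshold
        self.opt = torch.optim.Adam(self.parameters(), lr=lr)

    def forward(self, x):
        # Normalize so goodness from the previous layer cannot leak through.
        x = x / (x.norm(dim=1, keepdim=True) + 1e-8)
        return F.relu(self.linear(x))

    def train_step(self, x_pos, x_neg):
        # Two forward passes instead of one forward + one backward through the net:
        g_pos = self.forward(x_pos).pow(2).sum(dim=1)  # goodness, positive pass
        g_neg = self.forward(x_neg).pow(2).sum(dim=1)  # goodness, negative pass
        # Push positive goodness above the threshold and negative below it.
        loss = torch.cat([
            F.softplus(self.threshold - g_pos),
            F.softplus(g_neg - self.threshold),
        ]).mean()
        self.opt.zero_grad()
        loss.backward()  # local gradient: only this layer's weights update
        self.opt.step()
        # Detach so the next layer is trained greedily, with no global backprop.
        return self.forward(x_pos).detach(), self.forward(x_neg).detach()

layers = [FFLayer(784, 256), FFLayer(256, 256)]
x_pos = torch.rand(32, 784)   # stand-in for real samples
x_neg = torch.rand(32, 784)   # stand-in for corrupted/negative samples
for layer in layers:          # layers are trained greedily, one at a time
    x_pos, x_neg = layer.train_step(x_pos, x_neg)
```

The greedy, layer-local objective is what removes the global backward pass; the `loss.backward()` above only touches a single layer's weights.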
- Open source implementation for LLaMA-based ChatGPT
hh-rlhf
- Meta wants its open source AI model to be as capable as OpenAI’s best model
If you ask an LLM to complete a sentence like '[Insert name] stole the fruit (true/false):'
An aligned LLM will be biased toward refusing to answer at all, with something like: "I can't tell you because I don't know them."
An "uncensored" LLM will very happily return <"true"> or <"false"> with a probability attached to each. Even OpenAI's GPT-3 does this at a low enough temperature.
Of course, LLM attention doesn't work like that. The tokens are just a bag of numbers:
- The fact that the name 'John' is mentioned in the Bible a lot affects the distribution when you ask whether any John stole something, because 'John' is always [7554]
- The fact that 'olf' is part of 'Adolf', and that Adolf Hitler is mentioned in a lot of negative sentences, will drag the distribution, because 'olf' is always [4024] and 'Adolf' is always [324, 4024]
You could have asked something with no logical probability difference at all, like:
- 'The store attendant's name was [name], did the child in Long Island drop his ball (true/false):'
And unless you train the model to give you disclaimers, it still follows the instruction faithfully and returns true/false with probabilities, demonstrating a deep regression in reasoning...
That's why for models past a certain size, alignment increases performance: https://arxiv.org/abs/2204.05862.
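To make the probability-reading described in that comment concrete, here is a minimal sketch, assuming the Hugging Face transformers library and GPT-2; the prompt mirrors the commenter's example, and the token IDs a real tokenizer prints will differ from the bracketed numbers quoted above, which depend on the tokenizer used.

```python
# Minimal sketch: read a causal LM's next-token probabilities for "true"/"false"
# after a "[name] stole the fruit (true/false):" prompt. Assumes Hugging Face
# transformers and GPT-2; any causal LM works the same way.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

def true_false_probs(name):
    prompt = f"{name} stole the fruit (true/false):"
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0, -1]   # next-token distribution
    probs = torch.softmax(logits, dim=-1)
    # Compare the mass assigned to " true" vs " false" as the next token.
    t = tokenizer.encode(" true")[0]
    f = tokenizer.encode(" false")[0]
    return probs[t].item(), probs[f].item()

for name in ["John", "Adolf"]:
    p_true, p_false = true_false_probs(name)
    # The raw distribution shifts with the name's training-data associations,
    # even though neither answer is logically justified by the prompt.
    print(name, tokenizer.encode(f" {name}"), p_true, p_false)
```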
- Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
- OpenDILab Awesome Paper Collection: RL with Human Feedback (3)
Title: Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
- Show HN: ChatLLaMA – A ChatGPT style chatbot for Facebook's LLaMA
It just hasn't been prompted or fine-tuned to have the neutral, self-effacing personality of ChatGPT.
It's doing the pure "try to guess the most likely next token" task on which they were both trained (https://heartbeat.comet.ml/causal-language-modeling-with-gpt...) (before the reinforcement learning from human feedback that makes them more tool-like https://arxiv.org/abs/2204.05862), with a bit of randomness added for variety's sake (https://huggingface.co/blog/how-to-generate).
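As a side note on that "bit of randomness": here is a minimal sketch of temperature sampling over next-token logits, assuming PyTorch; the vocabulary-sized logits tensor is a stand-in for a real model's output.

```python
# Minimal sketch of temperature sampling, assuming PyTorch. The logits tensor
# is a random stand-in for the last-position output of a real causal LM.
import torch

def sample_next_token(logits, temperature=0.8):
    # Lower temperature sharpens the distribution (more deterministic);
    # higher temperature flattens it (more variety).
    probs = torch.softmax(logits / temperature, dim=-1)
    return torch.multinomial(probs, num_samples=1).item()

logits = torch.randn(50257)  # GPT-2-sized vocabulary, stand-in values
print(sample_next_token(logits, temperature=0.8))
```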
- [D] Is Anthropic influential in research?
They have done good work, like releasing their paper and dataset for training an assistant with RLHF. https://github.com/anthropics/hh-rlhf
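For reference, a minimal sketch of reading that dataset, assuming the Hugging Face datasets library and the Anthropic/hh-rlhf hub mirror with its chosen/rejected fields:

```python
# Minimal sketch: load Anthropic's HH-RLHF preference data. Assumes the
# Hugging Face datasets library and the "Anthropic/hh-rlhf" hub mirror.
from datasets import load_dataset

ds = load_dataset("Anthropic/hh-rlhf", split="train")
example = ds[0]
# Each record pairs a preferred ("chosen") and dispreferred ("rejected")
# assistant response to the same conversation, for reward-model training.
print(example["chosen"][:200])
print(example["rejected"][:200])
```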
- [R] Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned - Anthropic - Ganguli et al 2022
Github: https://github.com/anthropics/hh-rlhf
What are some alternatives?
tvm - Open deep learning compiler stack for cpu, gpu and specialized accelerators
stanford_alpaca - Code and documentation to train Stanford's Alpaca models, and generate the data.
AITemplate - AITemplate is a Python framework which renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
awesome-RLHF - A curated list of reinforcement learning with human feedback resources (continually updated)
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
alpaca-7b-truss
alpaca-lora - Instruct-tune LLaMA on consumer hardware
TensorRT - NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
LLM-As-Chatbot - LLM as a Chatbot Service
deepsparse - Sparsity-aware deep learning inference runtime for CPUs