Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Hallucination-leaderboard Alternatives
Similar projects and alternatives to hallucination-leaderboard
-
SuperAGI
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
-
autogen
A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
-
h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/
-
quivr
Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
amazon-bedrock-with-builder-and-command-patterns
A simple, yet powerful implementation in Java that allows developers to write a rather straightforward code to create the API requests for the different foundation models supported by Amazon Bedrock.
-
dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
-
Woodpecker
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs. (by BradyFU)
-
awesome-generative-ai
A curated list of modern Generative Artificial Intelligence projects and services
-
awesome-generative-deep-art
Discontinued A curated list of Generative AI tools, works, models, and references [Moved to: https://github.com/filipecalegario/awesome-generative-ai]
-
selfcheckgpt
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
hallucination-leaderboard reviews and mentions
-
How to Detect AI Hallucinations
To checkout the Hallucination leaderboard click here
-
Launch HN: Danswer (YC W24) – Open-source AI search and chat over private data
Nice to see yet another open source approach to LLM/RAG. For those who do not want to meddle with the complexity of do-it-youself, Vectara (https://vectara.com) provides a RAG-as-a-service approach - pretty helpful if you want to stay away from having to worry about all the details, scalability, security, etc - and just focus on building your RAG application.
-
Went down the rabbit hole of 100% local RAG, it works but are there better options?
Check this leaderboard, it is specific for RAG use case: https://github.com/vectara/hallucination-leaderboard
-
Which LLM framework(s) do you use in production and why?
You should also check us out (https://vectara.com) - we provide RAG as a service so you don't have to do all the heavy lifting and putting together the pieces yourself.
-
Ask HN: Best Alternatives to OpenAI ChatGPT?
Llama 2 (and variants). Has the lowest hallucination rate (https://github.com/vectara/hallucination-leaderboard), and its open source and so we know what went into it, and the community can improve it
-
Inflection-2: the next step up
This is just typical of so much work in the field. They pick and choose which models to compare against and on which benchmarks. If this model was truly great, they would be comparing against Claude 2 and GPT4 across a bunch of different benchmarks. Instead they compare against Palm 2, which in a lot of tests is a weak model (https://venturebeat.com/ai/google-bard-fails-to-deliver-on-i....) and prone to hallucination (https://github.com/vectara/hallucination-leaderboard).
- LLMs by Hallucination Rate
-
A note from our sponsor - InfluxDB
www.influxdata.com | 13 May 2024
Stats
vectara/hallucination-leaderboard is an open source project licensed under Apache License 2.0 which is an OSI approved license.
Popular Comparisons
- hallucination-leaderboard VS Woodpecker
- hallucination-leaderboard VS SuperAGI
- hallucination-leaderboard VS nohide
- hallucination-leaderboard VS h2ogpt
- hallucination-leaderboard VS autogen
- hallucination-leaderboard VS YiVal
- hallucination-leaderboard VS awesome-generative-ai
- hallucination-leaderboard VS ChatGPT-Prompts
- hallucination-leaderboard VS awesome-generative-deep-art
- hallucination-leaderboard VS amazon-bedrock-with-builder-and-command-patterns
Sponsored