BooookScore
searchGPT
BooookScore | searchGPT | |
---|---|---|
1 | 3 | |
75 | 570 | |
- | - | |
7.0 | 7.2 | |
2 months ago | about 1 year ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
BooookScore
-
Evaluating faithfulness and content selection of LLMs in book-length summaries
With a link to https://arxiv.org/pdf/2310.00785.pdf - which then links to another GitHub repository, https://github.com/lilakk/BooookScore which has a bunch of prompts in https://github.com/lilakk/BooookScore/tree/main/prompts
Which makes me think that this original paper isn't evaluating LLMs so much as it's evaluating that one particular prompting technique for long summaries.
Gemini Pro 1.5 has 1m token context length, which should remove the need for weird hierarchical summary tricks. I wonder how well it would score?
searchGPT
What are some alternatives?
chatGPT-cheatsheet - An ever-evolving introduction to ChatGPT, AI, and machine learning (including prompt examples and Python-built chatbots)
chatgpt-extractive-shortener - Shortens a paragraph of text with ChatGPT, using successive rounds of word-level extractive summarization.
gpt4docstrings - Generating Python docstrings with OpenAI ChatGPT!!
AutoLearn-GPT - ChatGPT learns automatically.
llm-leaderboard - A joint community effort to create one central leaderboard for LLMs.
aidoc - A simple CLI tool to generate documentation for your Python projects automatically.
gmail-assist - Get control of your overflowing inbox using GPT-3 to classify your emails by importance.
embedchain - Personalizing LLM Responses
LLMChat - A Discord chatbot that supports popular LLMs for text generation and ultra-realistic voices for voice chat.
agents - An Open-source Framework for Autonomous Language Agents
draviz - A method for assessing the data readiness of NLP projects, as well as the code necessary for visualizing the outcome of the method.
SHREC2023-ANIMAR - Source codes of team TikTorch (1st place solution) for track 2 and 3 of the SHREC2023 Challenge