promptbench
Inbox Zero
promptbench | Inbox Zero | |
---|---|---|
4 | 4 | |
2,214 | 2,009 | |
6.9% | - | |
9.1 | 9.9 | |
10 days ago | 6 days ago | |
Python | TypeScript | |
MIT License | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
promptbench
-
Show HN: Times faster LLM evaluation with Bayesian optimization
Fair question.
Evaluate refers to the phase after training to check if the training is good.
Usually the flow goes training -> evaluation -> deployment (what you called inference). This project is aimed for evaluation. Evaluation can be slow (might even be slower than training if you're finetuning on a small domain specific subset)!
So there are [quite](https://github.com/microsoft/promptbench) [a](https://github.com/confident-ai/deepeval) [few](https://github.com/openai/evals) [frameworks](https://github.com/EleutherAI/lm-evaluation-harness) working on evaluation, however, all of them are quite slow, because LLM are slow if you don't have infinite money. [This](https://github.com/open-compass/opencompass) one tries to speed up by parallelizing on multiple computers, but none of them takes advantage of the fact that many evaluation queries might be similar and all try to evaluate on all given queries. And that's where this project might come in handy.
- FLaNK Weekly 31 December 2023
- FLaNK 25 December 2023
- Promptbench: A Unified Library for Evaluating and Understanding LLMs
Inbox Zero
-
Show HN: Simple email mode for Gmail's 20th anniversary
On April 1st 2004, Google released Gmail.
Twenty years later, I'm releasing an open source email app that helps you reach inbox zero for a single day: Simple Email Mode
Simple Mode makes handling less overwhelming:
* Handle emails in batches of 5
* Long emails summarized
* Timer to gamify maintain focus
* Set aside what you want to handle later. Archive the rest
You can try it out at https://getinboxzero.com under the Early Access tab.
And check out the GitHub repo: https://github.com/elie222/inbox-zero
Only for Gmail users at this time.
- FLaNK Weekly 31 December 2023
-
Show HN: Inbox Zero – open-source email assistant
You can also see when we deploy each time on GitHub: https://github.com/elie222/inbox-zero/deployments
- InboxZero – Organize your inbox with the help of AI
What are some alternatives?
awesome-gpt-prompt-engineering - A curated list of awesome resources, tools, and other shiny things for GPT prompt engineering.
OpenVoice - Instant voice cloning by MyShell.
osgameclones - Open Source Clones of Popular Games
basestack - The Open-Source Stack for Developers and Startups
opencompass - OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
lightllm - LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
JavaOnRaspberryPi - Sources and scripts for the book "Getting started with Java on the Raspberry Pi"
nextjs-resources - A web application for sharing useful Next.js resources
Zolver - Automatic jigsaw puzzle solver
fullstack-graphql-app - An opinionated fullstack GraphQL monorepo boilerplate using pnpm, Turborepo, Prisma, GraphQL Yoga 2, Fastify, Nextjs, urql, and React
FLiPStackWeekly - FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
neomutt - ✉️ Teaching an Old Dog New Tricks -- IRC: #neomutt on irc.libera.chat