-
BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
I was thinking about evaluating the accuracy of the results: something around the lines of https://github.com/EleutherAI/lm-evaluation-harness or https://github.com/google/BIG-bench
I was thinking about evaluating the accuracy of the results: something around the lines of https://github.com/EleutherAI/lm-evaluation-harness or https://github.com/google/BIG-bench
Either way, anyone relying-on or building-upon bard is an idiot. It's more predictable than Lucy and Charlie Brown with the football.