Example_Data Alternatives

Similar projects and alternatives to Example_Data

transformer-debugger

3 3,852 6.9 Python Example_Data VS transformer-debugger
InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better Example_Data alternative or higher similarity.

Suggest an alternative to Example_Data

Example_Data reviews and mentions

Posts with mentions or reviews of Example_Data. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-11.

OpenAI – Transformer Debugger Release
3 projects | news.ycombinator.com | 11 Mar 2024

We may well look back in future years and view the underlying approach introduced in Reexpress as among the more significant results of the first quarter of the 21st century. With Reexpress, we can generate reliable probability estimates over high-dimensional objects (e.g., LLMs), including in the presence of a non-trivial subset of distribution shifts seen in practice. A non-vacuous argument can be made that this solves the alignment/super-alignment problem (the ultimate goal of the line of work in the post above, and why I mention this here), because we can achieve this behavior via composition with networks of arbitrary size.
Because the parameters of the large neural networks are non-identifiable (in the statistical sense), we operate at the unit of analysis of labeled examples/exemplars (i.e., the observable data), with a direct connection between the Training set and the Calibration set.
This has important practical implications. It works with essentially any generative AI model. For example, we can build an 'uncertainty-aware GPT-4' for use in enterprise and professional settings, such as law: https://github.com/ReexpressAI/Example_Data/blob/main/tutori...
(The need for reliable, controllable estimates is critical regardless of any notion of AGI, since the existing LLMs are already getting baked into higher-risk settings, such as medicine, finance, and law.)
Efficient LLM fine-tuning for classification on Mac
1 project | news.ycombinator.com | 5 Jan 2024
How to locally run a semantic search with representations fine-tuned on your Mac
1 project | news.ycombinator.com | 3 Jan 2024
Show HN: On-device, no-code LLMs with guardrails (for Apple Silicon)
1 project | news.ycombinator.com | 14 Dec 2023

We've been working to make uncertainty quantification and interpretability first-class properties of LLMs. Reexpress one, a macOS app, is our first effort to make these properties widely available.
Perhaps counter-intuitively, and contrary to common wisdom, LLMs can in fact be transformed to generate very reliable uncertainty estimates (i.e., "knowing what they do and don't know" by assigning a probability to the output).
Getting there is a bit complicated, with vector matching/databases, prediction-time data dependencies, complicated inference, and multiple models flying all over the place.
We've made it simple and efficient to use in practice with an on-device, no-code approach. Common document classification tasks can be handled with the on-device models (up to 3.2 billion parameters). Additionally, you can add these capabilities to another LLM (e.g., for QA or more complicated tasks) by connecting your existing model by simply uploading the output logits into the app. For example, if you're using an on-device Mistral AI model, or cloud-based genAI model, just upload the output logits into the app.
Would be great to get feedback. Also, if you have another use case with a scale that doesn't fully fit into the on-device setting, happy to discuss and collaborate for your setting.
And if anyone finds this interesting and wants to get involved more in building reliable AI, let us know!
(Note that an Apple silicon Mac is required; ideally M1 Max or better with 64gb of RAM. You train the model yourself, which requires labeled data. The tutorial 1 video has a link to sentiment data in the JSON lines format; it's a good place to start: https://github.com/ReexpressAI/Example_Data/blob/main/tutori...)
A note from our sponsor - InfluxDB
www.influxdata.com | 22 May 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Stats

Basic Example_Data repo stats

Mentions

Stars

Activity

7.3

Last Commit

2 months ago

The primary programming language of Example_Data is Python.

Example_Data

Example_Data Alternatives

Similar projects and alternatives to Example_Data

transformer-debugger

InfluxDB

Example_Data reviews and mentions

Stats