Hello OLMo: A Open LLM

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Scout Monitoring - Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
www.scoutapm.com
featured
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
  • dbrx

    Code examples and resources for DBRX, a large language model developed by Databricks

  • One thing I wanted to add and call attention to is the importance of licensing in open models. This is often overlooked when we blindly accept the vague branding of models as “open”, but I am noticing that many open weight models are actually using encumbered proprietary licenses rather than standard open source licenses that are OSI approved (https://opensource.org/licenses). As an example, Databricks’s DBRX model has a proprietary license that forces adherence to their highly restrictive Acceptable Use Policy by referencing a live website hosting their AUP (https://github.com/databricks/dbrx/blob/main/LICENSE), which means as they change their AUP, you may be further restricted in the future. Meta’s Llama is similar (https://github.com/meta-llama/llama/blob/main/LICENSE ). I’m not sure who can depend on these models given this flaw.

  • llama

    Inference code for Llama models

  • One thing I wanted to add and call attention to is the importance of licensing in open models. This is often overlooked when we blindly accept the vague branding of models as “open”, but I am noticing that many open weight models are actually using encumbered proprietary licenses rather than standard open source licenses that are OSI approved (https://opensource.org/licenses). As an example, Databricks’s DBRX model has a proprietary license that forces adherence to their highly restrictive Acceptable Use Policy by referencing a live website hosting their AUP (https://github.com/databricks/dbrx/blob/main/LICENSE), which means as they change their AUP, you may be further restricted in the future. Meta’s Llama is similar (https://github.com/meta-llama/llama/blob/main/LICENSE ). I’m not sure who can depend on these models given this flaw.

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
  • OLMo

    Modeling, training, eval, and inference code for OLMo

  • It looks like the weights [0] and code [1] are Apache licensed, but the training data [2] is using the license that OP is quoting from.

    [0] https://huggingface.co/allenai/OLMo-7B

    [1] https://github.com/allenai/OLMo

    [2] https://huggingface.co/datasets/allenai/dolma

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • DBRX: A New Open LLM

    6 projects | news.ycombinator.com | 27 Mar 2024
  • EU Approves AI Act

    1 project | news.ycombinator.com | 25 May 2024
  • Prompt Fuzzer: how to keep your agents on the right path

    1 project | dev.to | 20 May 2024
  • Practical Llama 3 inference implemented in a single Java file

    1 project | news.ycombinator.com | 19 May 2024
  • Practical Llama 3 inference implemented in a single Java file

    1 project | news.ycombinator.com | 18 May 2024