UC Berkeley: World Model on Million-Length Video and Language with RingAttention

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  • LWM

  • Looking at https://github.com/LargeWorldModel/LWM, they do indeed seem to suggest using a TPU VM.

  • llama.cpp

    LLM inference in C/C++

  • https://github.com/ggerganov/llama.cpp/discussions/2948

    You can run ollama (and a web UI) pretty trivially via docker:

    docker run -d --gpus=all -v /some/dir/for/ollama/data:/root/.ollama -p 11434:11434 --name ollama ollama/ollama:latest

  • ollama

    Get up and running with Llama 3, Mistral, Gemma, and other large language models.

  • docker run -d -p 3000:8080 --add-host=host.docker.internal:host-gateway --name ollama-webui ghcr.io/ollama-webui/ollama-webui:main

    That particular web UI will let you upload models (with configuration). Otherwise, you can use the API directly (you'll need to POST a `blob` first):

    https://github.com/ollama/ollama/blob/main/docs/api.md#creat...
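
As a rough illustration of the "POST a `blob` first" step, here is a minimal Python sketch. It assumes the blob endpoint has the form `POST /api/blobs/sha256:<hex digest>` (as the ollama API docs describe it, to the best of my reading) and that the server listens on port 11434, as published by the docker command above. The helper names `stream_digest`, `blob_url`, and `upload_blob` are made up for this example and are not part of ollama.

```python
import hashlib
import os
import urllib.request

# Assumption: ollama is reachable on the port published by the docker command above.
OLLAMA_URL = "http://localhost:11434"


def stream_digest(stream, chunk_size=1 << 20):
    """Return 'sha256:<hex>' for a binary stream, hashing it chunk by chunk."""
    h = hashlib.sha256()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        h.update(chunk)
    return "sha256:" + h.hexdigest()


def blob_url(base_url, digest):
    """Build the blob endpoint URL (path assumed from ollama's API docs)."""
    return base_url.rstrip("/") + "/api/blobs/" + digest


def upload_blob(path, base_url=OLLAMA_URL):
    """POST a local model file as a blob and return its digest (hypothetical helper)."""
    with open(path, "rb") as f:
        digest = stream_digest(f)
        f.seek(0)  # rewind so the request body starts at the beginning of the file
        req = urllib.request.Request(
            blob_url(base_url, digest),
            data=f,
            # An explicit Content-Length lets urllib stream the file from disk.
            headers={"Content-Length": str(os.path.getsize(path))},
            method="POST",
        )
        # urlopen raises urllib.error.HTTPError on a non-2xx response.
        with urllib.request.urlopen(req) as resp:
            resp.read()
    return digest
```

Calling `upload_blob("/path/to/model.gguf")` (a placeholder path) would return the digest string, which a later model-creation request can reference; the exact fields of that request are in the API docs linked above.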

NOTE: The number of mentions on this list indicates mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.


Related posts