Wrapyfi for distributing LLaMA by Meta on different machines

This page summarizes the projects mentioned and recommended in the original post on /r/foss

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • wrapyfi

    Python Wrapper for Message-Oriented and Robotics Middleware

  • The authors present an example of combining Wrapyfi (https://github.com/fabawi/wrapyfi), a Python wrapper for message-oriented and robotics middleware, with LLaMA (https://github.com/facebookresearch/llama), a series of large language models from Meta AI. They demonstrate how Wrapyfi can enable running LLaMA on multiple mid-range machines with high inference speed and low cost. They also provide links to their GitHub repository (https://github.com/modular-ml/wrapyfi-examples_llama) and paper (https://arxiv.org/abs/2302.09648) for more details. They state that this example can revolutionize natural language processing tasks such as text generation, summarization, question answering, sentiment analysis, etc. without having to buy new hardware and use their existing infrastructure!

  • llama

    Inference code for Llama models

  • The authors present an example of combining Wrapyfi (https://github.com/fabawi/wrapyfi), a Python wrapper for message-oriented and robotics middleware, with LLaMA (https://github.com/facebookresearch/llama), a series of large language models from Meta AI. They demonstrate how Wrapyfi can enable running LLaMA on multiple mid-range machines with high inference speed and low cost. They also provide links to their GitHub repository (https://github.com/modular-ml/wrapyfi-examples_llama) and paper (https://arxiv.org/abs/2302.09648) for more details. They state that this example can revolutionize natural language processing tasks such as text generation, summarization, question answering, sentiment analysis, etc. without having to buy new hardware and use their existing infrastructure!

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • wrapyfi-examples_llama

    Inference code for facebook LLaMA models with Wrapyfi support

  • The authors present an example of combining Wrapyfi (https://github.com/fabawi/wrapyfi), a Python wrapper for message-oriented and robotics middleware, with LLaMA (https://github.com/facebookresearch/llama), a series of large language models from Meta AI. They demonstrate how Wrapyfi can enable running LLaMA on multiple mid-range machines with high inference speed and low cost. They also provide links to their GitHub repository (https://github.com/modular-ml/wrapyfi-examples_llama) and paper (https://arxiv.org/abs/2302.09648) for more details. They state that this example can revolutionize natural language processing tasks such as text generation, summarization, question answering, sentiment analysis, etc. without having to buy new hardware and use their existing infrastructure!

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Wrapify is a middleware communication wrapper for running the same script on multiple machines. Run the Python script everywhere and choose where each method executes by simply declaring it as a publisher or a listener [currently supports YARP; ROS and ROS2 coming soon]

    1 project | /r/opensource | 21 Jan 2022
  • Wrapify is a middleware communication wrapper for running the same script on multiple machines. Run the Python script everywhere and choose where each method executes by simply declaring it as a publisher or a listener [currently supports YARP; ROS and ROS2 coming soon]

    1 project | /r/coolgithubprojects | 21 Jan 2022
  • Fzf based Pokedex

    7 projects | /r/commandline | 18 Apr 2023
  • Shuffling large data at constant memory in Dask

    1 project | /r/Python | 17 Apr 2023
  • foss browser vs brave/firefox

    3 projects | /r/fossdroid | 2 Apr 2023