local_llama
This repo showcases how you can run a model locally and offline, free of OpenAI dependencies.
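The repo documents its own exact setup; as a rough illustration of the local-and-offline pattern, here is a minimal sketch using llama-cpp-python (my library choice, not necessarily what local_llama uses), with an assumed path to a quantized GGUF model on disk:

```python
# Minimal local/offline inference sketch using llama-cpp-python.
# Assumptions (not from the repo): the library choice and the model path
# are illustrative; local_llama may wire things up differently.
from llama_cpp import Llama

# Load a quantized model from disk -- no network access or API key needed.
llm = Llama(
    model_path="./models/llama-2-7b-chat.Q4_K_M.gguf",  # assumed local path
    n_ctx=2048,  # context window size
)

# Run a completion entirely on the local machine.
result = llm(
    "Q: What does running a model locally buy you? A:",
    max_tokens=128,
    stop=["Q:"],
)
print(result["choices"][0]["text"])
```

Once the model file is on disk, nothing in this loop touches the network, which is what makes the fully offline ("airplane mode") usage mentioned below possible.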
You may want to take a look at Zep, which supports stateless agents by storing messages out of process (in a memory store). It does a number of other things too, such as summarization, entity extraction, and vector search over historical memory. Disclosure: I'm a coauthor. https://github.com/getzep/zep
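For anyone unfamiliar with the pattern: the agent holds no state itself and round-trips each turn to a separate memory service, which can then summarize and index the history on its side. The sketch below illustrates that shape only; the endpoints and payload fields are hypothetical stand-ins I made up, not Zep's actual API (see the linked repo for the real client):

```python
# Sketch of the out-of-process memory pattern: the agent stays stateless;
# each turn is appended to, and context is fetched from, a memory service.
# The routes and payload shapes below are HYPOTHETICAL illustrations,
# not Zep's actual API -- consult https://github.com/getzep/zep for that.
import requests

MEMORY_URL = "http://localhost:8000"  # assumed address of the memory service

def add_turn(session_id: str, role: str, content: str) -> None:
    """Persist one chat message out of process."""
    resp = requests.post(
        f"{MEMORY_URL}/sessions/{session_id}/messages",  # hypothetical route
        json={"role": role, "content": content},
    )
    resp.raise_for_status()

def get_context(session_id: str) -> str:
    """Fetch a condensed context window instead of replaying raw history."""
    resp = requests.get(
        f"{MEMORY_URL}/sessions/{session_id}/context"  # hypothetical route
    )
    resp.raise_for_status()
    # e.g. a rolling summary plus the most recent messages
    return resp.json()["summary"]
```

Because the service returns a summary rather than the raw transcript, this also sidesteps chat histories outgrowing the model's context length.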
I work with AWS daily, using Terraform, Python, and Java to create and maintain enterprise solutions. I have played with SageMaker, but it is so expensive that I hate to leave it up for longer than a day. I downloaded this and created a chat with your docs (entirely in airplane mode). Point being, I've hosted models both locally and in the cloud, but I ended up sticking with API calls since they're so cheap.