Fine-tuning LLMs locally: A step-by-step guide

This page summarizes the projects mentioned and recommended in the original post on dev.to

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io
featured
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
  1. examples

    TensorFlow examples (by tensorflow)

    Installation of PyTorch or TensorFlow

  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. CoreNLP

    CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

    A suitable LLM library such as Hugging Face's Transformers or Stanford's CoreNLP

  4. Pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Next, we define a training loop that uses our prepared data and optimizes the weights of the model. Here's an example using PyTorch:

  5. transformers

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    The first step is to prepare your data for fine-tuning. This usually involves tokenizing your text and converting it into a format that the LLM can understand. Here's an example using the Transformers library:

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • First open source text to video 1.7 billion parameter diffusion model is out

    8 projects | /r/StableDiffusion | 19 Mar 2023
  • KerasHub Welcomes Qwen3

    1 project | news.ycombinator.com | 30 May 2025
  • None of the top 10 projects in GitHub is actually a software project 🤯

    6 projects | dev.to | 10 May 2025
  • Top Programming Languages for AI Development in 2025

    9 projects | dev.to | 29 Apr 2025
  • Show HN: Txeo – A Modern C++ Wrapper for TensorFlow

    3 projects | news.ycombinator.com | 21 Feb 2025