DeepSeek-V3 VS DeepSeek-R1

Compare DeepSeek-V3 vs DeepSeek-R1 and see what are their differences.

Judoscale - Save 47% on cloud hosting with autoscaling that just works
Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
judoscale.com
featured
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai
featured
DeepSeek-V3 DeepSeek-R1
14 29
95,724 88,421
6.4% 8.3%
8.3 7.6
12 days ago 12 days ago
Python
MIT License MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

DeepSeek-V3

Posts with mentions or reviews of DeepSeek-V3. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-02-10.
  • DeepSeek V3-0324 vs. Claude 3.7 Sonnet Base: Which AI Codes Better?
    1 project | dev.to | 29 Mar 2025
  • Deepseek API Complete Guide: Mastering the DeepSeek API for Developers
    1 project | dev.to | 19 Mar 2025
    What distinguishes DeepSeek-V3 is its training efficiency—completed using only 2.664M H800 GPU hours on 14.8 trillion tokens, making it remarkably cost-effective for its size. Technical specifications are available on the GitHub page for DeepSeek-V3.
  • Analyzing DeepSeek API Instability: What API Gateways Can and Can't Do
    2 projects | dev.to | 10 Feb 2025
    DeepSeek, known for its high-performance AI models like R1 and V3, has been a game-changer in the AI landscape. However, recent reports have highlighted issues with API instability, affecting developers and users who rely on these services. Understanding the root causes of this instability is essential for addressing and mitigating these issues.
  • DeepSeek not as disruptive as claimed, firm has 50k GPUs and spent $1.6B
    1 project | news.ycombinator.com | 4 Feb 2025
    It is not FOSS. The LLM industry has repurposed "open source" to mean "you can run the model yourself." They've released the model, but it does not meet the 'four freedoms' standard: https://github.com/deepseek-ai/DeepSeek-V3/blob/main/LICENSE...
  • Build your next AI Tech Startup with DeepSeek
    6 projects | dev.to | 3 Feb 2025
    Typically, training parts of an AI model usually meant updating the whole thing, even if some parts didn't contribute anything, which lead to a massive waste of resources. To solve this, they introduced an Auxiliary-Loss-Free (ALS) Load Balancing. The ALS Load Balancing works by introducing a bias factor to prevent overloading one chip, while under-utilizing another (Source). This resulted in only 5% of the model's parameters being trained per-token, and around 91% cheaper cost to train than GPT 4 (GPT 4 costed $63 million to train (Source) and V3 costed $5.576 million to train. (Source))
  • Is DeepSeek’s Influence Overblown?
    1 project | dev.to | 31 Jan 2025
    According to the official paper, DeepSeek took only $5.6 mln to train with impressive results. This is a remarkable achievement for a large language model (LLM). In comparison, OpenAI's CEO Sam Altman admitted that training OpenAI GPT-4 took over $100 mln, not saying how much more. Some AI specialists assume that the estimation of the DeepSeek training expense is underreported. Nevertheless, the hidden gem is not how much it cost to train but how drastically it improved runtime requirements.
  • Maybe you missed this file when looking at DeepSeek?
    1 project | news.ycombinator.com | 30 Jan 2025
  • DeepSeek proves the future of LLMs is open-source
    4 projects | news.ycombinator.com | 29 Jan 2025
    > If the magic values are some kind of microcode or firmware, or something else that is executed in some way, then no, it is not really open source.

    To my understanding, the contents of a .safetensors file is purely numerical weights - used by the model defined in MIT-licensed code[0] and described in a technical report[1]. The weights are arguably only really "executed" to the same extent kernel weights of a gaussian blur filter would be, though there is a large difference in scale and effect.

    [0]: https://github.com/deepseek-ai/DeepSeek-V3/blob/main/inferen...

    [1]: https://arxiv.org/html/2412.19437v1

  • DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL
    8 projects | news.ycombinator.com | 25 Jan 2025
  • AI and Startup Moats
    1 project | news.ycombinator.com | 7 Jan 2025
    But the cost is _definitely_ falling. For a recent example, see DeepSeek V3[1]. It's a model that's competitive with GPT-4, Claude Sonnet. But cost ~$6 Million to train.

    This is ridiculously cheaper than what we had before. Inference is basically getting an 10x cheaper per year!

    We're spending more because bigger models are worth the investment. But the "price per unit of [intelligence/quality]" is getting lower and _fast_.

    Saying that models are getting more expensive is confusing the absolute value spent with the value for money.

    - [1] https://github.com/deepseek-ai/DeepSeek-V3/tree/main

DeepSeek-R1

Posts with mentions or reviews of DeepSeek-R1. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-02-10.
  • Proximal Policy Optimization (PPO) and Generalized Reinforcement Learning with Proximal Optimizer (GRPO)
    1 project | dev.to | 19 Apr 2025
    Both Proximal Policy Optimization (PPO) and Generalized Reinforcement Learning with Proximal Optimizer (GRPO) are the algorithm of Reinforcement Learning (RL). In this blog, I am going to explain both the algorithms in reference to a chess game. I came across these terms while reading DeepSeek R1's Research Papers.
  • DeepSeek drops recommended R1 deployment settings
    1 project | news.ycombinator.com | 14 Feb 2025
  • Analyzing DeepSeek API Instability: What API Gateways Can and Can't Do
    2 projects | dev.to | 10 Feb 2025
    DeepSeek, known for its high-performance AI models like R1 and V3, has been a game-changer in the AI landscape. However, recent reports have highlighted issues with API instability, affecting developers and users who rely on these services. Understanding the root causes of this instability is essential for addressing and mitigating these issues.
  • LIMO: Less Is More for Reasoning
    5 projects | news.ycombinator.com | 9 Feb 2025
    We kind-of have that in DeepSeek-R1-zero [1], but it has problem. From the original authors:

    > With RL, DeepSeek-R1-Zero naturally emerged with numerous powerful and interesting reasoning behaviors. However, DeepSeek-R1-Zero encounters challenges such as endless repetition, poor readability, and language mixing.

    A lot of these we can probably solve, but as other have pointed out we want a model that humans can converse with, not an AI for the purpose of other AI.

    That said, it seems like a promising area of research:

    > DeepSeek-R1-Zero demonstrates capabilities such as self-verification, reflection, and generating long CoTs, marking a significant milestone for the research community.

    [1] https://github.com/deepseek-ai/DeepSeek-R1

  • Decoding DeepSeek R1's Research Abstract
    1 project | dev.to | 5 Feb 2025
    Thank you for reading the blog. Here is the DeepSeek R1's Research Paper in case you want to check it out.
  • DeepSeek, Efficiency, and Big Tech’s Response
    1 project | dev.to | 3 Feb 2025
    deepseek-ai/DeepSeek-R1
  • Build your next AI Tech Startup with DeepSeek
    6 projects | dev.to | 3 Feb 2025
    Benchmarks of reasoning models. Source
  • Efficient Reasoning with Hidden Thinking
    1 project | news.ycombinator.com | 3 Feb 2025
    For R1-Zero, the RL process focused on accuracy and making sure that thinking was inside the correct tags, but it did not enforce any particular structure on the thinking itself. [1]

    The pure RL approach improved reasoning, but had readability issues that led to using SFT thinking data in the final version of R1:

    > DeepSeek-R1-Zero struggles with challenges like poor readability, and language mixing. To make reasoning processes more readable and share them with the open community, we explore DeepSeek-R1, a method that utilizes RL with human-friendly cold-start data.

    [1] https://github.com/deepseek-ai/DeepSeek-R1/blob/main/DeepSee...

  • DeepSeek R1 Will Change the World: Advanced AI Accessible to Everyone
    3 projects | dev.to | 31 Jan 2025
    If that weren't enough, they've published the code as Open Source. Transparent. Ready to be studied, modified, and improved by anyone (Including competitors).
  • DeepSeek is now available on Microsoft Azure 🌚
    1 project | dev.to | 30 Jan 2025
    Access the Repository: Visit the DeepSeek R1 GitHub repository to explore the model's codebase.

What are some alternatives?

When comparing DeepSeek-V3 and DeepSeek-R1 you can also consider the following projects:

DeepSeek-LLM - DeepSeek LLM: Let there be answers

TinyZero - Clean, minimal, accessible reproduction of DeepSeek R1-Zero

open-r1 - Fully open reproduction of DeepSeek-R1

example-deepseek-r1 - A lightweight Node.js proxy server for interacting with locally-hosted large language models through Ollama. This implementation specifically demonstrates running the DeepSeek-R1 7B model.

sglang - SGLang is a fast serving framework for large language models and vision language models.

Judoscale - Save 47% on cloud hosting with autoscaling that just works
Judoscale integrates with Django, FastAPI, Celery, and RQ to make autoscaling easy and reliable. Save big, and say goodbye to request timeouts and backed-up task queues.
judoscale.com
featured
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai
featured