FEAT: [Model] Support DeepSeek-V3.1 Quantization and tool #4022

Jun-Howie · 2025-08-29T06:51:40Z

# ❗ There are glitches with vllm 0.10.1.1; still looking for a resolution ❗
# ❗ Downgrade vllm for now ❗
pip install vllm==0.9.2 transformers==4.53.0

SITE_PACKAGES=$(pip -V | awk '{print $4}' | sed 's/\/pip$//')
# ❗ Patch the AWQ MoE quant config; otherwise some modules cannot be loaded properly ❗
cp awq_marlin.py "$SITE_PACKAGES/vllm/model_executor/layers/quantization/awq_marlin.py"
# ❗ Patch for the fp32 e_score_correction_bias, see https://www.github.com/vllm-project/vllm/pull/23640 ❗
cp deepseek_v2.py "$SITE_PACKAGES/vllm/model_executor/models/deepseek_v2.py"
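
The two `cp` commands above overwrite files inside the installed vllm package, so it may be worth keeping the originals around in case the patch has to be reverted. A minimal sketch of that idea follows; `patch_file` and the `.orig` suffix are hypothetical names introduced here for illustration and are not part of this PR or of vllm.

```shell
# Hypothetical helper (not part of the PR): replace a file in an installed
# package, keeping a one-time backup of the original next to it.
patch_file() {
    src="$1"
    dest="$2"
    # Stop if either file is missing, e.g. because the installed package
    # layout differs from what the patch expects.
    [ -f "$src" ] || { echo "missing patch file: $src" >&2; return 1; }
    [ -f "$dest" ] || { echo "missing target file: $dest" >&2; return 1; }
    # Keep only the first original; never overwrite an existing backup.
    [ -f "$dest.orig" ] || cp "$dest" "$dest.orig"
    cp "$src" "$dest"
}
```

Under these assumptions, each `cp` line could be swapped for a call such as `patch_file awq_marlin.py "$SITE_PACKAGES/vllm/model_executor/layers/quantization/awq_marlin.py"`, and copying the `.orig` file back would restore the stock module.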

qinxuye

LGTM

FEAT :Support DeepSeek-V3.1 Quantization and tool

13f5b7e

XprobeBot added this to the v1.x milestone Aug 29, 2025

Jun-Howie added 2 commits August 29, 2025 14:53

FEAT :Support DeepSeek-V3.1 Quantization and tool

134147c

Fix:activated_size_in_billions for the DeepSeek V3 series

92ed332

qinxuye approved these changes Aug 29, 2025


qinxuye changed the title ~~FEAT :Support DeepSeek-V3.1 Quantization and tool~~ FEAT: [Model] Support DeepSeek-V3.1 Quantization and tool Aug 29, 2025

XprobeBot added the feature label Aug 29, 2025

qinxuye merged commit 954c544 into xorbitsai:main Aug 29, 2025
4 of 13 checks passed

Jun-Howie deleted the DeepSeek-V3.1 branch August 30, 2025 01:37
