LoRA fine-tuning of Qwen3-8B produces checkpoints without adapter_config.json, so the trained weights cannot be loaded #421

@fppccc

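I set up the entire Qwen3-8B environment by following the tutorial, but after fine-tuning, none of the checkpoint folders contains adapter_config.json. How should I handle this?
Without adapter_config.json, the fine-tuned model cannot be loaded.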
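
As a first diagnostic, it can help to list what each checkpoint folder actually contains. The sketch below uses only the standard library and checks for the file names PEFT conventionally writes when it saves a LoRA adapter (`adapter_config.json` plus `adapter_model.safetensors` or `adapter_model.bin`). The output path `./output/Qwen3_8B_lora` and the helper name `inspect_checkpoint` are hypothetical placeholders, not taken from the tutorial.

```python
from pathlib import Path

# File names PEFT conventionally writes when saving a LoRA adapter.
ADAPTER_CONFIG = "adapter_config.json"
ADAPTER_WEIGHTS = ("adapter_model.safetensors", "adapter_model.bin")


def inspect_checkpoint(checkpoint_dir):
    """Report which adapter files are present in one checkpoint folder."""
    root = Path(checkpoint_dir)
    files = sorted(p.name for p in root.iterdir() if p.is_file())
    return {
        "has_adapter_config": ADAPTER_CONFIG in files,
        "has_adapter_weights": any(name in files for name in ADAPTER_WEIGHTS),
        "files": files,
    }


if __name__ == "__main__":
    # Hypothetical output directory; replace it with your own training output path.
    output_dir = Path("./output/Qwen3_8B_lora")
    if output_dir.is_dir():
        for ckpt in sorted(output_dir.glob("checkpoint-*")):
            print(ckpt.name, inspect_checkpoint(ckpt))
```

If neither the adapter config nor the adapter weights show up and the folder holds full model weight files instead, one possible reading is that the trainer saved the whole model rather than a PEFT adapter. That is only a hypothesis to check against the training script, not a confirmed cause.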

The environment is as follows:
Package Version
accelerate 1.7.0
aiohappyeyeballs 2.6.1
aiohttp 3.12.2
aiosignal 1.3.2
airportsdata 20250523
annotated-types 0.7.0
anyio 4.9.0
astor 0.8.1
attrs 25.3.0
bitsandbytes 0.46.0
blake3 1.0.5
boto3 1.38.23
botocore 1.38.23
cachetools 6.0.0
certifi 2025.4.26
charset-normalizer 3.4.2
click 8.2.1
cloudpickle 3.1.1
compressed-tensors 0.9.3
cupy-cuda12x 13.4.1
datasets 3.6.0
Deprecated 1.2.18
depyf 0.18.0
dill 0.3.8
diskcache 5.6.3
distro 1.9.0
dnspython 2.7.0
einops 0.8.1
email_validator 2.2.0
fastapi 0.115.12
fastapi-cli 0.0.7
fastrlock 0.8.3
filelock 3.18.0
frozenlist 1.6.0
fsspec 2025.3.0
gguf 0.16.3
googleapis-common-protos 1.70.0
grpcio 1.71.0
h11 0.16.0
hf-xet 1.1.2
httpcore 1.0.9
httptools 0.6.4
httpx 0.28.1
huggingface-hub 0.32.1
idna 3.10
importlib_metadata 8.0.0
interegular 0.3.3
Jinja2 3.1.6
jiter 0.10.0
jmespath 1.0.1
jsonschema 4.24.0
jsonschema-specifications 2025.4.1
lark 1.2.2
llguidance 0.7.24
llvmlite 0.44.0
lm-format-enforcer 0.10.11
markdown-it-py 3.0.0
MarkupSafe 3.0.2
mdurl 0.1.2
mistral_common 1.5.6
modelscope 1.26.0
mpmath 1.3.0
msgpack 1.1.0
msgspec 0.19.0
multidict 6.4.4
multiprocess 0.70.16
nest-asyncio 1.6.0
networkx 3.4.2
ninja 1.11.1.4
numba 0.61.2
numpy 2.1.2
nvidia-cublas-cu12 12.4.5.8
nvidia-cuda-cupti-cu12 12.4.127
nvidia-cuda-nvrtc-cu12 12.4.127
nvidia-cuda-runtime-cu12 12.4.127
nvidia-cudnn-cu12 9.1.0.70
nvidia-cufft-cu12 11.2.1.3
nvidia-curand-cu12 10.3.5.147
nvidia-cusolver-cu12 11.6.1.9
nvidia-cusparse-cu12 12.3.1.170
nvidia-cusparselt-cu12 0.6.2
nvidia-ml-py 12.575.51
nvidia-nccl-cu12 2.21.5
nvidia-nvjitlink-cu12 12.4.127
nvidia-nvtx-cu12 12.4.127
openai 1.82.0
opencv-python-headless 4.11.0.86
opentelemetry-api 1.26.0
opentelemetry-exporter-otlp 1.26.0
opentelemetry-exporter-otlp-proto-common 1.26.0
opentelemetry-exporter-otlp-proto-grpc 1.26.0
opentelemetry-exporter-otlp-proto-http 1.26.0
opentelemetry-proto 1.26.0
opentelemetry-sdk 1.26.0
opentelemetry-semantic-conventions 0.47b0
opentelemetry-semantic-conventions-ai 0.4.9
outlines 0.1.11
outlines_core 0.1.26
packaging 25.0
pandas 2.2.3
partial-json-parser 0.2.1.1.post5
peft 0.15.2
pillow 11.0.0
pip 25.1.1
prometheus_client 0.22.0
prometheus-fastapi-instrumentator 7.1.0
propcache 0.3.1
protobuf 4.25.7
psutil 7.0.0
py-cpuinfo 9.0.0
pyarrow 20.0.0
pycountry 24.6.1
pydantic 2.11.5
pydantic_core 2.33.2
Pygments 2.19.1
pynvml 12.0.0
python-dateutil 2.9.0.post0
python-dotenv 1.1.0
python-json-logger 3.3.0
python-multipart 0.0.20
pytz 2025.2
PyYAML 6.0.2
pyzmq 26.4.0
ray 2.46.0
referencing 0.36.2
regex 2024.11.6
requests 2.32.3
rich 14.0.0
rich-toolkit 0.14.6
rpds-py 0.25.1
s3transfer 0.13.0
safetensors 0.5.3
scipy 1.15.3
sentencepiece 0.2.0
setuptools 80.9.0
shellingham 1.5.4
six 1.17.0
sniffio 1.3.1
starlette 0.46.2
swankit 0.1.8
swanlab 0.5.9
sympy 1.13.1
tiktoken 0.9.0
tokenizers 0.21.1
torch 2.6.0
torchaudio 2.6.0
torchvision 0.21.0
tqdm 4.67.1
transformers 4.51.3
triton 3.2.0
typer 0.16.0
typing_extensions 4.13.2
typing-inspection 0.4.1
tzdata 2025.2
urllib3 2.4.0
uvicorn 0.34.2
uvloop 0.21.0
vllm 0.8.5.post1
watchfiles 1.0.5
websockets 15.0.1
wheel 0.45.1
wrapt 1.17.2
xformers 0.0.29.post2
xgrammar 0.1.18
xxhash 3.5.0
yarl 1.20.0
zipp 3.22.0
