fix rope deltas in training #423

Goekdeniz-Guelmez · 2025-07-10T13:18:54Z

No description provided.

Goekdeniz-Guelmez · 2025-07-10T13:19:50Z

addresses #404

Goekdeniz-Guelmez · 2025-07-10T13:20:42Z

this is not tested, I will do so when I'm home.

Goekdeniz-Guelmez · 2025-07-10T13:39:06Z

also #409

Blaizzy · 2025-07-22T00:38:50Z

> this is not tested, I will do so when I'm home.

Is it ready?

Blaizzy · 2025-09-03T09:10:05Z

ping @Goekdeniz-Guelmez

Goekdeniz-Guelmez · 2025-09-03T09:37:59Z

should be working, can you try it out too @Blaizzy?

Goekdeniz-Guelmez · 2025-09-03T10:05:55Z

python -m mlx_vlm.lora \
  --model-path mlx-community/Qwen2-VL-2B-Instruct-bf16 \
  --dataset TIGER-Lab/VisualWebInstruct-Seed --dataset-config 'reference' \
  --output-path /Volumes/T7_Shield/mlx-vlm \
  --batch-size 1 \
  --steps 20 \
  --learning-rate 1e-4
INFO:main:Loading model from mlx-community/Qwen2-VL-2B-Instruct-bf16
Fetching 11 files: 100%|█████████████████████████████████████████████| 11/11 [00:00<00:00, 13768.23it/s]
The image processor of type Qwen2VLImageProcessor is now loaded as a fast processor by default, even if the model checkpoint was saved with a slow processor. This is a breaking change and may produce slightly different outputs. To continue using the slow processor, instantiate this class with use_fast=False. Note that this behavior will be extended to all models in a future release.
Fetching 11 files: 100%|█████████████████████████████████████████████| 11/11 [00:00<00:00, 74295.24it/s]
INFO:main:Loading dataset from TIGER-Lab/VisualWebInstruct-Seed
INFO:main:Applying chat template to the dataset
INFO:main:Setting up LoRA
#trainable params: 11.54048 M || all params: 1543.714304 M || trainable%: 0.748%
INFO:main:Setting up optimizer
INFO:main:Setting up trainer
INFO:main:Training model
{'Epoch': 0, 'Step': 0, 'Loss': '1.2262'}
{'Epoch': 0, 'Step': 10, 'Loss': '1.7623'}
100%|████████████████████████████████████| 20/20 [00:51<00:00, 2.55s/it, Epoch=0, Step=19, Loss=1.9172]
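As a quick sanity check on the log above, the reported trainable share can be recomputed from the two parameter counts it prints. This is only a minimal sketch: the helper name `trainable_percent` is hypothetical and is not part of mlx_vlm; the numbers are copied from the log.

```python
def trainable_percent(trainable_m: float, total_m: float) -> float:
    """Return the share of trainable parameters as a percentage.

    Both arguments are parameter counts in millions, as printed in the log.
    Hypothetical helper for illustration only, not an mlx_vlm function.
    """
    if total_m <= 0:
        raise ValueError("total parameter count must be positive")
    return 100.0 * trainable_m / total_m


# Values copied from the "#trainable params" line of the log.
pct = trainable_percent(11.54048, 1543.714304)
print(f"trainable%: {pct:.3f}%")
```

The result rounds to the 0.748% shown in the log, so the LoRA setup line is internally consistent.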

adding sanitizing on the params after optimizer

b8d106f

Goekdeniz-Guelmez and others added 3 commits July 10, 2025 21:00

Merge branch 'main' into fix-rope-deltas-in-training

9b7161e

fix image token

fd83282

Merge branch 'main' into fix-rope-deltas-in-training

dc771c2

Merge branch 'main' into fix-rope-deltas-in-training

07a7622

Goekdeniz-Guelmez mentioned this pull request Sep 3, 2025

Multiple erros with lora.py and Qwen 2.5 VL #409

Open

fix

7a1c9a9

Merge branch 'Blaizzy:main' into fix-rope-deltas-in-training

55ac081
