Conversation

jonathanmiddleton

Add FastVLM (https://huggingface.co/papers/2412.13303) support.

  • FastVLM model implementation
  • Sideband loading of the Core ML model package to work around a Hugging Face cache issue
  • Removed stale tests

- Add a `test_fastvlm` function to validate the new model.
- Introduce `FastVLMMultiModalProjector` and `FastVLM` model classes to support vision-text multimodal operations.
- Enhance `convert.py` with Core ML vision tower support and new CLI options (`--only-llm`, `--skip-vision`).
- Refactor utilities to handle Core ML vision tower loading and add a `force_download` option.
- Clean up unused and redundant test cases in `test_utils.py`.
- Add configuration for the `llava_qwen2` model type in prompt and model utilities.
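The sideband-loading idea above can be sketched in plain Python. Core ML `.mlpackage` files are directories, which the Hugging Face cache represents as trees of symlinked blob files; copying the package out of the snapshot into a separate location yields a self-contained bundle that Core ML can open. The helper below is a hypothetical illustration, not the actual `mlpackage_cache.py` code, and the function and directory names are assumptions:

```python
import shutil
from pathlib import Path


def cache_mlpackage_sideband(repo_snapshot: Path, sideband_root: Path, repo_id: str) -> Path:
    """Copy the first *.mlpackage directory found in an HF snapshot into a
    sideband cache directory, resolving symlinks along the way.

    Hypothetical sketch of the sideband-loading approach; mlx_vlm's real
    implementation may differ in naming and layout.
    """
    packages = sorted(repo_snapshot.glob("*.mlpackage"))
    if not packages:
        raise FileNotFoundError(f"no .mlpackage found in {repo_snapshot}")
    src = packages[0]

    # Mirror the HF convention of flattening "org/name" into one path segment.
    dst = sideband_root / repo_id.replace("/", "--") / src.name
    if not dst.exists():
        dst.parent.mkdir(parents=True, exist_ok=True)
        # symlinks=False resolves the cache's blob symlinks into real files,
        # producing a self-contained package Core ML can load directly.
        shutil.copytree(src, dst, symlinks=False)
    return dst
```

The copied path can then be handed to `coremltools.models.MLModel` (or whatever loader the vision tower uses) without the cache's symlink indirection getting in the way.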
Copilot AI review requested due to automatic review settings (September 4, 2025, 18:42)

Copilot AI left a comment

Pull Request Overview

This PR adds FastVLM support to the mlx-vlm project, implementing a new vision-language model with unique Core ML vision tower loading capabilities. FastVLM uses a sideband loading mechanism to overcome Hugging Face cache compatibility issues with Core ML model packages.

  • Add FastVLM model implementation with Core ML vision tower support
  • Implement sideband loading system for Core ML model packages to work around HF cache limitations
  • Update utilities and conversion scripts to support new model architecture

Reviewed Changes

Copilot reviewed 10 out of 11 changed files in this pull request and generated 4 comments.

Summary per file:

  • mlx_vlm/utils.py: Add Core ML import, model type mapping, and vision tower loading logic
  • mlx_vlm/tests/test_utils.py: Comment out stale quantization tests
  • mlx_vlm/tests/test_models.py: Add comprehensive FastVLM model tests
  • mlx_vlm/prompt_utils.py: Add message format support for the llava_qwen2 model type
  • mlx_vlm/models/fastvlm/: New FastVLM model implementation with language model and Core ML integration
  • mlx_vlm/hf_tools/mlpackage_cache.py: New Core ML package caching and resolution utilities
  • mlx_vlm/convert.py: Add conversion support with Core ML file copying and new CLI options
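The prompt_utils change registers a message format for the llava_qwen2 model type. Since FastVLM pairs its vision tower with a Qwen2 language model, a plausible format is Qwen2's ChatML-style layout with an `<image>` placeholder prepended to the first user turn. The function below is a hedged sketch of that idea; the exact token layout and function name are assumptions, not the merged code:

```python
def format_llava_qwen2_prompt(messages, num_images=1):
    """Render chat messages into a Qwen2-style (ChatML) prompt string,
    inserting one <image> placeholder per input image before the first
    user message. Hypothetical sketch of llava_qwen2 prompt support.
    """
    parts = []
    for i, msg in enumerate(messages):
        content = msg["content"]
        if i == 0 and msg["role"] == "user":
            # Image placeholders are consumed by the vision tower's features.
            content = "".join("<image>\n" for _ in range(num_images)) + content
        parts.append(f"<|im_start|>{msg['role']}\n{content}<|im_end|>\n")
    # Open the assistant turn so generation continues from here.
    parts.append("<|im_start|>assistant\n")
    return "".join(parts)
```

For example, a single user message "Describe the image." would render as a user block starting with `<image>` followed by an open assistant block.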
