Skip to content

Conversation

yuanxion
Copy link
Contributor

@yuanxion yuanxion commented Sep 12, 2025

Details

Description of the issue

Symptom

manual_yolo11 model performance dropped from 362.4 FPS to 318.25 FPS on GPU.

Root cause

  • previous PR will force all crop primitives followed by onednn concatenation to clean its GPU memory by filling with zeros if it is blocked format.
  • manual_yolo11 model also has many such crop primitives, so its performance will drop.

How to fix it

  • Found that filling GPU memory with zeros can be skipped if the crop primitive uses eltwise_blocked_opt kernel and is not dynamic, so just skip it by checking crop primitive's kernel name.

The code and line that caused this issue

if (get_node().is_type<crop>() && get_node().can_share_buffer() && _impl_params->get_output_layout(0).format.is_blocked() &&

Reproduction step and snapshot

  • benchmark_app
    benchmark_app -inference_only false -b 1 -t 60 -nireq 4 -d GPU.0 -hint none -nstreams 2 -m INT8/1/ov/optimized/manual_yolo11.xml

Problematic graph

  • crop primitive (eltwise_blocked_opt kernel) followed by onednn concatenation in manual_yolo11
image

Checklist

  • Is it a proper fix? (not a workaround)
  • Did you include test case for this fix, if necessary? No need
  • Did you review existing test that can be extended to cover this scenario? Which test did you review?

Tickets:

@yuanxion yuanxion requested review from a team as code owners September 12, 2025 07:01
@github-actions github-actions bot added the category: GPU OpenVINO GPU plugin label Sep 12, 2025
@yuanxion
Copy link
Contributor Author

Also checked that the change in this PR will not cause accuracy drop for hbonat/nanodet models in PR #31140.

@p-durandin p-durandin added this to the 2025.4 milestone Sep 12, 2025
@p-durandin p-durandin added this pull request to the merge queue Sep 16, 2025
Merged via the queue into openvinotoolkit:master with commit 67095bf Sep 16, 2025
186 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: GPU OpenVINO GPU plugin
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants