fix the bug where convert bn to sync_bn in the model for evaluation during the training phase will change the training model graph #300

FlyingQianMM · 2023-04-04T06:59:56Z

The origin implementation converts bn to sync_bn in the model for evaluation during the multi-gpu training phase, which is not necessarily and changes the training model graph, causes the gradient of multi-gpu cannot be gathered during later training backward:

so we remove the convertion.

…uring the training phase will change the training model graph

CLAassistant · 2024-09-30T03:22:58Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

fix the bug where convert bn to sync_bn in the model for evaluation d…

c0d4069

…uring the training phase will change the training model graph

nepeplwu self-assigned this Feb 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix the bug where convert bn to sync_bn in the model for evaluation during the training phase will change the training model graph #300

fix the bug where convert bn to sync_bn in the model for evaluation during the training phase will change the training model graph #300

Uh oh!

FlyingQianMM commented Apr 4, 2023

Uh oh!

CLAassistant commented Sep 30, 2024

Uh oh!

Uh oh!

fix the bug where convert bn to sync_bn in the model for evaluation during the training phase will change the training model graph #300

Are you sure you want to change the base?

fix the bug where convert bn to sync_bn in the model for evaluation during the training phase will change the training model graph #300

Uh oh!

Conversation

FlyingQianMM commented Apr 4, 2023

Uh oh!

CLAassistant commented Sep 30, 2024

Uh oh!

Uh oh!