
Conversation

rabinadk1

What does this PR do?

This pull request improves the trl/scripts/sft.py script, focusing on import optimization (making an eager import lazy) and minor API adjustments for clarity and correctness. The most significant changes are grouped below:

Import optimization and lazy loading:

  • Removed the eager import of AutoModelForCausalLM at the top of the file and moved it inside the relevant code path in the main function so it is loaded lazily (see the sketch below).
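
A minimal sketch of the lazy-import pattern described above (the function body here is illustrative, not the script's exact code):

```python
def main(script_args, training_args, model_args, dataset_args):
    # Imported here, on the code path that needs it,
    # rather than eagerly at the top of the module.
    from transformers import AutoModelForCausalLM

    model = AutoModelForCausalLM.from_pretrained(model_args.model_name_or_path)
    ...
```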

API and function signature improvements:

  • Added _added_tokens as the third unpacking target, since clone_chat_template returns three values, not two.
  • Refactored the make_parser function so the type hint for the subparsers argument is accurate, and simplified the logic for returning the correct parser instance (sketched after this list).
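
For the subparsers hint, a sketch of what the corrected signature could look like (assuming the fix wraps the existing argparse._SubParsersAction annotation in Optional to match the None default, which the later review comment about "the Optional" suggests):

```python
import argparse
from typing import Optional

from trl import TrlParser


def make_parser(subparsers: Optional[argparse._SubParsersAction] = None) -> TrlParser:
    ...
```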

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

```diff
@@ -123,7 +126,7 @@ def main(script_args, training_args, model_args, dataset_args):
     # Set default chat template if needed
     if tokenizer.chat_template is None:
         # TODO: source should be passed as an argument
-        model, tokenizer = clone_chat_template(model, tokenizer, "Qwen/Qwen3-0.6B")
+        model, tokenizer, _added_tokens = clone_chat_template(model, tokenizer, "Qwen/Qwen3-0.6B")
```
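
Context for the fix above: unpacking a three-element return into two names fails at runtime, so the previous two-name form would raise a ValueError. A minimal illustration (toy function, not the TRL API):

```python
def returns_three():
    return "model", "tokenizer", ["<new_token>"]

model, tokenizer = returns_three()
# ValueError: too many values to unpack (expected 2)
```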
Collaborator

Suggested change:

```diff
-model, tokenizer, _added_tokens = clone_chat_template(model, tokenizer, "Qwen/Qwen3-0.6B")
+model, tokenizer, _ = clone_chat_template(model, tokenizer, "Qwen/Qwen3-0.6B")
```

Author

I wanted to do this as well, but then opted to make it more explicit to users.

Collaborator

As you like... ruff etc. might complain. I think this and the Optional should be the only changes in this PR.

Member

Agree with @kashif.


```python
from accelerate import logging
from datasets import load_dataset
from transformers import AutoConfig, AutoModelForCausalLM, AutoTokenizer
```

Member

I think you can revert this change.

Comment on lines -164 to -168:

```python
if subparsers is not None:
    parser = subparsers.add_parser("sft", help="Run the SFT training script", dataclass_types=dataclass_types)
else:
    parser = TrlParser(dataclass_types)
return parser
```

Member

IMO, using a consistent indent level for the return is a better practice.
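
To make the style point concrete, here is a sketch of the two shapes being compared (imports assume trl's top-level exports; the early-return version is my reading of what the PR introduced, not the PR's exact code):

```python
from trl import DatasetMixtureConfig, ModelConfig, ScriptArguments, SFTConfig, TrlParser

dataclass_types = (ScriptArguments, SFTConfig, ModelConfig, DatasetMixtureConfig)


def make_parser_single_return(subparsers=None):
    # One return at a consistent indent level, as the reviewer prefers.
    if subparsers is not None:
        parser = subparsers.add_parser("sft", help="Run the SFT training script", dataclass_types=dataclass_types)
    else:
        parser = TrlParser(dataclass_types)
    return parser


def make_parser_early_return(subparsers=None):
    # Early returns inside each branch.
    if subparsers is not None:
        return subparsers.add_parser("sft", help="Run the SFT training script", dataclass_types=dataclass_types)
    return TrlParser(dataclass_types)
```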
