support no apllication of chat template for instructions #3

Ssukriti · 2024-02-26T06:58:15Z

This change exposes a flag apply_chat_instruction_template to allow users to control if chat template is applied or not by default to prompt/completion data format supported by SFT Trainer.

if flag is set to False, the prompt and completion are just concatenated and masking is applied using length of prompt .

Signed-off-by: Sukriti-Sharma4 <[email protected]>

alex-jw-brooks · 2024-02-26T18:20:57Z

trl/trainer/sft_trainer.py

@@ -250,7 +251,7 @@ def make_inputs_require_grad(module, input, output):
        if formatting_func is None and dataset_text_field is None:
            # check if dataset has ChatML format or instruction format and is supported
            # if not stays #None
-            formatting_func = get_formatting_func_from_dataset(train_dataset, tokenizer)
+            formatting_func = get_formatting_func_from_dataset(train_dataset, tokenizer, apply_chat_instruction_template)

        requires_input_output_keys = False


This variable name should probably be changed - it doesn't make sense if the keys are prompt / completion

ya I changed it below and left that one place, thanks for catching it

Signed-off-by: Sukriti-Sharma4 <[email protected]>

support no apllication of chat template for instructions

b8aec18

Signed-off-by: Sukriti-Sharma4 <[email protected]>

alex-jw-brooks reviewed Feb 26, 2024

View reviewed changes

change variable name

6973b15

Signed-off-by: Sukriti-Sharma4 <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support no apllication of chat template for instructions #3

support no apllication of chat template for instructions #3

Ssukriti commented Feb 26, 2024

alex-jw-brooks Feb 26, 2024

Ssukriti Feb 26, 2024

support no apllication of chat template for instructions #3

Are you sure you want to change the base?

support no apllication of chat template for instructions #3

Conversation

Ssukriti commented Feb 26, 2024

alex-jw-brooks Feb 26, 2024

Choose a reason for hiding this comment

Ssukriti Feb 26, 2024

Choose a reason for hiding this comment