Add unit tests for various edge cases #97
Conversation
### Tests for model dtype edge cases
@pytest.mark.skipif(
    not (torch.cuda.is_available() and torch.cuda.is_bf16_supported()),
The cuda check is needed here because the bf16 check throws if no NVIDIA drivers are available.
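As a small standalone sketch of that point (illustration only, not the exact repository code), the `and` short-circuits so the bf16 check is never reached on a machine without NVIDIA drivers:

import torch

# Illustration only: torch.cuda.is_available() is checked first, so
# torch.cuda.is_bf16_supported() is never called on a machine without NVIDIA
# drivers, where it can throw instead of returning False.
def bf16_usable() -> bool:
    return torch.cuda.is_available() and torch.cuda.is_bf16_supported()

print(bf16_usable())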
Great test cases, thanks Alex! Left a few questions...
tests/test_sft_trainer.py
Outdated
def test_data_path_does_not_exist():
    """Ensure that we get a FileNotFoundError if the data is missing completely."""
    TRAIN_KWARGS = {
        **BASE_PEFT_KWARGS,
        **{"training_data_path": "/foo/bar/foobar", "output_dir": "foo/bar/baz"},
    }
    model_args, data_args, training_args, tune_config = causal_lm_train_kwargs(
        TRAIN_KWARGS
    )
    with pytest.raises(FileNotFoundError):
        sft_trainer.train(model_args, data_args, training_args, tune_config)
this test already exists -- https://github.com/foundation-model-stack/fms-hf-tuning/blob/main/tests/test_sft_trainer.py
nice catch, removed, thanks!
# TODO: Fix this, currently unreachable due to crashing in batch encoding tokenization
# We should do this validation up front, then do the encoding, then handle the collator
raise ValueError("Response template is None, needs to be set for training")
Are you saying it fails before hitting this ValueError, perhaps on line 158 with
response_template_ids = tokenizer.encode(
    data_args.response_template, add_special_tokens=False
)[2:]
in which case should this validation be moved up?
Yeah, you can't encode a None type with a tokenizer, since tokenizers generally expect an input of type Union[TextInputSequence, Tuple[InputSequence, InputSequence]]. It would be best to do that in a separate PR to keep things atomic, even though it's a simple change, since some of the validation logic is a little delicate.
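For reference, a tiny standalone sketch (hypothetical, not code from this repo; any small model works) showing that the tokenizer rejects None before the explicit validation ever runs:

from transformers import AutoTokenizer

# Standalone illustration: passing None to encode() fails inside the tokenizer
# itself, which is why the explicit ValueError above is currently unreachable.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
try:
    tokenizer.encode(None, add_special_tokens=False)
except (TypeError, ValueError) as err:
    print(f"Tokenizer rejected None before any explicit validation ran: {err}")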
reason="Only runs if bf16 is unsupported", | ||
) | ||
def test_bf16_still_tunes_if_unsupported(): | ||
"""Ensure that even if bf16 is not supported, tuning still works without problems.""" |
Interesting test case! Can you explain why it doesn't fail, why tuning still works, and why this is the preferred expected behavior?
As far as I understand, on devices where bfloat16 is unsupported there is usually fallback behavior to a supported data type, typically float32, since bfloat16 and float32 have the same exponent size!
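A minimal sketch of that idea (an assumption about how one might pick the dtype, not the trainer's actual logic):

import torch

# Sketch only: prefer bf16 when the hardware reports support, otherwise fall
# back to float32, which shares bfloat16's 8-bit exponent and dynamic range.
def pick_training_dtype() -> torch.dtype:
    if torch.cuda.is_available() and torch.cuda.is_bf16_supported():
        return torch.bfloat16
    return torch.float32

print(pick_training_dtype())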
interesting! appreciate knowing the details
efa446f to 5b89073
Other than the linter error for a line being too long, the tests look good! Thanks Alex
tests/test_sft_trainer.py:422:0: C0301: Line too long (105/100) (line-too-long)
Thanks Alex!
* Add unit tests for various edge cases
  Signed-off-by: Alex-Brooks <[email protected]>
* Fix bf16 check in skipped test
  Signed-off-by: Alex-Brooks <[email protected]>
* Remove redundant test
  Signed-off-by: Alex-Brooks <[email protected]>
* Fix linting
  Signed-off-by: Alex-Brooks <[email protected]>
---------
Signed-off-by: Alex-Brooks <[email protected]>
Signed-off-by: aaron.chew1 <[email protected]>
This PR adds some extra unit tests for various edge cases. It also fixes (one of) the edge cases whose validation is currently unreachable, and changes a few stray sys.exit() calls to just raise errors, since I don't think there's a special reason to have those there at the moment.
NOTE: Some of these tests are testing errors that are thrown by HF libraries for things we don't explicitly validate ourselves, to ensure the behavior is stable with version bumps.
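As a hypothetical illustration of the sys.exit() change (the helper name below is made up, not the actual fms-hf-tuning code), raising an exception instead of exiting keeps the failure catchable by callers and assertable with pytest.raises:

import pytest

def validate_response_template(response_template):
    # Raising instead of calling sys.exit() keeps the failure catchable by
    # callers and by tests.
    if response_template is None:
        raise ValueError("Response template is None, needs to be set for training")

def test_missing_response_template_raises():
    with pytest.raises(ValueError):
        validate_response_template(None)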
Related issue number
This is a follow-up on: #74
We made the decision to split the PR into two parts, to get the tests covering the base functionality in earlier.
How to verify the PR
Was the PR tested