finetuning problem with evfsam2 #26

vvvvvjdy · 2024-09-23T14:19:30Z

Nice work of extending sams ability to text guidied!
We have used your evfsam1 as our baseline to a new area and showd a signifcant performance. However, when we finetuned your evfsam2, it was easy to overfiiting(didnt see in evfsam1).
Did you met the same prob when your fintune sam2 ? or some hyperpramters different from sam1?
Hope to recieve your suggestion!

CoderZhangYx · 2024-09-23T16:05:45Z

The main differences between sam1 and sam2 lie in:

pre-process: sam2 uses resize(1024), while sam1 uses resizelongest(1024) + padding.
sam2 uses hierachical image encoder and sam1 uses ViT
sam2 applies skip-connection to mask decoder.

Might these differences affect your training?

yi-ming-qian · 2024-11-30T00:08:38Z

@vvvvvjdy can you share your fine-tuning script? many thanks.

vvvvvjdy · 2024-12-04T17:14:48Z

Sorry for the late reply. During my experiment，I found that the data augmentation caused the overfitting probelm. Once I used the same augmentation as used in sam2 pretrain (only random hflip for image) ,the problem was solved. I assume that stronger data augmentation for sam2 than pre-training may cause the model to learn some unnatural shortcut features. The fine-tuneing scripe is simple，just like many sam1 fine-tuneing works. Only difference is the backbone. (I just train my model on images) Regards. yi-ming-qian ***@***.***> 于 2024年11月30日周六上午8:09写道：

…

@vvvvvjdy <https://github.com/vvvvvjdy> can you share your fine-tuning script? many thanks. — Reply to this email directly, view it on GitHub <#26 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/BACXUVSRN6EEIJO7MEBG35L2DD6ZZAVCNFSM6AAAAABOWGI2AWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKMBYG42DKNJXGY> . You are receiving this because you were mentioned.Message ID: ***@***.***>

CoderZhangYx · 2024-12-05T02:12:52Z

The augmentations influence the model performance in another way. In referring segmentation tasks, text prompts contain geometric words like "on the left". Once flipping or cropping or some other augmentations are applied, the prompts would fail. So only non-geometric augmentations are recommended.

vvvvvjdy · 2024-12-05T16:34:25Z

@CoderZhangYx Quite agree with this statement. But even without such prompts in my finetuning data, the stronger data augmentation than the pretraining may cause this problem (some works have demonstrate the small-size model slike resnet-18 are especially sensitive to aug)，I m shocked that such a large foundation model sam2 has such characteristics.

CoderZhangYx · 2024-12-06T07:07:44Z

That's amazing. What aug did you use? Could it be the reason that the aug didn't applied to the source fed to multi_model_extractor? Curious about this bug, honestly.

vvvvvjdy · 2024-12-06T16:30:47Z

@CoderZhangYx I originally use large scale jittering(strong) for the input image(both beit and sam1 or 2) and gtmask, and found it works well on evfsam1 but not on evfsam2. Did you use the same aug for evfsam1 and 2, and which aug did you use ? ( it not mentioned in the paper)?

CoderZhangYx · 2024-12-09T08:05:05Z

In fact we use no aug when training our model. It is so strange that scale jittering affect performance of sam2. Inform me if you find out any other reasons, thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

finetuning problem with evfsam2 #26

finetuning problem with evfsam2 #26

vvvvvjdy commented Sep 23, 2024

CoderZhangYx commented Sep 23, 2024 •

edited

Loading

yi-ming-qian commented Nov 30, 2024

vvvvvjdy commented Dec 4, 2024 via email

CoderZhangYx commented Dec 5, 2024

vvvvvjdy commented Dec 5, 2024

CoderZhangYx commented Dec 6, 2024

vvvvvjdy commented Dec 6, 2024

CoderZhangYx commented Dec 9, 2024

finetuning problem with evfsam2 #26

finetuning problem with evfsam2 #26

Comments

vvvvvjdy commented Sep 23, 2024

CoderZhangYx commented Sep 23, 2024 • edited Loading

yi-ming-qian commented Nov 30, 2024

vvvvvjdy commented Dec 4, 2024 via email

CoderZhangYx commented Dec 5, 2024

vvvvvjdy commented Dec 5, 2024

CoderZhangYx commented Dec 6, 2024

vvvvvjdy commented Dec 6, 2024

CoderZhangYx commented Dec 9, 2024

CoderZhangYx commented Sep 23, 2024 •

edited

Loading