Initial Weights Initialization #92
Comments
@Nadavc220 sorry for the confusion. In this repo, the pre-trained weights for the VGG backbone are from ImageNet, while those for the ResNet backbone are from ImageNet + COCO (obtained from DeepLab). In practice, we found that ImageNet pre-trained weights converge more slowly but eventually reach a result similar to ImageNet + COCO. Similarly, GTA5 pre-trained weights should reach a similar result with faster convergence.
Thanks for the quick response. Thanks
@Nadavc220 This is a good point! Internally, the learning behavior would be quite different, as we need to consider the domain gaps. For example, initializing from GTA5 could provide a more stable training procedure, as GTA5 is already a driving-scene dataset. However, since GTA5 also has a large domain gap to Cityscapes, pre-training on GTA5 for too many iterations would not be good practice either (it would fit too closely to the GTA5 data distribution). This is something we have already tried empirically, but of course it is still an open research problem to explore. To keep things simple, we just use the ImageNet (+COCO) pre-trained weights.
When training the model, you initialize the model weights with the weights found here:
'http://vllab.ucmerced.edu/ytsai/CVPR18/DeepLab_resnet_pretrained_init-f81d91e8.pth'
The paper states that the initial baseline is a DeepLab model pre-trained on the ImageNet dataset. Are these weights ImageNet pre-trained, or are you using a GTA5 pre-trained network to initialize the model?
Thanks.
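For readers following the thread, here is a minimal sketch of how a pre-trained checkpoint like the one linked above might be merged into a freshly built model. It is not taken from this repo's training script. The helper name `merge_pretrained`, the `strip_prefix` and `skip_substrings` parameters, and any key names are hypothetical and chosen for illustration only. The helper works on plain mappings, so it can be tried without downloading anything.

```python
def merge_pretrained(model_state, pretrained_state, strip_prefix="", skip_substrings=()):
    """Copy compatible entries from pretrained_state into a copy of model_state.

    Hypothetical helper (not part of the repo). An entry is copied only if,
    after optionally stripping `strip_prefix` from its name, the key exists in
    the model, contains none of `skip_substrings`, and has the same `.shape`.
    Returns (merged_state, skipped_names).
    """
    merged = dict(model_state)  # shallow copy; the caller's dict is untouched
    skipped = []
    for name, value in pretrained_state.items():
        key = name
        if strip_prefix and name.startswith(strip_prefix):
            key = name[len(strip_prefix):]
        if any(sub in key for sub in skip_substrings):
            skipped.append(name)  # e.g. a classifier head the caller wants to retrain
            continue
        if key not in merged:
            skipped.append(name)  # no matching parameter in the model
            continue
        if getattr(merged[key], "shape", None) != getattr(value, "shape", None):
            skipped.append(name)  # shape mismatch, e.g. different number of classes
            continue
        merged[key] = value
    return merged, skipped


# Possible usage with PyTorch (assumes `model` is already constructed and
# `url` holds the checkpoint address quoted in this issue):
#
#   import torch
#   pretrained = torch.hub.load_state_dict_from_url(url, map_location="cpu")
#   merged, skipped = merge_pretrained(model.state_dict(), pretrained)
#   model.load_state_dict(merged)
#   print("not loaded:", skipped)
```

Printing the skipped names is a cheap way to check which layers actually received pre-trained values and which kept their default initialization.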