Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about pretraining in stereo task #37

Open
wmn931201 opened this issue Mar 30, 2021 · 5 comments
Open

Question about pretraining in stereo task #37

wmn931201 opened this issue Mar 30, 2021 · 5 comments

Comments

@wmn931201
Copy link

Thanks for your wonderful work!
One thing that bothers me is why you don’t use sceneflow dataset during pre-training in stereo task. The sceneflow dataset has more data than flythings3D subset, and there are monka and driving subset in sceneflow dataset. In theory, using sceneflow dataset has better generalization performance.

Thanks~

@yzcjtr
Copy link
Collaborator

yzcjtr commented Mar 30, 2021

Hi, a pretty good question! I agree that pretraining on the SceneFlow dataset can be rewarding. When we did this work, we found many contemporary work in stereo used different training data actually (though such information can not always be found in the paper but in the implementation), which makes the comparison not quite fair. Therefore, we ended up using the minimal data and already found the numbers look good. So we didn't bother to use additional data. As a trade-off, the provided pretrained model might not be quite generalizable. If you have any better model, feel free to contribute :)

@wmn931201
Copy link
Author

Hi, I've tried to pretrain on sceneflow dataset,but the result is very strange, the output result is all zero. In the previous issue, someone have asked the similar questions, but he or she only used the flythings3d dataset in sceneflow. and when reading images, he or she use Image.open(filename).convert(RGB), and I do the same with reading images, because some images in sceneflow is RGBA format which has 4 channels. His or her final conclusion is that there is somthing wrong in dataloader, I haven't found the reason yet. and I am exploring in code.

Thank you very much!

@yzcjtr
Copy link
Collaborator

yzcjtr commented Apr 1, 2021

Cool. Can you reference the issue here? Not sure which one exactly you are referring to.

As for the data loader, I don't find any problems when I'm using the datasets I mentioned in the readme file. I think you do need to watch out for it when you handle a new dataset such as SceneFlow. Maybe you could preprocess all the images to get rid of those RGBA format? I would suggest running a few examples and make sure the input images and labels given by the data loader all look good. Tensorboard would be also helpful for checking this.

@wmn931201
Copy link
Author

The issue is #24. Yes, I am checking dataset now. and RGBA images are the majority of the Sceneflow dataset, so they can't be ignored. Thanks!

@wmn931201
Copy link
Author

Hi,I find the value of annotations is negative in flythings3D_subset, but the value of annotations is positive in Sceneflow dataset. In your code, when reading labels, disp = np.expand_dims(-read_pfm_file(file_name), axis=-1) is used to transform disp to be positive . So I guess that I should use disp = np.expand_dims(read_pfm_file(file_name), axis=-1), ignoring the “-” operation while training sceneflow dataset. Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants