Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

data integrity issue in DDD 17 dataset #10

Open
youkaichao opened this issue Jan 14, 2022 · 8 comments
Open

data integrity issue in DDD 17 dataset #10

youkaichao opened this issue Jan 14, 2022 · 8 comments

Comments

@youkaichao
Copy link

Hi, thanks for your valuable efforts in providing such a large dataset. When I try to use the DDD 17 dataset, I encountered several integrity issues:

  1. I download the DDD 17 dataset via resilio sync. It seems run5/rec1487858093.hdf5 is missing. Is it a problem at the server side or the client side? If it is a client-side issue, I can re-download that file.
  2. After exporting data from those hdf5 files using export_ddd20_hdf.py, I found that several recordings have some integrity issues. In some recordings, timestamps of event data are expected to be increasing but they are actually not; In some recordings, timestamps of frame_ts data are expected to be increasing but they are actually not. Will sorting by timestamps solve the problem? Or is it caused by some deeper reason meaning that the entire recording is invalid?

Below are a list of recordings with data integrity issues:

run3/rec1487355090.hdf5
run3/rec1487356509.hdf5
run3/rec1487417411.hdf5
run3/rec1487419513.hdf5
run3/rec1487424147.hdf5
run3/rec1487427200.hdf5
run3/rec1487430438.hdf5
run3/rec1487433587.hdf5
run3/rec1487594667.hdf5
run3/rec1487600962.hdf5
run5/rec1487849663.hdf5
run5/rec1487860613.hdf5
run5/rec1487864316.hdf5

@tobidelbruck
Copy link
Contributor

tobidelbruck commented Jan 14, 2022 via email

@youkaichao
Copy link
Author

The DDD 20 is just too large for me to store, so I just downloaded the DDD 17 dataset.

@tobidelbruck
Copy link
Contributor

tobidelbruck commented Jan 14, 2022 via email

@youkaichao
Copy link
Author

Well, I'm not asking for a few samples from DDD 20. Because of practical issues, I would like to stick with DDD 17. And then I found some integrity issues in DDD 17. I want to know if these integrity issues can be resolved. e.g. will sorting by timestamps solve the problem? Or is it caused by some deeper reason meaning that the entire recording is invalid? If the latter case is true, maybe you can mention it on the DDD 17 homepage or just remove those invalid recordings to avoid unnecessary download of invalid files.

@tobidelbruck
Copy link
Contributor

tobidelbruck commented Jan 15, 2022 via email

@youkaichao
Copy link
Author

Thanks. The DDD 17 dataset is stored in a headless server and so I cannot view it. By original dataset paper, do you mean the ICML workshop paper "DDD17: End-To-End DAVIS Driving Dataset"? I don't see a ICLR paper.

@tobidelbruck
Copy link
Contributor

tobidelbruck commented Jan 15, 2022 via email

@tobidelbruck
Copy link
Contributor

tobidelbruck commented Jan 19, 2022 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants