Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Granule search based on its filename #18

Open
slesaad opened this issue Jan 11, 2021 · 7 comments
Open

Granule search based on its filename #18

slesaad opened this issue Jan 11, 2021 · 7 comments

Comments

@slesaad
Copy link
Contributor

slesaad commented Jan 11, 2021

It would be useful for scientists to be able to search for a granule based on its filename. Currently, there is no way to do it.

@jceaser
Copy link
Contributor

jceaser commented Jan 13, 2021

Are you talking about the GranuleMetaDataFile field?

@clynnes
Copy link

clynnes commented Jan 13, 2021

To clarify: "As a scientist, I can paste the filename of a data file I am looking at into some TBD tool, and search CMR for the granule metadata record corresponding to that data file." (Assuming that I have not changed the filename from what it was when I originally downloaded it, natch.)
Currently, this is possible only if you also supply some other qualifier, like the provider, or the collection conceptId. Which the scientist does not always have readily available.

@jceaser
Copy link
Contributor

jceaser commented Jan 13, 2021

that sounds like the echo onlineaccessurl AKA the related URL field

@clynnes
Copy link

clynnes commented Jan 13, 2021

Mmmm...I don't think so. We have only the filename, not the front part of the URL. Slesa and I were planning to loop through the providers until we found the filename in the readable_granule_id. (Seemed more efficient than looping through all 7500 collections until we found it.)

@clynnes
Copy link

clynnes commented Jan 13, 2021

(I do have a wild and crazy idea about inferring filename regex patterns, adding them as tags to the collection, and then searching through the collections until we find the regex matching the filename, then querying that collection.)

@jceaser
Copy link
Contributor

jceaser commented Jan 14, 2021

I don't see any granule field which accepts wild cards, I do find ranges. Partial field search would require some discussion I think.

@clynnes
Copy link

clynnes commented Jan 14, 2021

No, that's not where the regex comes in. The regex is used to figure out which collection a data file came from. We can then search for the granule by its exact filename within a single collection.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants