Metadata file fetching via the API #2741

dannon · 2016-08-05T13:47:23Z

This will address #2560

nsoranzo · 2016-08-05T13:56:07Z

I think this is also needed for galaxyproject/bioblend#192 .

dannon · 2016-08-05T14:23:04Z

@nsoranzo Yep, it is. His email to the mailing list is what made me look into this.

dannon · 2016-08-05T17:15:22Z

Happy to take suggestions for renaming the endpoint, this is just a first stab.

carlfeberhard · 2016-08-05T17:29:39Z

Well, since urls are forever (or something): It'd be best to just call it metadata_files. The get is already there.

Asides (+/-0):

you probably want to use web._future_expose... so you can just let the exception bubble up. The decorator will convert the exception to JSON and you won't have to return a string.
we've also used that valid chars thing in quite a few places now. /shrugs Might be out-of-scope.

dannon · 2016-08-05T17:30:37Z

@carlfeberhard Good catch on valid chars -- I saw the same when I was working on this and meant to go back and refactor it. Will do.

…erhard

…erhard.

jmchilton · 2016-08-05T18:52:18Z

tools/genomespace/genomespace_importer.py

@@ -192,7 +191,7 @@ def download_from_genomespace_importer( username, token, json_parameter_file, ge
        # if using tmp file, move the file to the new file path dir to get scooped up later
        if using_temp_file:
            original_filename = filename
-            filename = ''.join( c in VALID_CHARS and c or '-' for c in filename )
+            filename = ''.join( c in FILENAME_VALID_CHARS and c or '-' for c in filename )


This removes space as a valid character right? Is that intentional?

Hrmm. The valid chars set the genomespace tool uses is indeed different. Should this be the same set of chars, or not? And, if so, which chars?

(ping @blankenberg maybe)

This also adds ,^_ as valid characters (good catch, 🐦 👀 @jmchilton!)

So, it's definitely a slightly different set of characters that this particular tool was using, yes. The question is, what's the set we actually want for exported user-downloaded files? Either way, we should pick a single set and go with it.

My understanding is:

For files going into Galaxy - it does seem that Dan explicitly wanted to allow spaces here and history items can include names with spaces (and most of our flagship tools generate items with spaces) so I don't know why would exclude them here. I didn't make that decision, but it seems reasonable.

For downloads coming from Galaxy - I'm guessing we exclude spaces because they make the files easier to work with on the command-line. I didn't make that decision, but it seems reasonable.

So I don't think we need to be consistent about whitespace handling across these two different use cases. Can you explain more why you think they should be consistent - and if so do you want spaces in downloads or do you want to exclude spaces when importing from genomespace?

I guess it's a bigger problem than spaces, now that I think about it more. Right now we also rip out non-ascii characters like ä on egress that we don't on ingress, etc.

So, forgetting genomespace for a second, I can upload Müßiggänger.txt, which gets entered into the history exactly like that.

But when I download it, it's Galaxy140-[M__igg_nger.txt].txt, which is unfortunate and I think unreasonable.

That said, this was a random refactoring enhancement that got looped into this PR and I'm happy to rip out those particular Genomespace changes to move this forward if we'd all rather revisit it separately.

Here or in a new PR I'd be very happy to see any unicode alpha-numeric character added. I'm a little more +/- 0 on white listing shell relevant characters.

When I was investigating shell-safe characters for CWL stuff - I came across this library (https://pypi.python.org/pypi/regex) - which unicode-friendly extended character classes.

Sounds good. I'm just going to revert the genomespace changes for now since I really want to hear from @blankenberg on that and I don't want to hold this PR up any more.

Will follow up on extending the valid character set in a separate endeavor.

jmchilton · 2016-08-11T15:51:18Z

Cool beans @dannon - thanks!

Minimally functional metadata file fetching via the API

0bf9417

dannon added status/WIP kind/feature area/API labels Aug 5, 2016

dannon added this to the 16.10 milestone Aug 5, 2016

Add download url for metadata file.

e219dfe

scottx611x mentioned this pull request Aug 5, 2016

[Enhancement/Question] Be able to download meta files of Workflow outputs galaxyproject/bioblend#192

Open

dannon added status/review and removed status/WIP labels Aug 5, 2016

dannon added 2 commits August 5, 2016 13:57

Consolidate usage of 'valid_chars' in the application.

f87b08e

Rename get_metadata_file route to just metadata_file. Thanks @carlfeb…

ce04973

…erhard.

jmchilton reviewed Aug 5, 2016
View reviewed changes

Revert genomespace tool changes related to FILENAME_VALID_CHARS

c9ac723

jmchilton merged commit b40e993 into galaxyproject:dev Aug 11, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Metadata file fetching via the API #2741

Metadata file fetching via the API #2741

dannon commented Aug 5, 2016

nsoranzo commented Aug 5, 2016

dannon commented Aug 5, 2016

dannon commented Aug 5, 2016

carlfeberhard commented Aug 5, 2016

dannon commented Aug 5, 2016

jmchilton Aug 5, 2016

dannon Aug 5, 2016 •

edited

Loading

nsoranzo Aug 9, 2016

dannon Aug 10, 2016 •

edited

Loading

jmchilton Aug 10, 2016

dannon Aug 10, 2016 •

edited

Loading

jmchilton Aug 10, 2016

dannon Aug 11, 2016

jmchilton commented Aug 11, 2016

Metadata file fetching via the API #2741

Metadata file fetching via the API #2741

Conversation

dannon commented Aug 5, 2016

nsoranzo commented Aug 5, 2016

dannon commented Aug 5, 2016

dannon commented Aug 5, 2016

carlfeberhard commented Aug 5, 2016

dannon commented Aug 5, 2016

jmchilton Aug 5, 2016

Choose a reason for hiding this comment

dannon Aug 5, 2016 • edited Loading

Choose a reason for hiding this comment

nsoranzo Aug 9, 2016

Choose a reason for hiding this comment

dannon Aug 10, 2016 • edited Loading

Choose a reason for hiding this comment

jmchilton Aug 10, 2016

Choose a reason for hiding this comment

dannon Aug 10, 2016 • edited Loading

Choose a reason for hiding this comment

jmchilton Aug 10, 2016

Choose a reason for hiding this comment

dannon Aug 11, 2016

Choose a reason for hiding this comment

jmchilton commented Aug 11, 2016

dannon Aug 5, 2016 •

edited

Loading

dannon Aug 10, 2016 •

edited

Loading

dannon Aug 10, 2016 •

edited

Loading