-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
LC_ALL
Environment Variable Not Applied to sort
Command in Galaxy Sort Tool
#18967
Comments
Good find @lybCNU! @bgruening can you transfer the issue to your repo? It refers to https://github.com/bgruening/galaxytools/blob/master/tools/text_processing/text_processing/sort.xml I guess just adding a semicolon after |
I only see repos from this organisation, not my repo. I don't get why a semicolon should help. We need to make sure we have a test for this. |
Currently, I have only tested on usegalaxy.eu and usegalaxy.org, and the bug was only found on usegalaxy.eu. By accessing the command line via Jupyter notebook, I observed that on usegalaxy.org, According to ChatGPT, the reason is that with |
This makes sense. But we have |
That makes sense. However, for some reason, the actual command being run does not have |
Ah gotcha, PR with the fix is here: bgruening/galaxytools#1520 Thanks for reporting @lybCNU! |
Describe the bug
The
LC_ALL
environment variable is only applied tosed
when the Galaxy sort tool is used with a non-zero number of header lines. This leads to inconsistent sorting results as theLC_ALL
environment variable is not applied to thesort
command.This bug was discovered while using the Data Manipulation Olympics tutorial , created by @shiltemann, from training.galaxyproject.org as a teaching resource.
Galaxy Version and/or server at which you observed the bug
Galaxy Version: [24.1.2.dev0] at https://usegalaxy.eu/
To Reproduce
Steps to reproduce the behavior:
Number of header lines
to 1.Number of header lines
to 0 and observe the following generated command:Number of header lines
set to 1, the athlete listed at the top is Adolf Schmal instead of A. Grigoriadis.Expected behavior
The
LC_ALL
environment variable should be applied to bothsed
andsort
commands, ensuring consistent sorting results. The expected command should be:Additional context
The issue occurs because
LC_ALL=C
is not applied to thesort
command whenNumber of header lines
is set to 1. This inconsistency results in incorrect sorting behavior. ApplyingLC_ALL=C
to the entire command resolves the issue. The test dataset used isolympics.tsv
.The text was updated successfully, but these errors were encountered: