From 5b62f012671f2afeea36948ad1dc8ed4e8e09eca Mon Sep 17 00:00:00 2001 From: Johan Gustafsson <50811354+supernord@users.noreply.github.com> Date: Tue, 31 Oct 2023 20:50:22 +1030 Subject: [PATCH 1/8] Add bio.tools ID to cd_hit_dup.xml --- tools/cd_hit_dup/cd_hit_dup.xml | 5 ++++- 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/tools/cd_hit_dup/cd_hit_dup.xml b/tools/cd_hit_dup/cd_hit_dup.xml index 3aa3ee75..75c2cc24 100644 --- a/tools/cd_hit_dup/cd_hit_dup.xml +++ b/tools/cd_hit_dup/cd_hit_dup.xml @@ -5,6 +5,9 @@ cd-hit-auxtools + + cd-hit + @@ -122,4 +125,4 @@ cd-hit-dup provides a number of options to tune how the duplicates are removed:: 10.1093/bioinformatics/bts565 - \ No newline at end of file + From ec34180f2d10dd289fe1cdbd3d3f9dfc590f3a04 Mon Sep 17 00:00:00 2001 From: Johan Gustafsson <50811354+supernord@users.noreply.github.com> Date: Tue, 31 Oct 2023 22:25:05 +1030 Subject: [PATCH 2/8] Add bio.tools ID to multispecies_MicrosatDataGenerator_interrupted_GALAXY.xml --- .../multispecies_MicrosatDataGenerator_interrupted_GALAXY.xml | 3 +++ 1 file changed, 3 insertions(+) diff --git a/tools/multispecies_orthologous_microsats/multispecies_MicrosatDataGenerator_interrupted_GALAXY.xml b/tools/multispecies_orthologous_microsats/multispecies_MicrosatDataGenerator_interrupted_GALAXY.xml index 9193a3c6..f0b500c1 100755 --- a/tools/multispecies_orthologous_microsats/multispecies_MicrosatDataGenerator_interrupted_GALAXY.xml +++ b/tools/multispecies_orthologous_microsats/multispecies_MicrosatDataGenerator_interrupted_GALAXY.xml @@ -1,5 +1,8 @@ for multiple (>2) species alignments + + multispecies_orthologous_microsats + multispecies_MicrosatDataGenerator_interrupted_GALAXY.pl $input1 From 446167a2ffe653465fd880bf79dc66aa18bd3537 Mon Sep 17 00:00:00 2001 From: Johan Gustafsson <50811354+supernord@users.noreply.github.com> Date: Tue, 31 Oct 2023 22:58:40 +1030 Subject: [PATCH 3/8] Add bio.tools ID to quality_filter.xml --- tools/quality_filter/quality_filter.xml | 3 +++ 1 file changed, 3 insertions(+) diff --git a/tools/quality_filter/quality_filter.xml b/tools/quality_filter/quality_filter.xml index 6c4820b5..73d5b0c9 100644 --- a/tools/quality_filter/quality_filter.xml +++ b/tools/quality_filter/quality_filter.xml @@ -4,6 +4,9 @@ bx-python numpy + + qualityfilter + quality_filter.py $input From 112d35531f9689fc4b615ff49c16ad07a4771bf9 Mon Sep 17 00:00:00 2001 From: Johan Gustafsson <50811354+supernord@users.noreply.github.com> Date: Tue, 31 Oct 2023 23:48:58 +1030 Subject: [PATCH 4/8] Add bio.tools ID to gi2taxonomy.xml --- tool_collections/taxonomy/gi2taxonomy/gi2taxonomy.xml | 3 +++ 1 file changed, 3 insertions(+) diff --git a/tool_collections/taxonomy/gi2taxonomy/gi2taxonomy.xml b/tool_collections/taxonomy/gi2taxonomy/gi2taxonomy.xml index d0da3a9c..d7675725 100644 --- a/tool_collections/taxonomy/gi2taxonomy/gi2taxonomy.xml +++ b/tool_collections/taxonomy/gi2taxonomy/gi2taxonomy.xml @@ -3,6 +3,9 @@ taxonomy + + gi2taxonomy + gi2taxonomy.py $input $giField $idField $out_file1 ${GALAXY_DATA_INDEX_DIR} From be8032c0245e3e073ffdc43e0f8df661475b7ed8 Mon Sep 17 00:00:00 2001 From: Johan Gustafsson <50811354+supernord@users.noreply.github.com> Date: Wed, 1 Nov 2023 00:11:14 +1030 Subject: [PATCH 5/8] Add bio.tools ID to kraken2tax.xml --- tool_collections/taxonomy/kraken2tax/kraken2tax.xml | 3 +++ 1 file changed, 3 insertions(+) diff --git a/tool_collections/taxonomy/kraken2tax/kraken2tax.xml b/tool_collections/taxonomy/kraken2tax/kraken2tax.xml index 60cdd392..0a0cf4c2 100644 --- a/tool_collections/taxonomy/kraken2tax/kraken2tax.xml +++ b/tool_collections/taxonomy/kraken2tax/kraken2tax.xml @@ -4,6 +4,9 @@ gawk gb_taxonomy_tools + + kraken2tax + "${out_file}" From 19257a91b92f53598592b7980e59dd134c8471b0 Mon Sep 17 00:00:00 2001 From: Johan Gustafsson <50811354+supernord@users.noreply.github.com> Date: Fri, 3 Nov 2023 13:52:47 +1030 Subject: [PATCH 6/8] Add bio.tools ID to lca.xml --- tool_collections/taxonomy/lca_wrapper/lca.xml | 157 +++++++++--------- 1 file changed, 80 insertions(+), 77 deletions(-) diff --git a/tool_collections/taxonomy/lca_wrapper/lca.xml b/tool_collections/taxonomy/lca_wrapper/lca.xml index 11c3ac60..f2724faf 100644 --- a/tool_collections/taxonomy/lca_wrapper/lca.xml +++ b/tool_collections/taxonomy/lca_wrapper/lca.xml @@ -1,46 +1,49 @@ - - taxonomy - + + taxonomy + + + lca1 + lca.py $input1 $out_file1 $rank_bound - - - - - - - - - - - - - - - - - - - - - - - - + + + + + + + + + + + + + + + + + + + + + + + + - + - - - - + + + + @@ -53,48 +56,48 @@ - - - - - -**What it does** - -This tool identifies the lowest taxonomic rank for which a mategenomic sequencing read is diagnostic. It takes datasets produced by *Fetch Taxonomic Ranks* tool (aka Taxonomy format) as the input. - -------- - -**Example** - -Suppose you have two reads, **read_1** and **read_2**, with the following taxonomic profiles (scroll sideways to see the entire dataset):: - - read_1 1 root superkingdom1 kingdom1 subkingdom1 superphylum1 phylum1 subphylum1 superclass1 class1 subclass1 superorder1 order1 suborder1 superfamily1 family1 subfamily1 tribe1 subtribe1 genus1 subgenus1 species1 subspecies1 - read_1 2 root superkingdom1 kingdom1 subkingdom1 superphylum1 phylum1 subphylum1 superclass1 class1 subclass1 superorder1 order1 suborder1 superfamily1 family1 subfamily1 tribe1 subtribe1 genus2 subgenus2 species2 subspecies2 - read_2 3 root superkingdom1 kingdom1 subkingdom1 superphylum1 phylum3 subphylum3 superclass3 class3 subclass3 superorder3 order3 suborder3 superfamily3 family3 subfamily3 tribe3 subtribe3 genus3 subgenus3 species3 subspecies3 - read_2 4 root superkingdom1 kingdom1 subkingdom1 superphylum1 phylum4 subphylum4 superclass4 class4 subclass4 superorder4 order4 suborder4 superfamily4 family4 subfamily4 tribe4 subtribe4 genus4 subgenus4 species4 subspecies4 - -For **read_1** taxonomic labels are consistent until the genus level, where the taxonomy splits into two branches, one ending with *subspecies1* and the other with *subspecies2*. This implies **that the lowest taxomomic rank read_1 can identify is SUBTRIBE**. Similarly, read_2 is diagnostic up until the **superphylum** level. As a results the output of this tool will be:: - - read_1 2 root superkingdom1 kingdom1 subkingdom1 superphylum1 phylum1 subphylum1 superclass1 class1 subclass1 superorder1 order1 suborder1 superfamily1 family1 subfamily1 tribe1 subtribe1 n n n n - read_2 3 root superkingdom1 kingdom1 subkingdom1 superphylum1 n n n n n n n n n n n n n n n n n - -where, **n** means *EMPTY*. - --------- - -**What's up with the drop down?** - -Why do we need the *require the lowest rank to be at least* dropdown? Let's look at the above example again. Suppose you need to find only those reads that are diagnostic on at least phylum level. To do this you need to set the *require the lowest rank to be at least* to **phylum**. As a result your output will look like this:: - - read_1 2 root superkingdom1 kingdom1 subkingdom1 superphylum1 phylum1 subphylum1 superclass1 class1 subclass1 superorder1 order1 suborder1 superfamily1 family1 subfamily1 tribe1 subtribe1 n n n n - -.. class:: infomark - -Note, that **read_2** is now omitted as it matches two phyla (**phylum3** and **phylum4**) and therefore is not diagnostic (but rather cosmopolitan) on *phylum* level. - - - - - - - + + + + + +**What it does** + +This tool identifies the lowest taxonomic rank for which a mategenomic sequencing read is diagnostic. It takes datasets produced by *Fetch Taxonomic Ranks* tool (aka Taxonomy format) as the input. + +------- + +**Example** + +Suppose you have two reads, **read_1** and **read_2**, with the following taxonomic profiles (scroll sideways to see the entire dataset):: + + read_1 1 root superkingdom1 kingdom1 subkingdom1 superphylum1 phylum1 subphylum1 superclass1 class1 subclass1 superorder1 order1 suborder1 superfamily1 family1 subfamily1 tribe1 subtribe1 genus1 subgenus1 species1 subspecies1 + read_1 2 root superkingdom1 kingdom1 subkingdom1 superphylum1 phylum1 subphylum1 superclass1 class1 subclass1 superorder1 order1 suborder1 superfamily1 family1 subfamily1 tribe1 subtribe1 genus2 subgenus2 species2 subspecies2 + read_2 3 root superkingdom1 kingdom1 subkingdom1 superphylum1 phylum3 subphylum3 superclass3 class3 subclass3 superorder3 order3 suborder3 superfamily3 family3 subfamily3 tribe3 subtribe3 genus3 subgenus3 species3 subspecies3 + read_2 4 root superkingdom1 kingdom1 subkingdom1 superphylum1 phylum4 subphylum4 superclass4 class4 subclass4 superorder4 order4 suborder4 superfamily4 family4 subfamily4 tribe4 subtribe4 genus4 subgenus4 species4 subspecies4 + +For **read_1** taxonomic labels are consistent until the genus level, where the taxonomy splits into two branches, one ending with *subspecies1* and the other with *subspecies2*. This implies **that the lowest taxomomic rank read_1 can identify is SUBTRIBE**. Similarly, read_2 is diagnostic up until the **superphylum** level. As a results the output of this tool will be:: + + read_1 2 root superkingdom1 kingdom1 subkingdom1 superphylum1 phylum1 subphylum1 superclass1 class1 subclass1 superorder1 order1 suborder1 superfamily1 family1 subfamily1 tribe1 subtribe1 n n n n + read_2 3 root superkingdom1 kingdom1 subkingdom1 superphylum1 n n n n n n n n n n n n n n n n n + +where, **n** means *EMPTY*. + +-------- + +**What's up with the drop down?** + +Why do we need the *require the lowest rank to be at least* dropdown? Let's look at the above example again. Suppose you need to find only those reads that are diagnostic on at least phylum level. To do this you need to set the *require the lowest rank to be at least* to **phylum**. As a result your output will look like this:: + + read_1 2 root superkingdom1 kingdom1 subkingdom1 superphylum1 phylum1 subphylum1 superclass1 class1 subclass1 superorder1 order1 suborder1 superfamily1 family1 subfamily1 tribe1 subtribe1 n n n n + +.. class:: infomark + +Note, that **read_2** is now omitted as it matches two phyla (**phylum3** and **phylum4**) and therefore is not diagnostic (but rather cosmopolitan) on *phylum* level. + + + + + + + From d85e97369f0657210ac52a9ca532b24a5343f7e5 Mon Sep 17 00:00:00 2001 From: Johan Gustafsson <50811354+supernord@users.noreply.github.com> Date: Fri, 3 Nov 2023 13:57:55 +1030 Subject: [PATCH 7/8] Add bio.tools ID to t2ps_wrapper.xml --- tool_collections/taxonomy/t2ps/t2ps_wrapper.xml | 3 +++ 1 file changed, 3 insertions(+) diff --git a/tool_collections/taxonomy/t2ps/t2ps_wrapper.xml b/tool_collections/taxonomy/t2ps/t2ps_wrapper.xml index 7b651388..d025913b 100644 --- a/tool_collections/taxonomy/t2ps/t2ps_wrapper.xml +++ b/tool_collections/taxonomy/t2ps/t2ps_wrapper.xml @@ -3,6 +3,9 @@ taxonomy + + t2ps + t2ps_wrapper.py $input $out_file1 $max_tree_level $font_size $max_leaves 1 From 9e05ed4e7097e666dcd9b57668bf042c8df13d14 Mon Sep 17 00:00:00 2001 From: Johan Gustafsson <50811354+supernord@users.noreply.github.com> Date: Fri, 3 Nov 2023 14:03:40 +1030 Subject: [PATCH 8/8] Add bio.tools ID to t2t_report.xml --- tool_collections/taxonomy/t2t_report/t2t_report.xml | 3 +++ 1 file changed, 3 insertions(+) diff --git a/tool_collections/taxonomy/t2t_report/t2t_report.xml b/tool_collections/taxonomy/t2t_report/t2t_report.xml index ca6cc99d..67cab9e2 100644 --- a/tool_collections/taxonomy/t2t_report/t2t_report.xml +++ b/tool_collections/taxonomy/t2t_report/t2t_report.xml @@ -3,6 +3,9 @@ taxonomy + + t2t_report + taxonomy2tree $input 0 /dev/null $out_file1 0