Adding Seurat version of the PBMC clustering tutorial #5491

MarisaJL · 2024-10-31T13:30:35Z

This is the Seurat version of the "Clustering 3K PBMCs with Scanpy" single-cell tutorial.

Removing old images to replace with newer ones

pavanvidem · 2024-11-04T10:05:41Z

@MarisaJL I will review it soon.

pavanvidem

Reviewed and tested up to the preprocessing.

topics/single-cell/tutorials/scrna-seurat-pbmc3k/tutorial.md

pavanvidem · 2024-11-06T18:07:53Z

topics/single-cell/tutorials/scrna-seurat-pbmc3k/tutorial.md

+>        - *"Include features detected in at least this many cells"*: `3`
+>        - *"Include cells where at least this many features are detected"*: `200`
+>        - *"Calculate percentage of mito genes in each cell"*: `No`
+>


Please include the selection of genes.tsv and barcodes.tsv in the parameters.

pavanvidem · 2024-11-06T18:13:45Z

topics/single-cell/tutorials/scrna-seurat-pbmc3k/tutorial.md

+> 4. Check that the format is `RDS`
+{: .hands_on}
+
+We can't look at the RDS file directly as it is designed for computers to read, rather than humans, but the Seurat tools will now be able to interact with the data. 


Please add an inspect step here. A question on the number of features and cells filtered out with our thresholds should give an idea of what happened in the last step.
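
To make the suggested question concrete, here is a minimal Python sketch of what the two thresholds do to a count matrix. It is an illustration only: the toy matrix and the helper `filter_counts` are hypothetical, the thresholds are scaled down from the tutorial's `3` and `200` so the example stays small, and it assumes cells are filtered first and features second, which may not match the exact order the tool uses.

```python
def filter_counts(counts, min_cells, min_features):
    """Filter a genes x cells count matrix (list of rows, one row per gene).

    Assumption: cells are filtered first (by number of detected features),
    then features (by the number of remaining cells they are detected in).
    Returns (kept_gene_indices, kept_cell_indices).
    """
    n_genes = len(counts)
    n_cells = len(counts[0]) if counts else 0

    # Keep cells in which at least `min_features` genes have a non-zero count.
    kept_cells = [
        c for c in range(n_cells)
        if sum(1 for g in range(n_genes) if counts[g][c] > 0) >= min_features
    ]

    # Keep genes detected in at least `min_cells` of the remaining cells.
    kept_genes = [
        g for g in range(n_genes)
        if sum(1 for c in kept_cells if counts[g][c] > 0) >= min_cells
    ]
    return kept_genes, kept_cells


# Toy matrix: 4 genes (rows) x 5 cells (columns), values are made up.
toy = [
    [1, 0, 2, 3, 0],
    [0, 0, 1, 1, 0],
    [5, 0, 4, 2, 1],
    [0, 0, 0, 1, 0],
]
genes, cells = filter_counts(toy, min_cells=2, min_features=2)
print(f"features kept: {len(genes)} of {len(toy)}")
print(f"cells kept: {len(cells)} of {len(toy[0])}")
```

A question in the tutorial could then ask learners to compare the feature and cell counts before and after this step.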

topics/single-cell/tutorials/scrna-seurat-pbmc3k/tutorial.md

pavanvidem · 2024-11-06T23:41:38Z

topics/single-cell/tutorials/scrna-seurat-pbmc3k/tutorial.md

+>        - *"Genes to calculate residual features for"*: `all genes`
+>            - *"How to set variable features"*: `set number of variable features`
+>        - *"Output list of most variable features"*: `Yes`
+>        - *"Variable(s) to regress out"*: `percent.mt`


Isn't this different from the normal scaling step? In the normal scaling step no variable was regressed out, but in SCTransform `percent.mt` is regressed out.

pavanvidem · 2024-11-07T15:16:33Z

@MarisaJL the tutorial is well made! Some complex topics are explained in easy language. One key improvement I would suggest is to establish a clearer separation between log normalization and SCTransform preprocessing. There is a "choose your own tutorial" option, but it is applied only in the initial step. In later hands-on steps, both preprocessing methods are run together with side-by-side comparisons, which could be confusing for beginners trying to learn the analysis from scratch. Tracking two different analyses and managing two separate SeuratObjects may feel overwhelming at this stage.

It would be nice if the "choose your own tutorial" option carried through the entire tutorial. This way, users could follow a single workflow from start to finish before diving into comparisons. I really like your explanations and comparisons. Moving the comparisons to a dedicated section toward the end might also be beneficial. This final section could include guidance on which preprocessing method to use in specific scenarios.

Do you think it makes sense? Sorry, it sounds like a lot of restructuring.

MarisaJL · 2024-11-07T16:09:33Z

Thanks @pavanvidem - that does make sense! Since this is a tutorial designed for beginners, it's probably best to keep the two options separate. I think it is possible to have the cyoa follow on between steps, so I'll try and restructure it.

pavanvidem

Tested the tutorial and it works perfectly! The content looks great!!

pavanvidem · 2024-11-07T15:39:22Z

topics/single-cell/tutorials/scrna-seurat-pbmc3k/tutorial.md

+> 1. {% tool [Seurat Visualize](toolshed.g2.bx.psu.edu/repos/iuc/seurat_plot/seurat_plot/5.0+galaxy0) %} with the following parameters:
+>    - {% icon param-file %} *"Input file with the Seurat object"*: `rds_out` (output of **Seurat Run Dimensional Reduction** {% icon tool %})
+>    - *"Method used"*: `Determine dimensionality with 'ElbowPlot'`
+>        - *"Number of dimensions to plot standard deviation for"*: If you ran the separate preprocessing steps then enter `30` here, if you used SCTransfrom then enter `50`


Either change this from 50 to 30, or upload a new Elbow plot for SCTransform with 50 PCs.

pavanvidem · 2024-11-07T15:45:27Z

topics/single-cell/tutorials/scrna-seurat-pbmc3k/tutorial.md

+>        - *"Output list of top genes"*: `Yes`
+>
+{: .hands_on}
+


Maybe rename this output because it is used in many following steps.

pavanvidem · 2024-11-07T15:50:30Z

topics/single-cell/tutorials/scrna-seurat-pbmc3k/tutorial.md

+Creating a Seurat Object in R would require two steps - first, we would need to read in our data, in this case using the `Read10X` function, then secondly we would turn it into a Seurat Object using the `CreateSeuratObject` function. On Galaxy, we can perform both steps with a single tool. The `CreateSeuratObject` function also generates some QC metrics and performs basic filtering of the data.
+
+><hands-on-title>Create a Seurat Object</hands-on-title>
+> 1. {% tool [Seurat Create](toolshed.g2.bx.psu.edu/repos/iuc/seurat_create/seurat_create/5.0+galaxy0) %} with the following parameters:


The barcodes file is in txt format. Currently, the tool does not support txt files as input.
I am trying to add support here: galaxyproject/tools-iuc#6539. So please use the galaxy1 version for this step.

pavanvidem · 2024-11-07T16:35:51Z

topics/single-cell/tutorials/scrna-seurat-pbmc3k/tutorial.md

+>        - *"UMAP implementation to run"*: `uwot`
+>        - *"Run UMAP on dimensions, features, graph or KNN output"*: `dims`
+>            - *"Number of dimensions from reduction to use as input"*: If you ran separate preprocessing steps, leave this as `10`, if you used SCTransform then change it to `30`
+>


Please rename the output. It is used later for FindAllMarkers and then FindMarkers. Users might select the wrong input if it is not renamed.

pavanvidem · 2024-11-07T16:42:36Z

topics/single-cell/tutorials/scrna-seurat-pbmc3k/tutorial.md

+Although there is a lot of information here, all we need to know for now is that the markers listed for each cluster are the genes that were expressed more by these cells than any of the other clusters. We can search online for these genes to get an idea of what types of cells are in our clusters.
+
+> <question-title></question-title>
+> 1. Are the top genes associated with PCs 1-3 in our list of markers? Which clusters are they markers for?


Maybe add a question on how to select the markers for a particular cluster, then answer it with the Filter tool on column 7.
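
One way to picture the suggested answer: the sketch below does in Python what filtering on column 7 would do. The rows and gene names are made up, the helper `markers_for_cluster` is hypothetical, and the column layout (row name, `p_val`, `avg_log2FC`, `pct.1`, `pct.2`, `p_val_adj`, `cluster`, `gene`) is an assumption about the marker table, so the real header should be checked before relying on the column index.

```python
import csv
import io

# Made-up tabular marker table; the column layout is an assumption.
MARKERS_TSV = (
    "\tp_val\tavg_log2FC\tpct.1\tpct.2\tp_val_adj\tcluster\tgene\n"
    "GENE_A\t1e-50\t2.1\t0.90\t0.20\t1e-46\t0\tGENE_A\n"
    "GENE_B\t1e-40\t1.8\t0.85\t0.30\t1e-36\t0\tGENE_B\n"
    "GENE_C\t1e-60\t3.0\t0.95\t0.10\t1e-56\t1\tGENE_C\n"
    "GENE_D\t1e-30\t1.2\t0.70\t0.25\t1e-26\t2\tGENE_D\n"
)


def markers_for_cluster(tsv_text, cluster, column=7):
    """Return (header, rows) keeping rows whose 1-based `column` equals `cluster`."""
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t")
    header = next(reader)
    # Values are read as strings, so compare against the string form.
    rows = [row for row in reader if row[column - 1] == str(cluster)]
    return header, rows


header, cluster0 = markers_for_cluster(MARKERS_TSV, cluster=0)
for row in cluster0:
    # The last column holds the gene name in this assumed layout.
    print(row[-1], row[2])
```

In the tutorial itself the same selection would be a single condition on column 7, with the cluster number chosen by the learner.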

pavanvidem · 2024-11-07T17:42:17Z

topics/single-cell/tutorials/scrna-seurat-pbmc3k/tutorial.md

+>
+> 1. Click on the {% icon galaxy-pencil %} pencil icon of the file we renamed as `DE Markers` (this was the CSV output from `FindAllMarkers`) then select {% icon galaxy-chart-select-data %} Datatypes in the central panel. Choose the second option, `Convert to Datatype` and make sure `tabular (using `Convert CSV to tabular`)` is selected in the drop down menu before pressing the `Create Dataset` button. This will create a new, tabular version of the dataset at the top of your history - make sure that this is the version you use in the next step.
+>
+> 2. {% tool [Table Compute](toolshed.g2.bx.psu.edu/repos/iuc/table_compute/table_compute/1.2.4+galaxy0) %} with the following parameters:


When I tested, the empty header line did not affect the heatmap. I think a simple cut tool should be enough here.

Co-authored-by: Pavankumar Videm <[email protected]>

MarisaJL and others added 30 commits September 9, 2024 13:19

Create seurat pbmc tutorial

8eed6fd

Adding images

d85dd30

Correcting image paths and a few small edits

5342dd1

Minor edits

4fce919

Edits to markers section

aed0064

Edited cell annotation section

2be8fd8

Delete topics/single-cell/images/scrna-seurat-pbmc3k directory

6e7452b

Removing old images to replace with newer ones

Hands on sections

c0d7226

Add image via upload

97a0a51

Create image folder

64b3b5e

Fixing types and adding some image captions

7e2a55f

Adding images

4210a14

Removing old image

0218f0d

Deleting old image

0438a31

Deleting old image

f159ff3

Adding SCTransform results

40e8fbd

Deleting old image

f5f655a

Adding more SCTransform results

5a2cce0

Adding more

daaec19

Removing typo

a1f29c0

Final results added for SCTransform route

8164931

Adding images for SCTransform route

31a11ee

Two more images for SCT results

9e2c837

Upload image for SCT

a21c64b

Fixing some images

f912a80

Proofreading

d404fa3

Updated image

4ed6bba

Proofreading

614486f

Adding workflow file

3e37322

Creating workflows folder

8448497

MarisaJL requested a review from a team as a code owner October 31, 2024 13:30

github-actions bot added the single-cell label Oct 31, 2024

MarisaJL added 12 commits October 31, 2024 13:46

Fixing some formatting problems

17596d6

Missed this one!

c2f0791

Adding missing " to workflow

553c45c

Fixing errors in Seurat_PBMC_Workflow.ga

1e4cb24

Fixing errors in Seurat_PBMC_Workflow_SCT.ga

4ddf97d

Adding missing DOI

8579661

Correcting file name 1

ed38334

Correcting file name 2

b8b3c6e

Fixing errors

ff863cf

Fixing the one I missed!

c2ad554

Trying to fix snippet

c1edc52

Merge branch 'main' into seurat_pbmc

9943049

pavanvidem reviewed Nov 6, 2024

pavanvidem reviewed Nov 7, 2024

MarisaJL and others added 10 commits November 28, 2024 14:03

Apply suggestions from code review

30d1a03

Co-authored-by: Pavankumar Videm <[email protected]>

Adding in some more questions

2b9f69f

Starting to restructure to separate paths

b9ab98c

More restructuring

f60411d

Rewriting some Q & As

d81e599

Changing a few parameters

6a4c55c

Changing some images

0dbb219

Updating/adding images

58f34e5

Adding new images for SCT route

8346252

SCT version of the FindMarkers section

ee97685

MarisaJL marked this pull request as draft December 6, 2024 16:45
