Skip to content

Commit

Permalink
number of features plot and updated zenodo links
Browse files Browse the repository at this point in the history
  • Loading branch information
timonschlegel committed Jul 2, 2024
1 parent ad514c7 commit 0e40bc5
Show file tree
Hide file tree
Showing 3 changed files with 14 additions and 5 deletions.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Original file line number Diff line number Diff line change
Expand Up @@ -13,19 +13,19 @@ items:
- name: 'DOI: 10.5281/zenodo.11473531'
description: latest
items:
- url: https://zenodo.org/api/records/11473531/files/atac_pbmc_5k_nextgem_fragments.tsv/content
- url: https://zenodo.org/api/records/11473531/files/atac_pbmc_5k_nextgem_fragments.tsv/
src: url
ext: auto
info: https://zenodo.org/records/11473531
- url: https://zenodo.org/api/records/11473531/files/gencode.v46.annotation.gtf.gz/content
- url: https://zenodo.org/api/records/11473531/files/gencode.v46.annotation.gtf.gz/
src: url
ext: auto
info: https://zenodo.org/records/11473531
- url: https://zenodo.org/api/records/11473531/files/chrom_sizes.txt/content
- url: https://zenodo.org/api/records/11473531/files/chrom_sizes.txt/
src: url
ext: auto
info: https://zenodo.org/records/11473531
- url: https://zenodo.org/api/records/11473531/files/BAM_500-PBMC.bam/content
- url: https://zenodo.org/api/records/11473531/files/BAM_500-PBMC.bam/
src: url
ext: auto
info: https://zenodo.org/records/11473531
Original file line number Diff line number Diff line change
Expand Up @@ -404,6 +404,13 @@ After creating the variables, the most accessible features are selected.
> >
> > - Including more features improves resolution and can reveal finer details, but it may also introduce noise.
> > - To optimize results, experiment with the `n_features` parameter to find the most appropriate value for your dataset.
> >
> > > <details-title> Different number of features </details-title>
> > > - To demonstrate the differences when selecting features, the following UMAP plots are the outputs from processing with a number of features between 1,000 and 500,000.
> > > - Fewer features result in fewer, but larger clusters. And selecting a lot of features will output more granular clusters and the compute time will increase.
> > > ![Different number of features UMAP]({% link topics/single-cell/images/scatac-standard-snapatac2/number_features.png %}"UMAP plots with different selected features")
> > >
> > {: .details}
> > - At this step you can provide a blacklist or whitelist to specifically select relevant features.
> > - For example the [**ENCODE Blacklist**](https://github.com/Boyle-Lab/Blacklist) ({% cite Amemiya2019 %}) can be applied here.
> {: .comment}
Expand Down Expand Up @@ -511,6 +518,8 @@ Dimension reduction is a very important step during the analysis of single cell
> - As such, they are implemented in many algorithms to visualize the data in 2 dimensions (f.ex. **UMAP** embedding).
> - The nonlinear dimension reduction algorithm, through *matrix-free spectral embedding*, used in **SnapATAC2** is a very fast and memory efficient non-linear algorithm ({% cite Zhang2024%}).
> - **Spectral embedding** utilizes an iterative algorithm to calculate the **spectrum** (*eigenvalues* and *eigenvectors*) of a matrix without computing the matrix itself.
> - For a simple introduction into *spectral embedding* and how it compares to *PCA*, please check out the blog post ["On Laplacian Eigenmaps for Dimensionality Reduction"](https://juanitorduz.github.io/laplacian_eigenmaps_dim_red/) by Juan Orduz.
>
{: .details}
## Spectral embedding
Expand Down Expand Up @@ -702,7 +711,7 @@ Since the *cell-by-gene-activity* matrix resembles the *cell-by-gene-expression*
> - The algorithm calculates the likely gene expression of a single cell, based on similar cells and fills in the missing data to produce the expected expression.
> - *MAGIC* achieves this by building a graph from the data and using data diffusion to smooth out the noise.
>
> ![Imputation with the MAGIC algorithm]({% link topics/single-cell/images/scatac-standard-snapatac2/magic_method.png %} "MAGIC restores noisy and sparse single-cell data using diffusion geometry")
> ![Imputation with the MAGIC algorithm]({% link topics/single-cell/images/scatac-standard-snapatac2/magic_method.png %} "MAGIC restores noisy and sparse single-cell data using diffusion geometry ({%cite vanDijk2018%})")
>
{: .details}
Expand Down

0 comments on commit 0e40bc5

Please sign in to comment.