-
Notifications
You must be signed in to change notification settings - Fork 69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Proportion of sites under selection #1772
Comments
Dear @mridna, Well, a principled (but not something that you could easily) method would be use your functional class (categorical) as a part of the phylogenetic fit. In other words, ω and its weight would be functions of the functional class (e.g., as a random effect), and all Both approaches you have are sensible but ad hoc.
As an example, here are 10 randomly selected genes from https://pubmed.ncbi.nlm.nih.gov/30620335/ using different measures of "selected weight". I ran BUSTED-E (https://www.biorxiv.org/content/10.1101/2024.11.13.620707v1) and MEME. The alignments, Mostly, just be aware that there will be a number of hard-to-quantify factors influencing the estimates. You should also explore how the definition of the "fraction of alignment" influences your downstream analyses. If you get very different results based on what you select, then probably there's not much sense in placing too much faith in any associations you find. Best, |
Dear Sergei, Thank you so much for your detailed response and suggestions, I really appreciate it! Best regards, |
Dear @spond,
I wanted your advice on how best to calculate the proportion of sites under positive selection across the entire phylogeny (~30 species) for ~2000 genes. Our intention is to compare different functional categories and see if there are differences in how many sites are under positive selection. We are not concerned about what specific sites are under selection. What we are ultimately trying to check is even if the whole gene is not evolving under positive selection, are there still some sites under positive selection and does this proportion vary based on function. With this in mind, how would you approach calculating proportion of positively selected sites? We are currently thinking about this in two ways:
Proportion of sites with omega > 1 from BUSTED. Here we are a little stumped on how to proceed. We have the model averaged p-values for each alignment, but should the proportion be taken from the best fitting model, since it seems to be different based on BUSTED settings (with/without SRV and MH)? And if there is no significant p-value for positive selection, should we consider that there are no sites with omega > 1 even if BUSTED shows some codons are evolving in this rate class.
Use number of significant positively selected sites from MEME (+MH) and normalise it for number of sites in the alignment to arrive at a significant proportion of sites under positive selection.
Hope this was explained clearly!
Best regards,
Mridula
The text was updated successfully, but these errors were encountered: