Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Gene set expression with module score? #15

Open
borshago opened this issue Oct 18, 2019 · 2 comments
Open

Gene set expression with module score? #15

borshago opened this issue Oct 18, 2019 · 2 comments

Comments

@borshago
Copy link

Hi and thanks for the very useful tool!

Currently gene set expression is calculated as an arithmetic mean, which has pitfalls like sensitivity to outliers. Could you add a feature where expression for gene sets is calculated like the module score in Seurat AddModuleScore? This would provide a sort of per-cell "enrichment" of a gene set, and would be more useful than just the mean.

Best regards,
Daniel

@romanhaa
Copy link
Owner

HI Daniel,

Thank you for the suggestion. I agree that the mean is not an ideal measurement, here it just serves as a cheap (in terms of resources) approximation. Not only is it sensitive to outliers, one also has to keep the average number of transcripts per sample/cluster in mind as that affects the average expression of the genes of interest.

I already thought about alternatives, similar to the module score calculated by Seurat which is a very nice solution IMO. The biggest hurdle is of technical nature: the more calculation must be done, the longer it will take and the user has to wait making the interface feel sluggish. This is especially true for older machines.

One way this could be solved is letting the user choose how the gene expression should be calculated: mean, median, or something more sophisticated like a module score. I don't want to put Seurat on the list of dependencies to avoid unnecessarily inflating the size of the standalone version of Cerebro. Therefore, one would either have to copy the module score function and/or make something similar.

TL;DR: It's on the list of things to do :)

Best,
Roman

@borshago
Copy link
Author

Thanks for the info!

How about an option to store precomputed values of the module score for gene sets in the crb file? If this is doable, it would be interesting to have stored enrichment scores from AUCell as well:

https://bioconductor.org/packages/release/bioc/html/AUCell.html

Just my two cents. :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants