-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Marker genes in JSON file - Correlation analysis mode #11
Comments
Hi Dimitris, Yes, that is an expected result. When you run correlation mapping, the first step the code takes is to flatten the taxonomy tree, i.e. reduce it to a single level so that the cell type clusters are all direct children of the root node. The marker gene lookup table is similarly flattened. There is now only one parent node in the tree ( Cheers, Scott |
Hi Scott, Great, thank you for clarifying. A follow-up question regarding the hierarchical mapping marker genes this time: The JSON file includes such info:
As I was examining the genes included in each "SUBC", I observed that a big percentage of them are constantly present in every predicted "SUBCLASS". And if I also seek for "unique" markers across subclasses, I end up with really few subclasses with some unique markers. Nevertheless, it seems that this is not a problem for the mapper, since the results I get make sense. Thank you in advance, |
The marker genes used by the on-line MapMyCells app are the product of another research team, so I'm going to have to ask around to see if there is an answer to your question. With the onset of the end-of-year holidays, I probably won't be able to properly respond to this until early 2024. Sorry I can't give you anything more helpful now. |
I just read over your question again and realized I can answer it. There is no importance score that the algorithm uses when assigning classes, subclasses, etc. The data is simply subsampled to include only the marker genes and then correlated against the average gene expression profiles of the clusters in the reference data (again, using only the marker genes). The cluster with the highest correlation coefficient is chosen (i.e. all marker genes are considered equal). The documentation for the cell type assignment algorithm can now be found here. |
Hi Scott, Happy new year and thank you very much for your response. Ok, that's totally reasonable then. I will go through the documentation you have created and let you know if everything is clear. Cheers, Dimitris |
Hi @danielsf,
I'm creating a new thread here, so we can continue our conversation regarding the marker genes that are used during the mapping process.
I followed your instructions in #10 in order to access those marker genes.
(1) In the hierarchical analysis mode I get multiple marker gene lists that are used to discriminate between the children of each parent in the taxonomy tree, as also a 'None' element, that indicates the root of the taxonomy tree. That's in general clear.
(2) In the correlation analysis mode though, I only get a 'None' element. Is that reasonable?
Cheers,
Dimitris
The text was updated successfully, but these errors were encountered: