Large number of significant interactions #32

Open
joan-yanqiong opened this issue May 3, 2023 · 5 comments

Comments

@joan-yanqiong

joan-yanqiong commented May 3, 2023

Hello, I'm using cell2cell to identify interactions with scRNA-seq data.

  interactions = c2c.analysis.SingleCellInteractions(rnaseq_data=rnaseq,
                                                     ppi_data=lr_pairs,
                                                     metadata=meta,
                                                     interaction_columns=(
                                                         'source_genesymbol', 'target_genesymbol'),
                                                     communication_score='expression_gmean',
                                                     cci_score='bray_curtis',
                                                     cci_type='directed',
                                                     aggregation_method='average',
                                                     barcode_col='index',
                                                     celltype_col=args.annot,
                                                     complex_sep='_',
                                                     verbose=True)

When I count the interactions per cell type pair (taking ccc_permutation_pvalues with p < 0.05), I get a very large number of significant interactions (up to 1800 per pair) compared with LIANA. I'm wondering why that could be.

@earmingol
Owner

earmingol commented May 3, 2023

Hi @joan-yanqiong!

Yeah, I think this happens because cell2cell does not filter genes by the fraction of cells expressing them. LIANA does, through its expr_prop parameter. I think LIANA's default is 0.1, which means it only considers genes expressed in at least 10% of the single cells of a cell type. As a result, the pool of LR pairs for the dataset is smaller, and so you get a smaller number of significant interactions.
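For illustration, here is a minimal sketch of that kind of expression-fraction filter, applied to the LR pair list before it is handed to any tool. It is not part of cell2cell and not LIANA's implementation. The helper names (`expressed_fraction`, `filter_lr_pairs`), the toy gene names, and the rule "each gene must pass in at least one cell type" are all assumptions made for the example. It also ignores protein complexes (names joined with `complex_sep`).

```python
import pandas as pd


def expressed_fraction(expr: pd.DataFrame, cell_types: pd.Series) -> pd.DataFrame:
    """Fraction of cells per cell type with non-zero expression of each gene.

    expr: genes x cells matrix. cell_types: cell type label per cell barcode.
    Returns a genes x cell types table of fractions in [0, 1].
    """
    detected = (expr > 0).T  # cells x genes, boolean
    labels = cell_types.reindex(detected.index)
    return detected.groupby(labels).mean().T


def filter_lr_pairs(lr_pairs, frac, ligand_col, receptor_col, min_prop=0.1):
    """Keep pairs whose ligand and receptor each reach min_prop in some cell type.

    Hypothetical rule for this sketch; genes absent from frac are dropped.
    """
    passing = set(frac.index[(frac >= min_prop).any(axis=1)])
    keep = lr_pairs[ligand_col].isin(passing) & lr_pairs[receptor_col].isin(passing)
    return lr_pairs[keep]


# Toy data: 3 genes, 8 cells, 2 cell types (all names are made up).
cells = [f"c{i}" for i in range(8)]
cell_types = pd.Series(["T"] * 4 + ["B"] * 4, index=cells)
expr = pd.DataFrame(
    [[5, 3, 2, 1, 0, 0, 0, 0],   # LIGA: detected in all T cells
     [0, 0, 0, 0, 0, 2, 4, 1],   # RECB: detected in 3 of 4 B cells
     [0, 1, 0, 0, 0, 0, 0, 0]],  # LIGC: detected in 1 of 4 T cells
    index=["LIGA", "RECB", "LIGC"],
    columns=cells,
)
lr_pairs = pd.DataFrame(
    {"source_genesymbol": ["LIGA", "LIGC"], "target_genesymbol": ["RECB", "RECB"]}
)

frac = expressed_fraction(expr, cell_types)
# min_prop=0.5 is used here only because the toy dataset is tiny.
filtered = filter_lr_pairs(
    lr_pairs, frac, "source_genesymbol", "target_genesymbol", min_prop=0.5
)
print(filtered)
```

The filtered table could then be passed as `ppi_data` in place of the full list, which shrinks the pool of LR pairs that get tested.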

@joan-yanqiong
Author

Thank you for the very fast response. Okay, that makes sense, so the interactions that I get with cell2cell are not necessarily wrong. Is there a way to make the analysis more stringent to reduce the number of interactions?

@earmingol
Owner

earmingol commented May 3, 2023

No, unfortunately. I developed this before most of the recent tools came out, and I haven't had time to implement the filtering steps they include. I plan to add more stringent filtering at some point.

@joan-yanqiong
Author

I was thinking of maybe using the scores as a filtering step to reduce the number of interactions: interactions.interaction_space.interaction_elements["communication_matrix"]. Would that make sense?
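As a rough sketch of that idea: combine the permutation p-values with a minimum communication score, so an interaction is kept only if it is both significant and strong. This assumes both tables are pandas DataFrames with LR pairs as rows and cell type pairs as columns. The helper `significant_and_strong`, the label formats, the numbers, and the `min_score` cutoff are all hypothetical and would have to be chosen for the real data.

```python
import pandas as pd


def significant_and_strong(scores, pvalues, alpha=0.05, min_score=1.0):
    """Boolean mask: p-value below alpha AND communication score >= min_score."""
    # Align on shared rows and columns so the two tables are compared entry by entry.
    scores, pvalues = scores.align(pvalues, join="inner")
    return (pvalues < alpha) & (scores >= min_score)


# Toy tables standing in for the communication matrix and the permutation
# p-values (labels and values are made up).
pairs = ["LIGA^RECB", "LIGC^RECB", "LIGD^RECE"]
cols = ["T;B", "B;T"]
scores = pd.DataFrame([[2.0, 0.1], [0.2, 0.0], [1.5, 1.2]], index=pairs, columns=cols)
pvalues = pd.DataFrame([[0.01, 0.03], [0.04, 0.60], [0.20, 0.02]], index=pairs, columns=cols)

by_pvalue_only = (pvalues < 0.05).sum()  # interactions per cell type pair, p-value only
mask = significant_and_strong(scores, pvalues, alpha=0.05, min_score=1.0)
by_both = mask.sum()                     # interactions per cell type pair, both criteria

print(by_pvalue_only)
print(by_both)
```

A fixed cutoff is the simplest choice. A quantile of the non-zero scores would be another option, at the cost of making the threshold depend on the dataset.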

@poloarol

poloarol commented Aug 1, 2024

Hello @joan-yanqiong

I know it's been a while now, but do you remember how you went about filtering the dataset?

Best
