This repository has been archived by the owner on Jun 16, 2023. It is now read-only.

Updated analysis: Fusion filtering #533

Closed
migbro opened this issue Apr 18, 2023 · 2 comments
Labels: bug (Something isn't working), invalid (This doesn't seem right)

Comments

migbro commented Apr 18, 2023

What analysis module should be updated and why?

Assuming I understand correctly, this one: 05-QC_putative_onco_fusion_distribution.Rmd
The module attempts to collapse hits in which both callers called the same breakpoint, but the collapse fails in some cases.
The code in question seems to be here: https://github.com/PediatricOpenTargets/OpenPedCan-analysis/blob/6e53918a1cf33c190082665df1ecf833efc96b23/analyses/fusion_filtering/05-QC_putative_onco_fusion_distribution.Rmd#L240-L250

What changes need to be made? Please provide enough detail for another participant to make the update.

Currently, some collapsing does occur, e.g.:
BS_00FD2KMP 3:160356013 14:67809289 IFT80--ZFYVE26 in-frame IFT80 NA ZFYVE26 NA NA Oncogene NA NA NA STARFUSION, ARRIBA 2 NA NA FALSE PT_SW4Q1HZP [INTERCHROMOSOMAL[chr3--chr14]], translocation Genic
However, other instances are not collapsed:

BS_00FD2KMP     6:158818082     6:4431475       EZR--ENSG00000285424    frameshift      EZR     NA      ENSG00000285424 NA      CosmicCensus    NA      NA      NA      NA      ARRIBA  1       NA      NA      FALSE   PT_SW4Q1HZP     [INTRACHROMOSOMAL[chr6:154.18Mb]], deletion     Genic
BS_00FD2KMP     6:158818082     6:4431475       EZR--ENSG00000285424    other   EZR     NA      ENSG00000285424 NA      CosmicCensus    NA      NA      NA      NA      STARFUSION      1       NA      NA      FALSE   PT_SW4Q1HZP     [INTRACHROMOSOMAL[chr6:154.18Mb]]       Genic
BS_028YFYJ6     3:53846648      3:39169937      IL17RB--ENSG00000284669 frameshift      IL17RB  NA      ENSG00000284669 NA      Oncogene        NA      NA      NA      NA      ARRIBA  1       NA      NA      FALSE   PT_394ZA6P7     [INTRACHROMOSOMAL[chr3:14.67Mb]], duplication   Genic
BS_028YFYJ6     3:53846648      3:39169937      IL17RB--ENSG00000284669 other   IL17RB  NA      ENSG00000284669 NA      Oncogene        NA      NA      NA      NA      STARFUSION      1       NA      NA      FALSE   PT_394ZA6P7     [INTRACHROMOSOMAL[chr3:14.67Mb]]        Genic

You can see two examples of the same breakpoint in the same sample appearing on separate lines, one per caller, when I believe the intent is to merge those. The difference seems to be in Fusion_Type. Recommendation: merge these calls and use the Fusion_Type assigned by ARRIBA.
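The recommended merge can be sketched in a few lines. This is a minimal Python illustration, not the module's R code: the column names `Sample`, `LeftBreakpoint`, `RightBreakpoint`, `FusionName`, `Caller`, and `CallerCount` are assumptions made for the example (only `Fusion_Type` is named in this issue), and the function `collapse_calls` is hypothetical. It groups rows by sample, breakpoints, and fusion name, then keeps the `Fusion_Type` from the ARRIBA row when one exists. It does not reconcile any other columns that differ between callers.

```python
from collections import OrderedDict

# Assumed (hypothetical) column names; the real table has many more columns.
KEY_COLUMNS = ("Sample", "LeftBreakpoint", "RightBreakpoint", "FusionName")


def collapse_calls(rows, preferred_caller="ARRIBA"):
    """Merge rows that share sample, breakpoints, and fusion name."""
    groups = OrderedDict()
    for row in rows:
        key = tuple(row[col] for col in KEY_COLUMNS)
        groups.setdefault(key, []).append(row)

    collapsed = []
    for key, members in groups.items():
        # Distinct callers, in order of first appearance.
        callers = list(dict.fromkeys(m["Caller"] for m in members))
        # Take Fusion_Type from the preferred caller; fall back to the first row.
        chosen = next(
            (m for m in members if m["Caller"] == preferred_caller), members[0]
        )
        merged = dict(zip(KEY_COLUMNS, key))
        merged["Fusion_Type"] = chosen["Fusion_Type"]
        merged["Caller"] = ", ".join(callers)
        merged["CallerCount"] = len(callers)
        collapsed.append(merged)
    return collapsed


# The two EZR rows quoted above, reduced to the columns used here.
rows = [
    {"Sample": "BS_00FD2KMP", "LeftBreakpoint": "6:158818082",
     "RightBreakpoint": "6:4431475", "FusionName": "EZR--ENSG00000285424",
     "Fusion_Type": "frameshift", "Caller": "ARRIBA"},
    {"Sample": "BS_00FD2KMP", "LeftBreakpoint": "6:158818082",
     "RightBreakpoint": "6:4431475", "FusionName": "EZR--ENSG00000285424",
     "Fusion_Type": "other", "Caller": "STARFUSION"},
]

result = collapse_calls(rows)
print(result)
```

With these two input rows the sketch yields a single row whose `Fusion_Type` is the ARRIBA value and whose caller count is 2.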

What input data should be used? Which data were used in the version being updated?

Whatever input is normally given for this step

When do you expect the revised analysis will be completed?

ASAP

Who will complete the updated analysis?

Unknown

migbro added the bug and invalid labels on Apr 18, 2023
migbro commented Apr 18, 2023

@kelseykeith I think Jo Lynne assigned you to the wrong ticket; I just made the adjustment. This one is related to #509, but this ticket actually describes the work.

jharenza commented

This may stem from kinases with multiple kinase domains, where one domain is retained and another is not. We cross-posted here, but the immediate fix will be to collapse these as "Yes,No" in the domain retained columns.
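As a rough illustration of the "Yes,No" collapse, here is a small Python sketch. It is not the module's code: the function names, the column name `DomainRetained`, and the sample and gene identifiers are all hypothetical, and the input rows are made up for the example. The idea is to join the distinct values of a domain retained column within each group, in order of first appearance.

```python
def collapse_values(values, sep=","):
    """Join distinct non-missing values in order of first appearance."""
    distinct = list(dict.fromkeys(v for v in values if v is not None))
    return sep.join(distinct) if distinct else None


def collapse_domain_retained(rows, key_columns, domain_column):
    """Return one collapsed domain value per group of rows sharing the key."""
    grouped = {}
    for row in rows:
        key = tuple(row[col] for col in key_columns)
        grouped.setdefault(key, []).append(row[domain_column])
    return {key: collapse_values(vals) for key, vals in grouped.items()}


# Made-up rows: the same fusion annotated once per kinase domain.
domain_rows = [
    {"Sample": "SAMPLE_1", "FusionName": "GENE1--GENE2", "DomainRetained": "Yes"},
    {"Sample": "SAMPLE_1", "FusionName": "GENE1--GENE2", "DomainRetained": "No"},
    {"Sample": "SAMPLE_2", "FusionName": "GENE3--GENE4", "DomainRetained": "Yes"},
]

domain_result = collapse_domain_retained(
    domain_rows, ("Sample", "FusionName"), "DomainRetained"
)
print(domain_result)
```

A group whose rows agree keeps its single value, so only fusions with conflicting domain annotations end up as "Yes,No".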
