Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

XEC spike diversity issue #2088

Open
1 of 11 tasks
xz-keg opened this issue Sep 25, 2024 · 21 comments
Open
1 of 11 tasks

XEC spike diversity issue #2088

xz-keg opened this issue Sep 25, 2024 · 21 comments

Comments

@xz-keg
Copy link
Contributor

xz-keg commented Sep 25, 2024

It seems that there is, and will be a lot of spike diversity in XEC. Better gather them in one issue.
Only count for branches with more than 2 places.

GPT Model trained on August data(seqs before XEC appears) predicts 4 spike mutations for XEC on top potential mutation list: S:T572I, S:R346T, S:N185D, S:A688V.

Tasks

@FedeGueli
Copy link

FedeGueli commented Sep 25, 2024

Very good idea, i will try to help. do u prefer direct editing or highlighting in the comments?

I would require also that the lineage has to be sampled at least once in September ( or last 30 days) to avoid dead ends
If you agree .

@xz-keg
Copy link
Contributor Author

xz-keg commented Sep 25, 2024

Very good idea, i will try to help. do u prefer direct editing or highlighting in the comments?

I would require also that the lineage has to be sampled at least once in September ( or last 30 days) to avoid dead ends If you agree .

Sure, you can do direct editing.

@xz-keg xz-keg pinned this issue Sep 26, 2024
@cvejris
Copy link

cvejris commented Sep 26, 2024

Branch 7: XEC+Orf1a:A599T+S:S680F (furin), 5seqs (1xFrance, 1xCanada, 3xUS). Arose 2x independently (all US sequences are separated on the Orf1a:T2274I subbranch). All seqs less than 1 month old. Query: G2060A, C23601T

@Mydtlwn
Copy link

Mydtlwn commented Sep 27, 2024

@corneliusroemer

@cvejris
Copy link

cvejris commented Sep 28, 2024

Branch 9: XEC+Orf1a:A599T+S:E1202Q, 2 seqs, France + Ireland. Query: G2060A,C11020T,G25166C

@FedeGueli
Copy link

Br.7 went to 8 with three GBW samples from Peru' (2 patients)

@cvejris
Copy link

cvejris commented Oct 1, 2024

Branch 10/11: XEC+S:W152R (defining for Centaurus) arose convergently: Branch 10 via T22016A (1xNL, 1xFR). Branch 11 via T22016C (1xIR, 1xUSA). All seqs sampled in September

@cvejris
Copy link

cvejris commented Oct 1, 2024

Branch 12: XEC+S:T678I (C23595T), furin-adjacent. 4xCAN, 2xFR, 2xUSA. Convergent: one on the Orf1a:I1367L, the rest on the Orf1a:A599T polytomy. All seqs sampled in September

@cvejris
Copy link

cvejris commented Oct 1, 2024

Branch 13: XEC+S:P1263Q(C25350A): 3xSW,1xFR,1xNL. Convergent (Swedish seqs on the C27630T subbranch of Orf1a:A599T, the rest on the Orf1a:A599T polytomy).
Interestingly, most XEC with P1263Q harbor other "promising" mutations
image

@xz-keg
Copy link
Contributor Author

xz-keg commented Oct 1, 2024

Branch 10: XEC+S:W152R (defining for Centaurus) arose convergently: i. via T22016A (1xNL, 1xFR) ii. via T22016C (1xIR, 1xUSA). All seqs sampled in September

Please separate these branches and ensure every branch is monoplyetic.

@cvejris
Copy link

cvejris commented Oct 1, 2024

Branch 14: XEC+S:P1263L(C25350T), same site as branch 13. 1xSW,1xPL.

@xz-keg
Copy link
Contributor Author

xz-keg commented Oct 2, 2024

Branch 13: XEC+S:P1263Q(C25350A): 3xSW,1xFR,1xNL. Interestingly, most XEC with P1263Q harbor other "promising" mutations image

Please check with usher before proposing. They seem to be on different usher branches.
https://genome-test.gi.ucsc.edu/cgi-bin/hgPhyloPlace

If they are on different usher branches it is likely they emerge separately and shouldn't be treated as one. Unless you provide reason (artifact, usher-flip flop, convergent other mutations like branch 7, etc. ) to merge them.

@cvejris
Copy link

cvejris commented Oct 2, 2024

Branch 13: XEC+S:P1263Q(C25350A): 3xSW,1xFR,1xNL. Interestingly, most XEC with P1263Q harbor other "promising" mutations image

Please check with usher before proposing. They seem to be on different usher branches. https://genome-test.gi.ucsc.edu/cgi-bin/hgPhyloPlace

If they are on different usher branches it is likely they emerge separately and shouldn't be treated as one. Unless you provide reason (artifact, usher-flip flop, convergent other mutations like branch 7, etc. ) to merge them.

IMO, at present the relatively low number of XEC seqs makes it hard to correctly assess the phylogeny, the resolution is still insufficient. The total XEC Usher tree still places most seqs on polytomy.
My contributions should not be treated as lineage proposals. I look for mutations which may be beneficial for the virus. They might be i. founder mutations for a monophyletic lineage which spread to different countries, or ii. the same AA substitution arising independently on different background in different countries. It does not make that much difference - both scenarios may in theory indicate selective advantage conferred by the mutations.
I edited my contributions by adding notes about convergence for the mutations where I think there was one :)

@FedeGueli
Copy link

@aviczhl2 i think soon we will be force to raise the parameters to three places and 10 seqs, i tell this because in my previous experience with spike diversity issues it rapidly becomes very mess or too long with the opposite effect to risk hiding something fast instead of highlighting it. It is not yet the moment but we shopuld think about it.

@cvejris
Copy link

cvejris commented Oct 2, 2024

Branch 15: XEC+Orf1a:A599T+S:K182N. 3xNL, 1xENG, 1xCanary Islands (with additional S:M153I). Monophyletic, query: C829T, A22108T

@FedeGueli
Copy link

FedeGueli commented Oct 3, 2024

Not a Branch but worth tracking: XEC + Orf1a:A599T+ S:A475V
Query:T8416C,C22986T,T3565C
Samples: 3
Places: Netherlands, 2 regions (Gelderman, Zuit Holland)

@FedeGueli
Copy link

FedeGueli commented Oct 3, 2024

Solved 5 now from Br.8 also from Germany it looks interesting

Deleted for mistake the query of BR.8 i m re building it ( i fear it is not monophyletic though)

@cvejris
Copy link

cvejris commented Oct 4, 2024

Branch 16: XEC+Orf1a:A599T+S:G72R. 5 seqs, 5 countries: Denmark, Netherlands, France (with S:K113R), Canada, GBW from Mexico. Query: G2060A, G21776A,T3565C (edited)

@FedeGueli
Copy link

Branch 16: XEC+Orf1a:A599T+S:G72R. 5 seqs, 5 countries: Denmark, Netherlands, France (with S:K113R), Canada, GBW from Mexico. Query: G2060A, G21776A,T3565C (edited)

i ve added T3565C to exclude old samples.

@xz-keg
Copy link
Contributor Author

xz-keg commented Oct 6, 2024

G2060A, G21776A,T3565C

You can directly edit the task list. But be cautious on cvejris proposals that may not be monophyletic.

@FedeGueli
Copy link

FedeGueli commented Oct 6, 2024

G2060A, G21776A,T3565C

You can directly edit the task list. But be cautious on cvejris proposals that may not be monophyletic.

yeah i m a bit confused about the branch numbering . @cvejris i suggest just to add the lineage you find without a branch number

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants