Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Splice model #16

Closed
arunvv90 opened this issue Oct 10, 2022 · 3 comments
Closed

Splice model #16

arunvv90 opened this issue Oct 10, 2022 · 3 comments
Labels
question Further information is requested

Comments

@arunvv90
Copy link

arunvv90 commented Oct 10, 2022

Hi,
Thank you for the great tool!! I am using it for virus annotation(Ictalurid herpesvirus). Testing went very well. In the last update, two splice models are added. In the earlier version, I was using the default version. I was wondering which model would be appropriate for my case(I guess the general model?). Another question is about the GFF format. My GFF looks like this

C02-169_draft1	miniprot	mRNA	33598	35094	2600	+	.	ID=C02-169_ORF25_000001;Identity=0.9940;Positive=0.9960;Target=C02-169_ORF25 1 498

Can you please tell me what is 2600, Identity(nucleotide identity or amino acid identity), Positive=0.9960, and the last two columns(1,498)? Please forgive me for these silly questions from a newcomer. There are not many good tools out there for virus annotation and most of them fail with introns like herpesvirus. Your tool seems very promising.

lh3 added a commit that referenced this issue Oct 10, 2022
@lh3
Copy link
Owner

lh3 commented Oct 10, 2022

I don't know what splice site look like in Ictalurid herpesvirus (is it GT-AG?). It is safer to use -j1, which is now the default and is equivalent to the splice model in v0.4.

nucleotide identity or amino acid identity

Amino acid. miniprot doesn't see the nucleotide sequences of query genes.

the last two columns

See the GFF3 spec. It is the region of the query protein aligned

@lh3 lh3 closed this as completed Oct 10, 2022
@lh3 lh3 added the question Further information is requested label Oct 10, 2022
@arunvv90
Copy link
Author

Thank you for the quick reply. Splice junctions in the virus are major GT-AG or minor AT-AC

@lh3
Copy link
Owner

lh3 commented Oct 11, 2022

Thanks. I didn't know that. Miniprot considers GT-AG, GC-AG and AT-AC. I think it should work.

PS: also forgot to answer – 2600 is the alignment score without considering introns.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants