Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question: technical definition of perfect, strict, and loose #140

Closed
alimayy opened this issue Jan 29, 2021 · 3 comments
Closed

Question: technical definition of perfect, strict, and loose #140

alimayy opened this issue Jan 29, 2021 · 3 comments

Comments

@alimayy
Copy link

alimayy commented Jan 29, 2021

Hi there,

Thanks for developing RGI.

Regarding the perfect, strict, and loose hits, Jia et al. 2017 mentions:

The RGI currently supports two detection model types (Protein Homolog and Protein Variant) and analyzes sequences under three paradigms—Perfect, Strict, and Loose (a.k.a. Discovery). The Perfect algorithm is most often applied to clinical surveillance as it detects perfect matches to the curated reference sequences and mutations in CARD.

Could you provide a more technical definition of these terms? For instance for 'perfect', I understand from #101 that the self-mapping bit-scores determine the 'perfect' hits. Could you provide more info on how 'strict' and 'loose' thresholds are chosen?

Thanks in advance,
Ali

@raphenya
Copy link
Collaborator

@alimayy As an example, Blasting AXC-1 on the CARD database produces the results shown. The bit-score chosen will be 500, as it will capture genes similar to AXC-1.

Bitscore-AXC-1

  • A Perfect hit to AXC-1 has all amino acids matching CARD reference.
  • A Strict hit to AXC-1 will be any gene with at least bitscore of 500.
  • A Loose hit to AXC-1 will be any gene with bitscore below 500.
  • All genes in CARD go through the same processes.

@parwa28
Copy link

parwa28 commented Dec 25, 2023

Can you please tell me how exactly I can use one of the detection models?

@agmcarthur
Copy link
Collaborator

@alimayy these detection models are used by CARD's Resistance Gene Identifier software: https://github.com/arpcard/rgi (contains full documentation). There is an online portal at https://card.mcmaster.ca/analyze/rgi. Also described in CARD's latest publication: https://pubmed.ncbi.nlm.nih.gov/36263822/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants