-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
refine evidence_direction #123
base: v0.11
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
MolecularMatch changes don't address #122. CIViC looks pretty good, minor cosmetic fix needed. OncoKB seems good to me.
harvester/civic.py
Outdated
association['response_type'] = '{} {} {}'.format( | ||
evidence_item['evidence_direction'], | ||
evidence_item['evidence_type'], | ||
description) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
add check if description == 'n/a'
. If so, do not add description
to response_type
string.
otherwise, CIViC looks good!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
harvester/molecularmatch.py
Outdated
@@ -238,7 +238,7 @@ def convert(evidence): | |||
|
|||
# association['evidence_label'] = direction | |||
association = el.evidence_label(tier, association, na=False) | |||
association = ed.evidence_direction(tier, association, na=False) | |||
association = ed.evidence_direction(evidence['direction'], association, na=False) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is not returning the correct value to response_type
. There are multiple different values from the source with different contexts which should be preserved (for now):
{
'conflicting': 35,
'decreased response': 394,
'no benefit': 265,
'not applicable': 69,
'predicted – resistant': 45,
'predicted – sensitive': 507,
'resistant': 976,
'sensitive': 3449,
'unknown': 6
}
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ahwagner : where did you find these responses in molecularmatch data?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
cat /tmp/molecularmatch_raw.json | jq .direction | sort | uniq
"does_not_support"
"supports"
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I didn't save the code, and now I'm not sure. Looking at MM records, what we want for this field is clearly a combination of top level keys ['direction'] and ['clinicalSignificance']. I'm wondering if I was looking at a different source? Anyway, I'm trying to verify that molecularmatch was the source with the entries that needed adjusting, but search.cancervariants.org is not loading for me right now (timeout errors). I'll try again tonight and hopefully can get you some more info.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ahwagner Re. molecularmatch
association['response_type'] = '{} {}'.format(evidence['direction'], evidence['clinicalSignificance'])
before
after
@@ -215,7 +215,7 @@ def _enrich_feature(gene, feature): | |||
association = el.evidence_label(clinical['level'], | |||
association, na=True) | |||
association = ed.evidence_direction(clinical['level_label'], | |||
association, na=True) | |||
association, na=False) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That was a surprisingly easy fix. Live data look good.
Corrects evidence_direction see #122
@ahwagner latest data pushed onto search.cancervariants.org