-
Notifications
You must be signed in to change notification settings - Fork 602
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[bugfix] fgbio error rate by read position per-base plots #1251
[bugfix] fgbio error rate by read position per-base plots #1251
Conversation
If the error rate for any base-to-base (ex. A>C/a_to_c_error_rate column ) error was higher than the overall error rate (error_rate column), then it would be filtered out by y_max. This fix calculates y_max across not only the overall error rate (across positions), but also the base-to-base error rate columns.
@ewels I was suprised that setting |
This is where data is discarded: https://github.com/ewels/MultiQC/blob/fa7e0bf8e7c5f7a6377bd217c13b3cbb12a70b0a/multiqc/plots/linegraph.py#L139 |
@ewels any thoughts on getting this merged and a new release? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good, thanks!
Apologies @nh13 - I've been swamped with hackathons and conferences lately and am focussing on @nf-core stuff (which has waited longer than MultiQC for a release). I'll get back to MultiQC + a new release ASAP, but won't be for a few weeks as tomorrow is my last day before some holiday. In the mean time I've merged this so it's at least fixed on |
ps. Yes I agree that it seems odd and a little dangerous to discard data when above
If you look in the same commit, I was also modifying the Preseq module - from memory that was spitting out these huge exponential curves with thousands and thousands of points. So in that context it kind of made sense, but I agree that if it's just ditching a single point in an otherwise bumpy line then this looks like missing data and is misleading. I guess the correct solution here is to only discard those points if the curve doesn't come back under Phil |
x-ref #1257 |
Thanks! Enjoy your vacation! |
@ewels any chance we could have a release with this included? |
If the error rate for any base-to-base (ex. A>C/a_to_c_error_rate column ) error
was higher than the overall error rate (error_rate column), then it
would be filtered out by y_max.
This fix calculates y_max across not only the overall error rate (across
positions), but also the base-to-base error rate columns.
CHANGELOG.md
has been updated