preserve decimal point in float INFO fields #980

pontikos · 2019-03-12T17:46:24Z

INFO fields of type Float should keep a decimal point even when the fractional part is zero,
i.e. 70.0 instead of 70.
Printing the value without a decimal point breaks GATK.

jkbonfield · 2019-03-12T17:59:11Z

This has come up before, although I'm struggling to find the issue. Maybe it was over in htsjdk land.

Anyway, this is a parsing bug in GATK, not in bcftools output. Floating point numbers are a superset of integers. "70" is still a valid floating point number and C "atof" and "strtod" functions quite happily accept whole numbers.
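
To illustrate the point about "strtod", here is a minimal sketch of a strict float parser. The helper name parse_float_field is hypothetical (it is not part of htslib or bcftools); it only shows that a standard C parser accepts "70" and "70.0" as the same value.

```c
#include <errno.h>
#include <stdlib.h>

/* Hypothetical helper, not part of htslib: parse an entire string as a
 * double. Returns 1 on success and stores the value in *out, 0 otherwise. */
int parse_float_field(const char *s, double *out)
{
    char *end;
    double v;

    errno = 0;
    v = strtod(s, &end);

    /* Reject empty input, trailing junk, and out-of-range values. */
    if (end == s || *end != '\0' || errno == ERANGE)
        return 0;

    *out = v;
    return 1;
}
```

With this helper, parsing "70" and "70.0" both succeed and give 70, while a non-numeric string such as "abc" is rejected.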

While I guess we could change all floating point numbers to include .0 if they are whole numbers, it needlessly wastes space and isn't the correct solution.

pontikos · 2019-03-12T18:08:48Z

Ok I've posted on GATK github:

broadinstitute/gatk#5789

I agree that it seems silly that GATK falls over when a decimal point is missing for a float.

I hope htsjdk (assuming that's what GATK are using) and htslib can agree on this.

pd3 · 2019-03-13T09:12:53Z

Yes, this is a silly bug in GATK and we will not address this in bcftools / htslib. As a workaround, you can "fix" the numbers to GATK's liking using this script https://github.com/samtools/bcftools/blob/develop/misc/fix-broken-GATK-Double-vs-Integer

pontikos · 2019-03-13T09:20:22Z

Thanks! I also wrote a script to fix it. GATK don't want to fix it, as GATK 3 is no longer maintained. If you could point me to the line of code in bcftools that does this, I can change it in my own copy.

jkbonfield · 2019-03-13T11:40:02Z

It's probably kputd in kstring.c. This uses %g to print floats if they are very large or very small, and otherwise emulates the printf %g format itself. The z[-1] = 0 line MAY be responsible, so you would need to edit that along with the trailing zero removal, but you'll need to experiment. Note though that this is just following the normal printing mechanism. E.g. try printf on the command line:

jkb$ printf "%g\n" 0.170
0.17
jkb$ printf "%g\n" 1.70
1.7
jkb$ printf "%g\n" 17.0
17

"17", not "17.0"!
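
For anyone who wants the "17.0" form in a private build, one possible approach is to post-process the %g output rather than change the formatting logic. The sketch below is an assumption-laden illustration, not the kputd code: format_float_keep_point is a hypothetical name, and it assumes the default "C" locale, where the decimal separator is ".".

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical helper, not from kstring.c: format v like printf("%g") but
 * append ".0" when the result looks like a plain whole number.
 * Returns the string length, or -1 if buf is too small. */
int format_float_keep_point(double v, char *buf, size_t len)
{
    int n = snprintf(buf, len, "%g", v);
    if (n < 0 || (size_t)n >= len)
        return -1;

    /* Leave the text alone if it already has a '.', an exponent ('e'),
     * or is "inf"/"nan" (both contain 'n'; "nan" also contains 'a'). */
    if (strpbrk(buf, ".ena") != NULL)
        return n;

    /* Need room for ".0" plus the terminating NUL. */
    if ((size_t)n + 2 >= len)
        return -1;

    memcpy(buf + n, ".0", 3);
    return n + 2;
}
```

Under these assumptions 17.0 is written as "17.0" and 0.170 stays "0.17". Values that %g prints in exponent form are left unchanged, so whether GATK accepts those would still need checking.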

pontikos mentioned this issue Mar 12, 2019

allow for no decimal point in float INFO fields broadinstitute/gatk#5789

Closed
