arm: Fix floating point arguments in arm #1135

honggyukim · 2020-03-05T07:25:20Z

A designated initializer in C can be compiled into an implicit memset
call on the arm architecture. That memset clobbers VFP registers
unexpectedly, so this patch fixes the problem by replacing the
designated initializer with an explicit 'mcount_memset4'.
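
The change described above can be sketched roughly as follows. This is
only an illustration: 'sketch_memset4' and 'struct sketch_trigger' are
hypothetical stand-ins for mcount_memset4() and struct uftrace_trigger,
and the real definitions in uftrace may differ.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical stand-in for mcount_memset4(); the real helper may differ.
 * It fills memory one 32-bit word at a time with plain integer stores.
 * The volatile qualifier is an assumption of this sketch: it keeps the
 * compiler from recognizing the loop and substituting a memset() call.
 */
static void sketch_memset4(void *dst, int c, size_t n)
{
	volatile uint32_t *p = dst;
	uint32_t word = (uint8_t)c * 0x01010101u;
	size_t i;

	/* only whole 32-bit words are handled */
	assert(n % sizeof(uint32_t) == 0);

	for (i = 0; i < n / sizeof(uint32_t); i++)
		p[i] = word;
}

/* Hypothetical struct standing in for struct uftrace_trigger. */
struct sketch_trigger {
	unsigned long flags;
	int depth;
	int color;
};

/* Before: the compiler is free to lower this initializer to memset(). */
void init_with_initializer(struct sketch_trigger *out)
{
	struct sketch_trigger tr = { .flags = 0, };

	*out = tr;
}

/* After: zero the struct explicitly, word by word. */
void init_with_memset4(struct sketch_trigger *out)
{
	struct sketch_trigger tr;

	sketch_memset4(&tr, 0, sizeof(tr));
	*out = tr;
}
```

Both functions produce the same zeroed struct. The difference is only in
which instructions run on the way, which matters here because the code
runs before the traced function has consumed its VFP argument registers.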

In addition, the offset of the VFP registers was incorrect because the
following commit pushes the d1 register on top of the original d0.

ffb69ce arm: Handle struct return type by keeping more regs on stack

So the offset is adjusted from -2 to -4 to cope with the change.
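
The offset arithmetic can be illustrated with a small sketch. The stack
layout below is an assumption inferred from the code comment in this
patch ("d0, d1 registers (64 bit) were saved below the r0"); the function
name and the use of uint32_t instead of long are choices made for this
sketch so that it behaves the same on a 64-bit host.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/*
 * Assumed layout in 32-bit words, from low to high address:
 *   [0..1] d0   [2..3] d1   [4] r0 ...
 * Each 64-bit VFP register takes two words, so saving d0 and d1
 * places d0 four words below r0. With only d0 saved it was two.
 */
enum {
	D0_WORDS = 2,
	D1_WORDS = 2,
	VFP_SAVE_WORDS = D0_WORDS + D1_WORDS,	/* 4, was 2 before d1 */
};

/* Read a float return value (low word of d0) relative to the r0 slot. */
float read_float_retval(const uint32_t *retval)
{
	const uint32_t *float_retval;
	float f;

	assert(retval != NULL);

	float_retval = retval - VFP_SAVE_WORDS;
	memcpy(&f, float_retval, sizeof(f));
	return f;
}
```

With the old offset of -2, the same read would land on the saved d1
instead of d0, which is consistent with the wrong return value in the
"Before" output below.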

Source:

  #include <stdio.h>

  float float_add(float a, float b)
  {
          fprintf(stderr, "a = %f, b = %f\n", a, b);
          return a + b;
  }

  int main(int argc, char *argv[])
  {
          double c;

          c = float_add(-0.1, 0.2);
          fprintf(stderr, "c = %f\n", c);
          return c > 0;
  }

Before:

  $ uftrace -a -F main a.out
  a = 0.000000, b = 0.000000
  c = 0.000000
  # DURATION     TID     FUNCTION
              [ 25362] | main(1, 0x7ea25344) {
              [ 25362] |   float_add(0.000000, 0.000000) {
   503.228 us [ 25362] |     fprintf(&_IO_2_1_stderr_, "a = %f, b = %f\n") = 27;
   511.197 us [ 25362] |   } = 0.000000; /* float_add */
     9.687 us [ 25362] |   fprintf(&_IO_2_1_stderr_, "c = %f\n") = 13;
   531.977 us [ 25362] | } = 0; /* main */

After:

  $ uftrace -a -F main a.out
  a = -0.100000, b = 0.200000
  c = 0.100000
  # DURATION     TID     FUNCTION
              [ 25146] | main(1, 0x7edbb344) {
              [ 25146] |   float_add(-0.100000, 0.200000) {
   501.769 us [ 25146] |     fprintf(&_IO_2_1_stderr_, "a = %f, b = %f\n") = 28;
   509.321 us [ 25146] |   } = 0.100000; /* float_add */
    12.500 us [ 25146] |   fprintf(&_IO_2_1_stderr_, "c = %f\n") = 13;
   533.539 us [ 25146] | } = 1; /* main */

Fixed: #1088

Signed-off-by: Honggyu Kim [email protected]

namhyung · 2020-03-09T00:41:59Z

arch/arm/mcount-support.c

+		/* d0, d1 registers (64 bit) were saved below the r0 */
+		long *float_retval = ctx->retval - 4;
+
+		memcpy(ctx->val.v, float_retval, spec->size);


Why not use mcount_memcpy4() here?

honggyukim · 2020-03-09

I cannot test this now, but I changed it to mcount_memcpy4 as you suggested.
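
For context, a word-wise copy of the kind suggested here might look like
the sketch below. 'sketch_memcpy4' is a hypothetical stand-in; the actual
mcount_memcpy4() in uftrace may be written differently. The motivation,
as far as this thread indicates, is the same as for mcount_memset4: avoid
calling a libc routine that may touch VFP registers.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical stand-in for mcount_memcpy4(): copy n bytes as 32-bit
 * words using integer loads and stores only. The volatile qualifiers
 * are an assumption of this sketch, used to keep the compiler from
 * replacing the loop with a memcpy() call.
 */
void sketch_memcpy4(void *dst, const void *src, size_t n)
{
	volatile uint32_t *d = dst;
	const volatile uint32_t *s = src;
	size_t i;

	/* float (4 bytes) and double (8 bytes) are both whole words */
	assert(n % sizeof(uint32_t) == 0);

	for (i = 0; i < n / sizeof(uint32_t); i++)
		d[i] = s[i];
}
```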

namhyung · 2020-03-15T14:49:30Z

libmcount/plthook.c

+	struct uftrace_trigger tr;
+
+	mcount_memset4(&tr, 0, sizeof(tr));
+	tr.flags = 0;


You don't need to reset the flags anymore. :)

honggyukim · 2020-03-15

I left it to keep the original code, but I have removed it and updated the patch anyway.


namhyung

LGTM

honggyukim force-pushed the check/fix-arm-float-args branch 2 times, most recently from 6647e85 to f3b678f, on March 5, 2020 23:54

namhyung reviewed Mar 9, 2020

honggyukim force-pushed the check/fix-arm-float-args branch from f3b678f to 6519cd0 on March 9, 2020 08:54

namhyung reviewed Mar 15, 2020

honggyukim force-pushed the check/fix-arm-float-args branch from 6519cd0 to 0752223 on March 15, 2020 23:57

namhyung approved these changes Mar 16, 2020

namhyung merged commit 19e6f0d into namhyung:master Mar 16, 2020

honggyukim deleted the check/fix-arm-float-args branch March 16, 2020 03:08
