[JIT][arm64] recognize Interlocked.Increment in volatile increment #60021

EgorBo · 2021-10-05T19:09:10Z

volatile int a;

void Test() => a++;

Currently emits two dmb:

; Method Program:Test():this
G_M27469_IG01:              ;; offset=0000H
        A9BF7BFD          stp     fp, lr, [sp,#-16]!
        910003FD          mov     fp, sp
						;; bbWeight=1    PerfScore 1.50

G_M27469_IG02:              ;; offset=0008H
        B9400801          ldr     w1, [x0,#8]
        D50339BF          dmb     ishld
        52800022          mov     w2, #1
        0B020021          add     w1, w1, w2
        D5033BBF          dmb     ish
        B9000801          str     w1, [x0,#8]
						;; bbWeight=1    PerfScore 25.00

G_M27469_IG03:              ;; offset=0020H
        A8C17BFD          ldp     fp, lr, [sp],#16
        D65F03C0          ret     lr
						;; bbWeight=1    PerfScore 2.00
; Total bytes of code: 40

I guess we can use armv8.1 LDADDAL (acquire and release) here? to get:

; Method Program:Test2():this
G_M27469_IG01:              ;; offset=0000H
        A9BF7BFD          stp     fp, lr, [sp,#-16]!
        910003FD          mov     fp, sp
						;; bbWeight=1    PerfScore 1.50

G_M27469_IG02:              ;; offset=0008H
        B940001F          ldr     wzr, [x0]
        D2800101          mov     x1, #8
        8B010000          add     x0, x0, x1
        52800021          mov     w1, #1
        B8E10000          ldaddal w1, w0, [x0]
						;; bbWeight=1    PerfScore 7.50

G_M27469_IG03:              ;; offset=001CH
        A8C17BFD          ldp     fp, lr, [sp],#16
        D65F03C0          ret     lr
						;; bbWeight=1    PerfScore 2.00
; Total bytes of code: 36

Basically replace

[000006] -A-XGO------              *  ASG       int   
[000005] V--XGO-N----              +--*  FIELD     int    a
[000000] ------------              |  \--*  LCL_VAR   ref    V00 this         
[000004] ---XGO------              \--*  ADD       int   
[000002] V--XGO-N----                 +--*  FIELD     int    a
[000001] ------------                 |  \--*  LCL_VAR   ref    V00 this         
[000003] ------------                 \--*  CNS_INT   int    1

with GT_XADD(GT_FIELD)

am I correct? @dotnet/jit-contrib

The text was updated successfully, but these errors were encountered:

ghost · 2021-10-05T19:09:14Z

Tagging subscribers to this area: @JulieLeeMSFT
See info in area-owners.md if you want to be subscribed.

Issue Details

volatile int a;

void Test() => a++;

Currently emits:

; Method Program:Test():this
G_M27469_IG01:              ;; offset=0000H
        A9BF7BFD          stp     fp, lr, [sp,#-16]!
        910003FD          mov     fp, sp
						;; bbWeight=1    PerfScore 1.50

G_M27469_IG02:              ;; offset=0008H
        B9400801          ldr     w1, [x0,#8]
        D50339BF          dmb     ishld
        52800022          mov     w2, #1
        0B020021          add     w1, w1, w2
        D5033BBF          dmb     ish
        B9000801          str     w1, [x0,#8]
						;; bbWeight=1    PerfScore 25.00

G_M27469_IG03:              ;; offset=0020H
        A8C17BFD          ldp     fp, lr, [sp],#16
        D65F03C0          ret     lr
						;; bbWeight=1    PerfScore 2.00
; Total bytes of code: 40

I guess we can use armv8.1 LDADDAL (acquire and release) here? to get:

; Method Program:Test2():this
G_M27469_IG01:              ;; offset=0000H
        A9BF7BFD          stp     fp, lr, [sp,#-16]!
        910003FD          mov     fp, sp
						;; bbWeight=1    PerfScore 1.50

G_M27469_IG02:              ;; offset=0008H
        B940001F          ldr     wzr, [x0]
        D2800101          mov     x1, #8
        8B010000          add     x0, x0, x1
        52800021          mov     w1, #1
        B8E10000          ldaddal w1, w0, [x0]
						;; bbWeight=1    PerfScore 7.50

G_M27469_IG03:              ;; offset=001CH
        A8C17BFD          ldp     fp, lr, [sp],#16
        D65F03C0          ret     lr
						;; bbWeight=1    PerfScore 2.00
; Total bytes of code: 36

Basically replace

[000006] -A-XGO------              *  ASG       int   
[000005] V--XGO-N----              +--*  FIELD     int    a
[000000] ------------              |  \--*  LCL_VAR   ref    V00 this         
[000004] ---XGO------              \--*  ADD       int   
[000002] V--XGO-N----                 +--*  FIELD     int    a
[000001] ------------                 |  \--*  LCL_VAR   ref    V00 this         
[000003] ------------                 \--*  CNS_INT   int    1

with GT_XADD(GT_FIELD)

am I correct? @dotnet/jit-contrib

Author:	EgorBo
Assignees:	-
Labels:	`tenet-performance`, `area-CodeGen-coreclr`, `untriaged`
Milestone:	-

EgorBo · 2021-10-05T19:34:39Z

NOTE: we have >1000 of volatile fields across the base class libs, perhaps some of the access patterns also can be optimized with Atomics

echesakov · 2021-10-05T19:48:38Z

I am wondering whether in the baseline case (Armv8.0) it should be sufficient to have one memory barrier inserted after the load?

ldr     w1, [x0,#8]
dmb     ishld
mov     w2, #1
add     w1, w1, w2
str     w1, [x0,#8]

Also interesting why we are not using add w1, w1, #1 form of the instruction.

EgorBo added the tenet-performance Performance related issue label Oct 5, 2021

dotnet-issue-labeler bot added area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI untriaged New issue has not been triaged by the area owner labels Oct 5, 2021

EgorBo added this to the 7.0.0 milestone Oct 5, 2021

EgorBo removed the untriaged New issue has not been triaged by the area owner label Oct 5, 2021

EgorBo mentioned this issue Oct 9, 2021

JIT: Optimize redundant memory barriers on arm/arm64 #60219

Merged

ghost added the in-pr There is an active PR which will close this issue when it is merged label Oct 9, 2021

EgorBo self-assigned this Oct 9, 2021

EgorBo closed this as completed in #60219 Oct 13, 2021

ghost removed the in-pr There is an active PR which will close this issue when it is merged label Oct 13, 2021

ghost locked as resolved and limited conversation to collaborators Nov 12, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[JIT][arm64] recognize Interlocked.Increment in volatile increment #60021

[JIT][arm64] recognize Interlocked.Increment in volatile increment #60021

EgorBo commented Oct 5, 2021 •

edited

Loading

ghost commented Oct 5, 2021

EgorBo commented Oct 5, 2021

echesakov commented Oct 5, 2021

[JIT][arm64] recognize Interlocked.Increment in volatile increment #60021

[JIT][arm64] recognize Interlocked.Increment in volatile increment #60021

Comments

EgorBo commented Oct 5, 2021 • edited Loading

ghost commented Oct 5, 2021

EgorBo commented Oct 5, 2021

echesakov commented Oct 5, 2021

EgorBo commented Oct 5, 2021 •

edited

Loading