NIR: asm statement support, Wip #22992

ASVIEST · 2023-11-27T16:05:57Z

GlobalAsm instr is basic asm, which is just str literals or some instrs

Asm instr is GCC extended asm
https://gcc.gnu.org/onlinedocs/gcc/Extended-Asm.html

Asm {
  AsmTemplate {
     Some asm code
     SymUse nimInlineVar # `a`
     Some asm code
   }
  AsmOutputOperand {
     # [asmSymbolicName] constraint (nimVariableName)
     AsmInjectExpr {symUse nimVariableName} # for output it have only one sym (lvalue)
     asmSymbolicName # default: ""
     constraint
  }
  AsmInputOperand {
     # [asmSymbolicName] constraint (nimExpr)
     AsmInjectExpr {symUse nimVariableName} # (rvalue)
     asmSymbolicName # default: ""
     constraint
  }
  AsmClobber {
    "clobber"
  }
}

This structure is simple for codegens (including those what may be in the future like libgccjit or LLVM).
Also it can produce some nim errors without dependencies of backends.

compiler/nir/nirinsts.nim

Araq · 2023-11-28T09:33:44Z

Btw here is my NIR todo:

NIR TODO:

Implement closure calls.
Port block exiting logic from the C codegen.
Port parameter passing and NRVO.
Port 3 opcodes to enable closure iterators.
Port Runtime type information from the C codegen.
Port the few missing magics from the C codegen.

ASVIEST · 2023-11-28T14:30:26Z

# asm (for gcc extended asm it includes also AsmTemplate, AsmOutOperand*, AsmInOperand*)
Emit {
  EmitTarget "Asm"
  Info {
    InfoKind "IsGlobal"
    false
  }

  # only GCC extended asm info.
  # It can be used by backend or not used.
  Info {
    InfoKind "AsmTemplate"
    Verbatim """
      mov %1, %0
      add $1, %0
    """
  }
  # there can be any number of operands
  Info {
   InfoKind "AsmOutOperand"
   SymUse r
   Verbatim ""
   Verbatim "=r"
  }
  Info {
   InfoKind "AsmInOperand"
   SymUse r
   Verbatim ""
   Verbatim "r"
  }

  EmitCode {
    Verbatim """
      mov %1, %0
      add $1, %0 
      :"=r"(
    """
    SymUse result
    Verbatim """
      )
      :"r"(
    """
    SymUse dst
    Verbatim ")"
   """
  }

When using gcc extended asm, it contains useful information for backends.
The EmitCode instruction directly emits code that is a list of Verbatim, SymUse, etc. into c code (as an example).

What do you think about this design ?

Araq · 2023-11-28T15:11:25Z

What do you think about this design ?

It's wrong. You effectively do the GCC specific asm parsing in the Nim frontend, but it belongs into the "NIR to C" backend part. Only the Nim specific parsing should be done in the frontend.

compiler/nir/nirinsts.nim

Araq · 2023-11-30T13:30:05Z

Already 600 lines that accomplishes what exactly? Parsing the clobber lists of GCC's unreadable inline assembler in order to pass it back to GCC to parse it for itself. Sorry, but this is terrible.

ASVIEST · 2023-11-30T18:27:35Z

The initial goal was to help future backends like LLVM or libgccjit in parsing the extended assembler (as auxiliary information) when it is available, since there is no parsing of the assembler itself. It is more logical to do it not on the frontend, but perhaps on an intermediate stage before the backend (Nir -> Nir) that has information about basic things, such as for example assembler syntax (gcc or Visual C++), or on the backend, which is worse. Asm parsing is mainly aimed at this future backends(LLVM, libgccjit), but it have some pluses on other targets (C, C++).

The main plus for C(++) purposes is that the parsing is done on the Nim side, not the compiler, so it can then be parsed, e.g. not allowing such code

proc test(a: int64) =
  asm """
    add %[val], %[a]
    :[a]"=r"(`a`)
    :[val]"r"((long long)(4/2))
  """

var a: int64 = 35
test(a)
echo a

This code compiles (although it shouldn't because it tries to change a parameter), but it can't change the variable a (it stays with the value 35), which is good, but the fact that nim's code is compiled (with invalid behavour) is not pleasant.

proc test(a: var int64) =
  asm """
    add %[val], %[a]
    :[a]"=r"(`a`)
    :[val]"r"((long long)(4/2))
  """

var a: int64 = 35
test(a)
echo a

Here the code compiles again, but again it doesn't change a (for C through GCC), or when compiling in C++ through GCC, a becomes random number.
Because of this, you should write this code instead of the previous one:

proc test(a: var int64) =
  var b = a
  asm """
    add %[val], %[a]
    :[a]"=r"(`b`)
    :[val]"r"((long long)(4/2))
  """
  a = b

var a: int64 = 35
test(a)
echo a

Now it works as it should.

Analysis of gcc extended asm also move some C, C++ compiler exceptions to Nir, which means they can be get in nimsuggest, etc.

Without such analysis, adding support for such cases is simply not possible because the Nim compiler will think of assembly language as just a black box, not knowing what is being changed, etc.
About repetitive parsing of assembler in GCC(or clang) and in Nim is completely true, but it is the simplest thing in all inline asm, the main complexity of it in the interaction of the built-in assembler with the compiler-generated assembler and code, for example, a jump in the assembler should not violate the space of compiler-generated labels, etc.
See bytecodealliance/wasmtime#1041 (comment).
It's really hard work, but not parsing.

Araq · 2023-12-01T04:59:57Z

Ok, well, currently not even "hello world" works with NIR, so can we focus on the essentials please before adding superior logic for inline asm...

ASVIEST · 2023-12-01T10:12:43Z

Ok, then I'll do it without parsing and analyzing the inline asm, for now, and keep the parsing part in a separate file for when NIR is more complete and there's time to add support for assembler analyzing.

arnetheduck · 2023-12-01T10:22:58Z

fwiw, I wanted to do this for nlvm which would benefit from proper asm parsing (so it can set up the correct constraints like gcc does from the nim-inlined asm code) - if the asm parser can be separate, that would certainly be helpful

ASVIEST · 2023-12-01T13:38:43Z

You can't just import. Keep in mind that this only works with NIR instructions (previously with ast, but now it is not supported), and not with strings, since the asm stategment may contain nim symbols and expressions. Therefore, it will be more correct to use this when nlvm generates llvm IR from NIR, and not from ast. For now you can take gcc_extended_asm.nim and replace it's tokenizer by genasm.nim tokenizer (parseAsm iterator). Note: it don't parses labels.

It don't use strutils :)

compiler/nir/nirinsts.nim

ASVIEST · 2023-12-08T13:18:43Z

Currently this can generate code for GCC and VCC inline asm, this can be selected via the -a gcc-like(or msvc-like) flag on nirc. The basic assembler also works. noAsmStackFrame also works, instead of generate __declspec or attribute naked it generates NIM_NAKED C macro. Assembler parsing is separated into a separated file, which is not currently used. Assembly analysis will be offered as rfc.

RFC: nim-lang/RFCs#542

Araq · 2023-12-09T20:07:44Z

Currently this can generate code for GCC and VCC inline asm, this can be selected via the -a gcc-like(or msvc-like) flag on nirc.

The used inline assembler format should be a pragma though, not a command line option and nimrc should have a dialect switch so that we can say "produce C code compatible with VCC etc".

ASVIEST · 2023-12-12T09:59:03Z

Currently this can generate code for GCC and VCC inline asm, this can be selected via the -a gcc-like(or msvc-like) flag on nirc.

The used inline assembler format should be a pragma though, not a command line option and nimrc should have a dialect switch so that we can say "produce C code compatible with VCC etc".

-a flag say "can produce C code from inline asm statements compatible with VCC etc".
If there will be only dialect switch (without -a flag), what dialect will be for example ICC (intel C compiler) ? How dialect can show that C compiler don't support inline asm (like some tcc builds) ? If for ICC etc. it is a different dialect, then the codegen will have to check all dialects, etc. I think it is better to leave the -a flag for customization and add the targetCC (or dialect) flag, which sets the default settings for the selected compiler.

The pragma idea is good for deciding what syntax to generate assembler code with, for example for ICC, the assembler can be either gcc style or vcc style. If code uses an procedure with assembler from an external library, the assembler stmt must have a pragma that explains it gcc(AT&T) or vcc(intel), then it can use libraries that that use gcc or vcc syntax.

ASVIEST · 2023-12-12T16:07:34Z

Now it can resolve asm syntax kind. For example backend Inline asm syntax is {"gcc", "vcc"} and via {.inlineAsmSyntax: "gcc".} can specify that asm stmt with gcc syntax.
NOTE: this pragma not required when CC support only one syntax: gcc/vcc, but recommend to make code more universal.

This reverts commit f0efd4c.

Araq · 2023-12-15T07:27:22Z

compiler/nir/gcc_extended_asm.nim

+    AsmInjectExpr
+    AsmStrVal
+
+  GccAsmNode* = ref object


That's not how we write ASTs in Nim 2023. We now use "packed trees". The files you have touched can guide you.

Araq · 2023-12-15T07:29:09Z

compiler/nir/ast2ir.nim

+  build c.code, info, EmitCode:
+    for i in offset..<n.len:
+      let it = n[i]
+      case it.kind:


That's not how the rest of the codebase indents the case statement.

Araq · 2023-12-15T07:29:39Z

compiler/nir/cir.nim

@@ -31,6 +32,7 @@ type
    Semicolon = ";"
    Comma = ", "
    Space = " "
+    Quote = $'"'


Suggested change

Quote = $'"'

Quote = "\""

Araq · 2023-12-15T07:31:02Z

compiler/nir/cir.nim

+      left = 0
+      for i in 0..s.high:
+        if s[i] == '\n':
+          c.add s[left..i - 1]


What? Why? Didn't you write a parser for the asm? Shouldn't it split it at newlines if they are important?

Araq · 2023-12-15T07:31:35Z

compiler/nir/cir.nim

+
+  var asmTemplate = true
+  var left = 0
+  var s = ""


Why is s mutable?

Araq · 2023-12-15T07:32:34Z

compiler/nir/cir.nim

+          assert (
+            isLastSon(t, code, code.firstSon) and
+            t[code.firstSon].kind == Verbatim
+          ), "Invalid basic asm. Basic asm should be only one verbatim"


Shouldn't the frontend do this?

Araq · 2023-12-15T07:33:06Z

compiler/nir/cir.nim

+      of Code:
+        raiseAssert"not supported"
+
+  of EmitTarget:


Why is a EmitTarget an opcode of its own? We have a pragma annotation system in place for this...

Araq · 2023-12-15T07:35:07Z

700 lines we have to maintain for good all the while "NIR" cannot even do "hello world". Enough of this.

Araq · 2023-12-15T07:36:32Z

Reopen it once NIR can do something useful at all.

ASVIEST added 2 commits November 27, 2023 18:34

Asm (gcc extended asm) for nir

ddaea3b

Update genasm.nim

73eb391

Araq reviewed Nov 27, 2023

View reviewed changes

compiler/nir/nirinsts.nim Show resolved Hide resolved

ASVIEST marked this pull request as draft November 29, 2023 18:19

ASVIEST added 3 commits November 29, 2023 21:54

Asm parsing now working on backends (WIP(

73f9b60

Emit targets

a4ddb3b

Remove asm instrs

03af260

Araq reviewed Nov 30, 2023

View reviewed changes

compiler/nir/nirinsts.nim Outdated Show resolved Hide resolved

ASVIEST added 8 commits December 1, 2023 17:23

Update nirinsts.nim

545d35a

verbatims

5cefc65

gcc asm (simple)

cd5de29

fix lastSon

c0ab580

Seperated file

78a4c61

target properties

e4af3f0

GCC asm c codegen

e9221ae

It don't use strutils :)

Small fix

8419151

Araq reviewed Dec 3, 2023

View reviewed changes

compiler/nir/nirinsts.nim Outdated Show resolved Hide resolved

ASVIEST added 4 commits December 3, 2023 16:34

apply suggestions

57802ed

msvc asm

a1f9c3f

info's

f41557d

fix

fc8b91f

ASVIEST added 2 commits December 4, 2023 15:10

fix asmNoStackFrame

330ae88

fix

21ee838

ASVIEST marked this pull request as ready for review December 4, 2023 18:51

ASVIEST and others added 2 commits December 7, 2023 19:57

small refactor

cc45710

Merge branch 'nim-lang:devel' into nir-asm

1b88194

ASVIEST closed this Dec 8, 2023

ASVIEST reopened this Dec 8, 2023

ASVIEST and others added 2 commits December 12, 2023 13:01

Merge branch 'nim-lang:devel' into nir-asm

c7d3093

inlineAsmSyntax pragma

11a6fe8

ASVIEST and others added 7 commits December 12, 2023 19:51

Merge branch 'nim-lang:devel' into nir-asm

ec8beea

simple dialect switch

540ef8f

Update cir.nim

5d6c354

Update cir.nim

f0efd4c

Revert "Update cir.nim"

811e6ab

This reverts commit f0efd4c.

Merge branch 'nim-lang:devel' into nir-asm

94c7fa0

Merge branch 'nim-lang:devel' into nir-asm

747ead5

Araq reviewed Dec 15, 2023

View reviewed changes

compiler/nir/cir.nim

var asmTemplate = true

var left = 0

var s = ""

Copy link

Member

Araq Dec 15, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why is s mutable?

Araq reviewed Dec 15, 2023

View reviewed changes

Araq closed this Dec 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NIR: asm statement support, Wip #22992

NIR: asm statement support, Wip #22992

ASVIEST commented Nov 27, 2023

Araq commented Nov 28, 2023

ASVIEST commented Nov 28, 2023

Araq commented Nov 28, 2023 •

edited

Loading

Araq commented Nov 30, 2023

ASVIEST commented Nov 30, 2023 •

edited

Loading

Araq commented Dec 1, 2023

ASVIEST commented Dec 1, 2023

arnetheduck commented Dec 1, 2023

ASVIEST commented Dec 1, 2023

ASVIEST commented Dec 8, 2023 •

edited

Loading

Araq commented Dec 9, 2023

ASVIEST commented Dec 12, 2023 •

edited

Loading

ASVIEST commented Dec 12, 2023 •

edited

Loading

Araq Dec 15, 2023

Araq Dec 15, 2023

Araq Dec 15, 2023

Araq Dec 15, 2023

Araq Dec 15, 2023

Araq Dec 15, 2023

Araq Dec 15, 2023

Araq commented Dec 15, 2023

Araq commented Dec 15, 2023

NIR: asm statement support, Wip #22992

NIR: asm statement support, Wip #22992

Conversation

ASVIEST commented Nov 27, 2023

Araq commented Nov 28, 2023

NIR TODO:

ASVIEST commented Nov 28, 2023

Araq commented Nov 28, 2023 • edited Loading

Araq commented Nov 30, 2023

ASVIEST commented Nov 30, 2023 • edited Loading

Araq commented Dec 1, 2023

ASVIEST commented Dec 1, 2023

arnetheduck commented Dec 1, 2023

ASVIEST commented Dec 1, 2023

ASVIEST commented Dec 8, 2023 • edited Loading

Araq commented Dec 9, 2023

ASVIEST commented Dec 12, 2023 • edited Loading

ASVIEST commented Dec 12, 2023 • edited Loading

Araq Dec 15, 2023

Choose a reason for hiding this comment

Araq Dec 15, 2023

Choose a reason for hiding this comment

Araq Dec 15, 2023

Choose a reason for hiding this comment

Araq Dec 15, 2023

Choose a reason for hiding this comment

Araq Dec 15, 2023

Choose a reason for hiding this comment

Araq Dec 15, 2023

Choose a reason for hiding this comment

Araq Dec 15, 2023

Choose a reason for hiding this comment

Araq commented Dec 15, 2023

Araq commented Dec 15, 2023

Araq commented Nov 28, 2023 •

edited

Loading

ASVIEST commented Nov 30, 2023 •

edited

Loading

ASVIEST commented Dec 8, 2023 •

edited

Loading

ASVIEST commented Dec 12, 2023 •

edited

Loading

ASVIEST commented Dec 12, 2023 •

edited

Loading