Rabbit VM RFC

Why?

I would like to make a RISC architecture that is capable of comfortably sitting on top of nearly any other architecture. If it’s possible to compile to Rabbit, then a program can run on more or less any hardware or virtualized architecture.

Rabbit can be optimized per-architecture, while maintaining the same interface. It may, for example, take advantage of Intel’s SIMD behind the scenes.

What?

Definitions

space: A register or memory location.

Registers

Registers are 32 bits wide.

Value	Register	Use
`0x0`	`zero`	Contains 0. MIPS style.
`0x1` .. `0x9`	`r1` .. `r9`	General purpose.
`0xA`	`ip`	Instruction pointer.
`0xB`	`sp`	Stack pointer.
`0xC`	`ret`	Returned value.
`0xD`	`tmp`	Temporary register.
`0xE`	`flags`	Flags used for comparison.

<<flags_section>>

Flags

Bit	Flag	Meaning
`0x0`	`SF`	Sign flag. On if sign bit of result is on.
`0x1`	`ZF`	Zero flag. On if result is zero or numbers were the same.
`0x2` .. `0x20`		Reserved.

Instruction set

Real instructions

When it makes sense, the destination register is the first argument to an instruction. The last argument to the following instructions may also be an immediate value, denoted with a prefix of $: move, add, sub, mul, div, shr, shl, nand, xor, br, brz, brnz.

Value	Instruction	Usage	Explanation	Description
`0x0`	`halt`	`halt`		Stop the execution of the machine immediately.
`0x1`	`move`	`move %rB, %rC`	`r[B] := r[C]`	Move one space into another.
`0x2`	`add`	`add %rA, %rB, %rC`	`r[A] := r[B] + r[C]`	Add two spaces into a third.
`0x3`	`sub`	`sub %rA, %rB, %rC`	`r[A] := r[B] - r[C]`	Subtract two spaces into a third. Sets flags.
`0x4`	`mul`	`mul %rA, %rB, %rC`	`r[A] := r[B] * r[C]`	Multiply two spaces into a third.
`0x5`	`div`	`div %rA, %rB, %rC`	`r[A] := r[B] / r[C]`	Divide two spaces into a third.
`0x6`	`shr`	`shr %rA, %rB, %rC`	`r[A] := r[B] >> r[C]`	Shift right one space a number of times.
`0x7`	`shl`	`shl %rA, %rB, %rC`	`r[A] := r[B] << r[C]`	Shift left one space a number of times.
`0x8`	`nand`	`nand %rA, %rB, %rC`	`r[A] := not(r[B] & r[C])`	NAND two spaces.
`0x9`	`xor`	`xor %rA, %rB, %rC`	`r[A] := r[B] ^ r[C]`	XOR two spaces.
`0xA`	`br`	`br %rC`	`goto r[C]`	Branch.
`0xB`	`brz`	`brz %rC`	`if (ZF set) goto r[C]`	Branch if `ZF` is set.
`0xC`	`brnz`	`brnz %rC`	`if (!(ZF set)) goto r[C]`	Branch if `ZF` is not set.
`0xD`	`in`	`in %rC`	`r[C] := getchar()`	Read one character from `stdin` into a space.
`0xE`	`out`	`out %rC`	`putchar(r[C])`	Print one character from a space to `stdout`.

Assembler macros

The last argument to the following macros may also be an immediate value, denoted with a prefix of $: cmp, not, push, call.

Macro	Usage	Expansion
`cmp`	`cmp A, B`	`sub %tmp, A, B`
`not`	`not A, B`	`nand A, B, B`
`or`	`or A, B, C`	`(A nand A) nand (B nand B)`
`and`	`and A, B, C`	`nand A, B, C` // `not A, A`
`push`	`push A`	`move (%sp), A` // `sub %sp, %sp, $1`
`pop`	`pop A`	`add %sp, %sp, $1` // `move A, (%sp)`
`call`	`call A`	`push %ip` // `br A`
`ret`	`ret`	`pop %ip`

Addressing modes

There are two addressing modes: %reg and (%reg). The former uses the value in the register, and the latter uses the word at the address in the register.

How?

Instruction formats

instr %rA, %rB, %rC
instr %rA, %rB
instr %rA

     +-----Immediate bit
     |+----Addressing mode bit C
     ||+---Addressing mode bit B
     |||+--Addressing mode bit A
     |||| +Dead space+     regB
     vvvv vvvvvvvvvvvv     vvvv
IIII MDDD 000000000000 CCCCBBBBAAAA
^^^^                   ^^^^    ^^^^
Opcode                 regC    regA

VVVVVVVVVVVVVVVVVVVVVVVVVVVVVVVV
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Immediate value

Every bit in “Dead space” must be turned off. If one is turned on, the result is undefined.

If the immediate bit is on, then the instruction disregards rC and instead looks for its third argument in the 32 bits after the first instruction. For example:

1: 0001 1 000 000000000000 0001 0000 0000
2: 0000 0 000 000000000000 0000 0000 0111

represents a move instruction with the immediate bit set. It will therefore look for an immediate value in the following word (in this case, the value is 7), and then store it in r1.

Addition works in a similar fashion:

1: 0010 1 000 000000000000 0001 0001 0000
2: 0000 0 000 000000000000 0000 0000 0001

represents an add instruction with the immediate bit set. It looks for an immediate value in the following word (in this case, 1), adds it to the value in r1, then stores the result in r1. So this instruction would be an increment instruction.

The addressing mode bits are simple; if a register’s addressing mode bit is on, then the address in the register is dereferenced when the instruction is being executed, and that data is used instead. For example:

1: 0010 0 100 000000000000 0111 0001 0000

Performs an addition operation that adds the contents of zero with r1 and stores the result in memory at the address in r7.

Stages of compilation

Preprocessing

The preprocessor will be responsible for macro expansion and label to address translation. Macros exist in the form of instruction expansions, done behind the scenes.

Peephole optimization

Assembling

Floating point

Floating point computation is left to the client (an exercise for the reader, if you will).

Memory layout

The memory layout is completely flat right now.

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.org		README.org
assembler.c		assembler.c
bif.rbt		bif.rbt
blogstyle.css		blogstyle.css
deref.rbt		deref.rbt
disassembler.c		disassembler.c
rabbit.c		rabbit.c
rabbit.html		rabbit.html
rabbit.org		rabbit.org
rabbit.pdf		rabbit.pdf
rabbit_bif.h		rabbit_bif.h
rabbit_codewords.c		rabbit_codewords.c
rabbit_codewords.h		rabbit_codewords.h
rabbit_io.c		rabbit_io.c
rabbit_io.h		rabbit_io.h
rabbit_types.h		rabbit_types.h
rabbitvm.html		rabbitvm.html
rabbitvm.org		rabbitvm.org
rabbitvm.pdf		rabbitvm.pdf
style.css		style.css
test.rbt		test.rbt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Rabbit VM RFC

Why?

What?

Definitions

Registers

Flags

Instruction set

Real instructions

Assembler macros

Addressing modes

How?

Instruction formats

Stages of compilation

Preprocessing

Peephole optimization

Assembling

Floating point

Memory layout

About

Releases

Packages

Contributors 2

Languages

License

RabbitVM/rabbit

Folders and files

Latest commit

History

Repository files navigation

Rabbit VM RFC

Why?

What?

Definitions

Registers

Flags

Instruction set

Real instructions

Assembler macros

Addressing modes

How?

Instruction formats

Stages of compilation

Preprocessing

Peephole optimization

Assembling

Floating point

Memory layout

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages