Implement first-class functions #468

jfecher · 2022-11-14T10:45:08Z

Related issue(s)

Makes progress on #467 but does not resolve it, since this PR contains only the internal/refactoring changes required to add lambdas and function types.

Summary of changes

Implements first-class functions in Noir on top of PR #462. Most of the changes revolve around changing call expressions to accept any expression in the function position, and allowing arbitrary variables to refer to functions.
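To illustrate what "any expression in the function position" means, here is a minimal sketch over a toy AST. It is not the Noir frontend: `Expr`, `Value`, `Env`, `Builtin`, and `eval` are hypothetical names invented for this example, and the function table is an assumption. The point is only that the callee is evaluated like any other expression before the call happens.

```rust
use std::collections::HashMap;

/// Hypothetical toy AST; these are not the actual noirc_frontend types.
#[derive(Debug, Clone)]
enum Expr {
    Lit(i64),
    /// A variable, which may hold a number or refer to a function.
    Var(String),
    If(Box<Expr>, Box<Expr>, Box<Expr>),
    /// The callee is an arbitrary expression rather than a fixed function name.
    Call(Box<Expr>, Vec<Expr>),
}

#[derive(Debug, Clone, Copy, PartialEq)]
enum Value {
    Int(i64),
    /// A first-class function value: an index into `Env::funcs`.
    Func(usize),
}

type Builtin = fn(&[i64]) -> i64;

struct Env {
    vars: HashMap<String, Value>,
    funcs: Vec<Builtin>,
}

fn eval(env: &Env, expr: &Expr) -> Result<Value, String> {
    match expr {
        Expr::Lit(n) => Ok(Value::Int(*n)),
        Expr::Var(name) => env
            .vars
            .get(name)
            .copied()
            .ok_or_else(|| format!("unbound variable {name}")),
        Expr::If(cond, then, otherwise) => match eval(env, cond)? {
            Value::Int(0) => eval(env, otherwise),
            Value::Int(_) => eval(env, then),
            Value::Func(_) => Err("condition must be an integer".to_string()),
        },
        Expr::Call(callee, args) => {
            // Evaluate the callee first, then require that it is a function.
            let index = match eval(env, callee)? {
                Value::Func(index) => index,
                Value::Int(_) => return Err("callee is not a function".to_string()),
            };
            let func = env
                .funcs
                .get(index)
                .copied()
                .ok_or_else(|| format!("no function at index {index}"))?;
            let mut values = Vec::with_capacity(args.len());
            for arg in args {
                match eval(env, arg)? {
                    Value::Int(n) => values.push(n),
                    Value::Func(_) => {
                        return Err("function-typed arguments are out of scope here".to_string())
                    }
                }
            }
            Ok(Value::Int(func(&values)))
        }
    }
}
```

With this shape, a call whose callee is an `If` expression selecting between two function-valued variables is evaluated the same way as a call through a plain name.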

Test additions / changes

None. Waiting for #467 to be completely resolved before testing lambdas and first-class functions.

Checklist

I have tested the changes locally.
I have formatted the changes with Prettier and/or cargo fmt with default settings.
I have linked this PR to the issue(s) that it resolves.
I have reviewed the changes on GitHub, line by line.
I have ensured all changes are covered in the description.

Additional context

This PR is mostly internal changes; a follow-up PR can implement lambdas and function types.

guipublic

My feedback regarding #462 still applies here: the frontend should not modify the control flow.

guipublic

Overall it's a good start, but it needs some rework.
I would start by merging builtin and standard functions so that the SSA pass treats them mostly the same: a builtin function would get a FuncId and a FuncIndex.
For instance, the FuncIndex could match the OPCODE so we can easily tell when a function is a builtin.
Then I would remove FunctionObj/BuiltinObj and directly use a NodeId of type Field, which would represent a function pointer, i.e. its value is equal to the FuncIndex it references. Being a NodeObj, it will be handled properly by the SSA throughout all the passes.
Finally, we would need to implement "recursive inlining" in the inlining process of main, i.e. when we reach a call instruction, we evaluate the function pointer and inline the function call before processing the other instructions (and return an unimplemented!() error if we cannot evaluate it).

This mechanism could easily be extended later to support any function pointer (i.e. not only the ones known at compile time).
N.B. If you want, I can take it from here.
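The proposal above can be sketched roughly as follows. This is an illustration under stated assumptions, not code from the PR: `BUILTIN_COUNT`, `Callee`, `FuncPtr`, `classify`, and `resolve_callee` are hypothetical names, and reserving the lowest indices for builtins is just one way a function index could "match the OPCODE". An error result stands in for the unimplemented!() case.

```rust
/// Hypothetical: indices below this bound are reserved for builtins, so a
/// builtin's function index is the same number as its opcode.
const BUILTIN_COUNT: u32 = 3;

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Callee {
    /// A builtin, identified by its opcode number.
    Builtin(u32),
    /// A user-defined function, identified by its position among user functions.
    UserDefined(u32),
}

/// The value in the function position of a call instruction.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum FuncPtr {
    /// The function index is a compile-time constant.
    Const(u32),
    /// The function index could not be evaluated during inlining.
    Unknown,
}

/// Decide whether a function index refers to a builtin or a user function.
fn classify(func_index: u32) -> Callee {
    if func_index < BUILTIN_COUNT {
        Callee::Builtin(func_index)
    } else {
        Callee::UserDefined(func_index - BUILTIN_COUNT)
    }
}

/// Resolve the callee of a call instruction before inlining it.
fn resolve_callee(ptr: FuncPtr) -> Result<Callee, String> {
    match ptr {
        FuncPtr::Const(index) => Ok(classify(index)),
        // Stands in for the unimplemented!() case described in the comment.
        FuncPtr::Unknown => Err("function pointer is not known at compile time".to_string()),
    }
}
```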

jfecher · 2022-12-02T16:36:48Z

Thank you for the review. This PR isn't nearly done though: there are still bugs when calling builtin functions, and first-class functions aren't implemented at all yet (there is no recursive inlining, as you mentioned).

I agree that the Function and Builtin NodeObj variants can be merged; I was considering doing so myself. I think both could be put inside a single Function node, though it would have to contain an enum of either a FuncId or an OPCODE internally, as FuncIds always correspond to a function in the Ast. I do not think representing functions as Fields is a good idea just because Fields are already supported: functions are not fields, and it would be meaningless to support Field operations like addition on functions. Instead, we should keep Functions as separate objects to reduce bugs in the future and make the code easier to follow.
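A rough sketch of the alternative described here, assuming simplified stand-ins for the real SSA types: `FuncId`, `Opcode`, `FunctionKind`, `NodeObj`, and `add` below are hypothetical, the opcode variants are placeholders, and a wrapping `u64` stands in for a real field element. Keeping functions as their own variant lets arithmetic on them be rejected instead of silently treated as Field arithmetic.

```rust
/// Hypothetical id of a function defined in the Ast.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct FuncId(u32);

/// Hypothetical builtin opcodes; the variants are placeholders.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Opcode {
    Sha256,
    ToBits,
}

/// A single function node covering both user-defined and builtin functions.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum FunctionKind {
    Normal(FuncId),
    Builtin(Opcode),
}

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum NodeObj {
    /// Simplification: a plain integer instead of a prime-field element.
    Field(u64),
    Function(FunctionKind),
}

/// Addition is only defined on fields; functions are rejected explicitly.
fn add(lhs: NodeObj, rhs: NodeObj) -> Result<NodeObj, String> {
    match (lhs, rhs) {
        (NodeObj::Field(a), NodeObj::Field(b)) => Ok(NodeObj::Field(a.wrapping_add(b))),
        _ => Err("addition is not defined on function values".to_string()),
    }
}
```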

(Edit: I would like to have this PR as a draft while it is unfinished but I cannot seem to revert it to one without closing the PR and reopening it)

vezenovm · 2023-01-05T16:37:37Z

Looks like a couple tests are failing now. Once these are passing I think this looks good.

guipublic

It is globally OK for me; my main concerns are these 3 points:

The frontend is not checking function signatures, so you can call f(x) with f = foo when foo is declared as fn foo() or fn foo(x, y).
There is some confusion about the returned arrays; you should remove all the modifications you made to them.
You did not implement recursive inlining, but since it is not needed for basic use cases and this PR has already been open for too long, we should do it in a separate PR.

crates/nargo/tests/test_data/9_conditional/src/main.nr

crates/noirc_evaluator/src/ssa/node.rs

crates/noirc_evaluator/src/ssa/function.rs

crates/noirc_evaluator/src/ssa/context.rs

crates/noirc_evaluator/src/ssa/function.rs

crates/noirc_evaluator/src/ssa/node.rs

jfecher · 2023-01-11T16:41:46Z

Addressing the first point:

The frontend is not checking function signatures, so you can call f(x) with f = foo when foo is declared as fn foo() or fn foo(x, y).

This is incorrect: function types are still checked, and you can verify this by changing argument types, counts, etc. The way they are checked is now slightly different. When we have a function call, we no longer have direct access to the function id; instead, we have the function type from the function expression. Consider the call expression (if c { add1 } else { return_some_function() })(5). The exact function being called is unknown, but the type checker still knows it has the type Field -> Field (presumably), and it can use this to typecheck the argument 5 and get the expected return type of Field.
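A minimal sketch of checking a call from the callee's type alone, without knowing which function is called. `Type` and `check_call` are hypothetical names for this example; plain equality stands in for whatever unification the real type checker performs, and the argument count comparison is included as part of the same check.

```rust
/// Hypothetical, simplified type representation.
#[derive(Debug, Clone, PartialEq)]
enum Type {
    Field,
    Bool,
    /// Parameter types and return type.
    Function(Vec<Type>, Box<Type>),
}

/// Check a call given only the type of the callee expression and the
/// types of the arguments; returns the type of the call expression.
fn check_call(callee: &Type, args: &[Type]) -> Result<Type, String> {
    match callee {
        Type::Function(params, ret) => {
            // Argument count must match the parameter count.
            if params.len() != args.len() {
                return Err(format!(
                    "expected {} argument(s), found {}",
                    params.len(),
                    args.len()
                ));
            }
            // Each argument must match the corresponding parameter type.
            for (i, (param, arg)) in params.iter().zip(args).enumerate() {
                if param != arg {
                    return Err(format!(
                        "argument {i}: expected {param:?}, found {arg:?}"
                    ));
                }
            }
            Ok((**ret).clone())
        }
        other => Err(format!("expected a function, found {other:?}")),
    }
}
```

For a callee of type Field -> Field, passing one Field argument yields Field, while passing zero or two arguments is rejected, whichever branch of the `if` supplies the function at runtime.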

guipublic · 2023-01-11T16:47:42Z

The exact function being called is unknown but the type checker still knows it has the type Field -> Field (presumably), and it can use this to typecheck the argument 5 and get the expected return type of Field.

Yes I agree, so shouldn't we do this?

jfecher · 2023-01-11T16:48:41Z

Yes I agree, so shouldn't we do this?

I am confused about what you mean; this PR already does this.

Edit: I believe I found what you're referring to. It looks to be a bug, but it doesn't seem to happen in all cases. I'll investigate further. The intention was always to check these, so allowing this bug through is not an option.

jfecher · 2023-01-11T17:12:08Z

Indeed, the if param_len != arg_len { check from type_check_function_call went missing once that function was removed. I've re-added it in bind_function.

jfecher

Most of the comments seem to be based on the returned_arrays handling. I'd like to remove this as well; I just ran into difficulty originally. One easy way to remove them would be to create a fresh, empty array at each callsite. Assuming these ids are truly temporary, this could be fine.

Note: I assume these ids are temporary now based on previous talks and code such as:

```
let mut a1 = get_array();
let a2 = get_array();
```

which should not both refer to the same array.
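The "fresh array at each callsite" idea can be sketched as follows. This is an assumption-laden illustration, not the evaluator's code: `ArrayId`, `Context`, `new_array`, and `call_returning_array` are hypothetical, and only the array length is recorded.

```rust
/// Hypothetical identifier for an array tracked by the SSA context.
#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
struct ArrayId(u32);

/// Hypothetical context that records the length of every array it creates.
#[derive(Default)]
struct Context {
    array_lens: Vec<u32>,
}

impl Context {
    /// Allocate a new array and return its id; ids are never reused.
    fn new_array(&mut self, len: u32) -> ArrayId {
        let id = ArrayId(self.array_lens.len() as u32);
        self.array_lens.push(len);
        id
    }

    /// Model a call site whose callee returns an array of length `len`:
    /// every call site receives its own fresh array.
    fn call_returning_array(&mut self, len: u32) -> ArrayId {
        self.new_array(len)
    }
}
```

Two calls to the same array-returning function then produce distinct ids, so writes through one result cannot alias the other.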

crates/noirc_evaluator/src/ssa/code_gen.rs

crates/noirc_evaluator/src/ssa/function.rs

crates/noirc_evaluator/src/ssa/inline.rs

crates/noirc_evaluator/src/ssa/node.rs

jfecher · 2023-01-13T20:51:29Z

Reverted the removal of ArraySetIds commit since the large default array size led to a large performance regression, likely due to copying these large but empty arrays later on.

guipublic · 2023-01-16T09:26:00Z

Reverted the removal of ArraySetIds commit since the large default array size led to a large performance regression, likely due to copying these large but empty arrays later on.

I don't understand why you use these big arrays. Starting from your removal of ArraySetIds commit, I tweaked the function.rs/call() method and it seems to be fine: I restored the previous behaviour when we know the function; otherwise I create fresh returned_arrays like you did, but with the correct length. (N.B. we could avoid creating fresh arrays by caching the ones we create here and reusing them when the type and length match.)

```
// Callee known at compile time: keep the previous behaviour and create
// one Result instruction per declared result type.
if let Some(func_id) = self.context.try_get_funcid(func) {
    let rtt = self.context.functions[&func_id].result_types.clone();
    let mut result = Vec::new();
    for (i, typ) in rtt.iter().enumerate() {
        result.push(self.context.new_instruction(
            node::Operation::Result { call_instruction, index: i as u32 },
            *typ,
        )?);
    }
    return Ok(result);
}

// Callee unknown: create fresh returned arrays, sized from the return type.
let result_ids = try_vecmap(return_types, |(i, typ)| {
    let result = Operation::Result { call_instruction, index: i as u32 };
    let typ = match typ {
        Type::Array(len, elem_type) => {
            let elem_type = self.context.convert_type(&elem_type);
            let array_id =
                self.context.new_array("", elem_type, len as u32, None).1;
            returned_arrays.push((array_id, i as u32));
            ObjectType::Pointer(array_id)
        }
        other => self.context.convert_type(&other),
    };

    self.context.new_instruction(result, typ)
});
```

jfecher · 2023-01-17T16:43:35Z

This PR should be finished now. I did have to re-add returned_arrays, as higher-order functions that return arrays would otherwise be tracked incorrectly, leading to the final SSA storing to a different array than the one it later loaded from.

guipublic

It's fine for me

jfecher added 18 commits November 8, 2022 13:53

Finish most of partial evaluator pass

f0a1175

Get all tests working

c653f60

Remove debug println

60405db

Merge master

d11d686

Implement first class functions

6760fe7

Fix clippy

757a180

Fix for loop desugaring

c951875

Add Shared expression as an optimization

6dad4d4

Merge partial evaluator branch

338538e

Revert 6_array test

fdb9539

Fix method resolution

8183cdd

Fix comptime evaluation of for loops

cefc379

Fix merge conflicts

7bee007

cargo fmt

24d729f

Remove debug printout

b40ad17

Merge branch 'master' into jf/hof

7ff0aa3

Fix short-circuiting bug

f2f987b

Fix copying arrays of main parameters

dc07ee5

guipublic requested changes Nov 21, 2022

jfecher mentioned this pull request Nov 28, 2022

Implement traits #527

Closed

jfecher added 5 commits November 29, 2022 13:46

Remove partial evaluator

68b6ee5

Resolve merge conflicts

db9f643

Start evaluator changes

f6ede68

Fix compile errors

5a43419

Generate functions when referenced by a variable

e6523d0

jfecher force-pushed the jf/hof branch from 9af4bf9 to e6523d0 Compare November 30, 2022 21:18

Start working on supporting builtin and lowlevel functions

4dc9218

jfecher force-pushed the jf/hof branch from 755fe01 to 4dc9218 Compare December 1, 2022 18:11

guipublic requested changes Dec 2, 2022

Fix test fail caused by new panic when converting array types. Fix ca…

abcfcda

…ll stack being hidden on test failure

guipublic requested changes Jan 11, 2023

Add function argument count check

0674621

jfecher commented Jan 11, 2023

jfecher added 5 commits January 11, 2023 13:05

Code review; remove printlns

98c9560

Fix merge conflicts

41a0037

Remove returned_arrays and ArraySetId tracking

ee0c4e7

Fix merge conflicts

6a06443

Revert "Remove returned_arrays and ArraySetId tracking"

1382d81

This reverts commit ee0c4e7.

jfecher added 6 commits January 17, 2023 08:45

Revert "Revert "Remove returned_arrays and ArraySetId tracking""

c5b8d66

This reverts commit 1382d81.

Fix Results handling and add function name to debug output

3df1183

Some code review

6225027

Fix last bug introduced by the function name commit

dd764ba

Revert inlining changes

4001386

Re-add returned_arrays. Higher-order functions that return arrays are…

c4f58ff

… incorrectly tracked otherwise

Dont update call graph for higher order functions

31f05b6

guipublic approved these changes Jan 17, 2023

vezenovm approved these changes Jan 17, 2023

jfecher merged commit 3c3dffb into master Jan 17, 2023

jfecher deleted the jf/hof branch January 17, 2023 19:17

This was referenced Jan 23, 2023

Add functions as first class #310

Closed

Implement first-class functions #467

Closed
