[MORAE] Make new codebase2amr endpoint that uses the Linespan endpoint #621

Closed
Tracked by #632
Free-Quarks opened this issue Nov 7, 2023 · 0 comments · Fixed by #637
This is an additional endpoint, perhaps called `llm-assisted-code2amr`, which will use the existing Linespan endpoint to sub-select only the code relevant to an AMR extraction and send that through the code-snippets-to-AMR pipeline.
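The flow described above can be sketched end-to-end. Note that the base URL, endpoint paths, and payload shapes below are assumptions for illustration, not the actual skema.rest API:

```python
import json
import urllib.request  # stdlib HTTP client; the endpoints below are hypothetical

SKEMA = "http://localhost:8000"  # hypothetical base URL for the unified service


def slice_to_linespan(source: str, start: int, end: int) -> str:
    """Keep only the lines inside a 1-indexed, inclusive linespan."""
    return "\n".join(source.splitlines()[start - 1:end])


def _post_json(path: str, payload: dict) -> dict:
    req = urllib.request.Request(
        SKEMA + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def llm_assisted_code2amr(source: str) -> dict:
    # 1. Ask the LLM-backed linespan endpoint where the model dynamics live.
    span = _post_json("/linespan", {"code": source})  # hypothetical path/shape
    # 2. Sub-select the code to only the relevant region.
    snippet = slice_to_linespan(source, span["start"], span["end"])
    # 3. Send the snippet through the existing code-snippets-to-AMR pipeline.
    return _post_json(
        "/code-snippets-to-pn-amr",  # hypothetical path/shape
        {"files": ["snippet.py"], "blobs": [snippet]},
    )
```

The slicing step is the key idea: only the linespan the LLM identifies as model dynamics reaches the snippet pipeline.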

Some notes:

  • We want both this endpoint and the Linespan endpoint available in the unified service. TA-4 is also interested in using the linespan endpoint on its own, for sending things to our snippets endpoint via the HMI. This is also why the linespan endpoint's output has the shape it does: they already had support for that data structure.
  • The linespan endpoint currently uses GPT-3.5 for the extraction. This is temporary until we replace it with our own model that operates on function networks. A downside of the LLM approach (besides response time) is that it operates on the source code itself, and currently on only one code file at a time. So despite us calling this codebase2amr, it will only work on one file in the zip until we swap in our own model. I didn't think it was worth the effort to engineer it to handle a codebase of arbitrary size, since we will hopefully be replacing it soon, but that is an option and worth noting.
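Until the bespoke model lands, a caller can respect the one-file limit by pulling a single code file out of the zip before invoking the endpoint. A minimal sketch, where the extension heuristic is an assumption rather than SKEMA's actual selection logic:

```python
import io
import zipfile


def first_code_file(zip_bytes: bytes,
                    exts: tuple = (".py", ".f90", ".c")) -> tuple:
    """Return (name, contents) of the first code file found in the archive.

    The extension list is a placeholder heuristic; the real service may
    choose the file differently.
    """
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as zf:
        for name in zf.namelist():
            if name.endswith(exts):
                return name, zf.read(name).decode("utf-8")
    raise ValueError("no code file found in archive")
```

This keeps the single-file assumption explicit at the call site instead of silently dropping the rest of the codebase.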
vincentraymond-ua added a commit that referenced this issue Nov 14, 2023
## Summary of changes
Adds a new workflow endpoint to skema.rest
`llm-assisted-codebase-to-pn-amr` that slices the source code based on
model-dynamics linespans determined by an LLM. This greatly increases the
accuracy of AMR generation.

Enables support for generating AMR for the CHIME-SIR model, which was
previously failing with the normal `codebase-to-pn-amr` endpoint.

Adds a basic test case for testing CHIME-SIR->AMR generation.

Resolves #621
Resolves #628

---------

Co-authored-by: Justin <[email protected]>
github-actions bot added a commit that referenced this issue Nov 14, 2023 · e740ac1