Policy2code widget #1782

leehengpan · 2024-09-11T01:33:29Z

Fixes #1749
Fixes #430
Fixes #1851
Fixes #508
Fixes #1855
Fixes #1856

This PR creates a new endpoint /tracer_analysis, that enables household tracer analysis output to be analyzed via Claude. It also modifies PolicyEngineCountry.calculate to create a local tracer and saves that into a new purpose-built database table. It creates a function to parse the resulting tracer output for a particular variable's trace.

It also renames /analysis to /simulation_analysis to represent the fact that we now have two AI-driven endpoints. It refactors the former /analysis endpoint to make the function we use to interface with Claude more modular, and it re-enables streaming on this endpoint by returning a ReadableStream instead of a series of textual messages.

Due to the fact that this renames the /analysis endpoint, this must be merged at the same time as PolicyEngine/policyengine-app#2010. This will be kept in draft until that is ready.

anth-volk

@leehengpan Thanks for this. Everything looks good so far, all that remains is actually writing to the local database so that we're able to pull the output on the front end.

PavelMakarchuk · 2024-09-15T13:17:24Z

@leehengpan Thanks for this. Everything looks good so far, all that remains is actually writing to the local database so that we're able to pull the output on the front end.

I took a stab at this for time purposes - not sure if the test etc. is implemented correctly - lmk what you think

anth-volk

Thanks for this @leehengpan. I've noted some places where I think edits would be necessary.

Also, I want to note that we'll have to do two things at the end:

Update the added test
Delete tracer_output_outer_function.json, though it may be helpful as we create the parsing function

anth-volk · 2024-09-19T16:20:10Z

policyengine_api/api.py

@@ -117,6 +118,10 @@

 app.route("/<country_id>/user_profile", methods=["PUT"])(update_user_profile)

+app.route("/<country_id>/tracer_analysis", methods=["GET"])(
+    trigger_tracer_analysis


While this is a valid name, it doesn't follow standard naming conventions. Could you instead call this get_tracer_analysis?

anth-volk · 2024-09-19T16:21:29Z

policyengine_api/country.py

+# write a recursive function here that, when there is an adds and/or a subtracts, calls get_all_variables on that next tier downward, until eventually you hit some marker of there being no more levels.
+
+
+def get_all_variables(


All of the below function (and its comments) should be deleted

anth-volk · 2024-09-19T16:21:50Z

policyengine_api/data/initialise.sql

+  country_id VARCHAR(3) NOT NULL,
+  api_version VARCHAR(10) NOT NULL,
+  tracer_output JSON NOT NULL,
+--   variable_name VARCHAR(255) NOT NULL


We can fully delete this line

anth-volk · 2024-09-19T16:22:01Z

policyengine_api/data/initialise_local.sql

+  country_id VARCHAR(3) NOT NULL,
+  api_version VARCHAR(10) NOT NULL,
+  tracer_output JSON NOT NULL,
+--   variable_name VARCHAR(255) NOT NULL


We can delete here, as well

policyengine_api/endpoints/tracer_analysis.py

anth-volk

Thanks for this @leehengpan. I've suggested one edit and briefly made another, and I can explain the one I made more in detail if you'd like.

anth-volk · 2024-10-02T13:39:13Z

policyengine_api/country.py

+        tracer_output = simulation.tracer.computation_log
+        log_lines = tracer_output.lines(aggregate=False, max_depth=10)
+        log_json = json.dumps(log_lines)
+


Could you add an if statement here to make sure that household_id and policy_id exist before trying to write to the db? You'll notice they're optional params in the function signature above.

anth-volk · 2024-10-02T17:21:42Z

policyengine_api/endpoints/tracer_analysis.py

+    {anthropic.AI_PROMPT}"""
+
+    # get prompt_id
+    prompt_id = local_database.query(


Unfortunately, this piece of code won't work. What you would have to do is first check if the prompt is already stored in the db. If it is, then use that prompt_id; if not, store it, then fetch the prompt_id.

I think the best resolution will be to modify get_analysis as follows:

Add an optional param to get_analysis that is the prompt, with a default value of None

If the prompt is passed as an arg, treat that as the prompt, else if it's within the request args, use that, else it's equal to None

Basically, we're adding one additional conditional way of passing the prompt to the controller. Then, you can remove the code getting the prompt_ID value and just add the prompt to the code as follows:
analysis = get_analysis(country_id, prompt=prompt)

anth-volk · 2024-10-02T17:40:42Z

policyengine_api/endpoints/tracer_analysis.py

+    tracer_segment = parse_tracer_output(row["tracer_output"], variable)
+
+    # TODO: Add the parsed tracer output to the prompt
+    prompt = tracer_analysis_prompt.format(


I took the liberty of moving the prompt into a new folder and using a formatting trick to pass the values into a f-string. This improves separation of concerns and makes it easier to modify the prompt itself, if necessary.

Gotcha. Thank you, Anthony.

anth-volk · 2024-10-02T17:40:56Z

policyengine_api/endpoints/tracer_analysis.py

+    # TODO: Call get_analysis with the complete prompt
+    analysis = get_analysis(country_id, prompt_id)
+
+    if row is not None:


I understand from here down is legacy code, right?

Yes, except for the get_analysis and prompt_id lines.

Ah, right, my bad, I meant for that comment to be two lines lower

Modifying 'calculate' function to save tracer log

nikhilwoodruff

LGTM after one minor comment

changelog_entry.yaml

anth-volk · 2024-10-09T17:45:36Z

This should be good. Will merge after tests pass.

anth-volk · 2024-10-09T19:43:44Z

Tests have passed. Merging. Once this launches, the front-end changes will also be merged.

anth-volk mentioned this pull request Sep 11, 2024

Add tracer table to data/initialise.sql and data/initialise_local.sql #1758

Closed

anth-volk reviewed Sep 13, 2024

View reviewed changes

anth-volk force-pushed the policy2code_widget branch from 1225c59 to b337065 Compare September 19, 2024 16:17

anth-volk requested changes Sep 19, 2024

View reviewed changes

leehengpan requested a review from anth-volk September 23, 2024 20:47

anth-volk requested changes Oct 2, 2024

View reviewed changes

anth-volk self-requested a review October 4, 2024 17:11

leehengpan and others added 22 commits October 7, 2024 21:50

Update api.py

01b2dc6

Update country.py

be373b6

Update initialise.sql

b0010dd

Update initialise_local.sql

51b6f02

Update api.py

6c6b3ec

Create tracer.py

a663da6

Update country.py

b4d7361

Update country.py

df9b7b2

Update country.py

7cf8f66

Update country.py

af3d514

Update country.py

6dadb6b

Update country.py

e5186b0

Update country.py

36db926

fix: Handle list params

34d7114

chore: Remove inaccurate testing comment

f438777

Update country.py

53f36a9

Modifying 'calculate' function to save tracer log

local db

0c821d3

format

8f6b248

adj. tarcer

882fb4d

test adj.

53e3604

more accurate test

3c60900

adj. test - remove commit()

9d7c284

leehengpan and others added 15 commits October 7, 2024 21:55

Added optional argument 'prompt' to get_analysis

06a7b74

fix: Remove unnecessary commas in SQL assertions

ac47c99

feat: Begin to refactor main analysis endpoint; DO NOT MERGE

874ecbd

feat: Create prompt for execute_simulation_analysis; DO NOT MERGE

b40a459

feat: Add streaming response for existing analysis

1aafcef

feat: Streaming response for AI analysis, refactoring, etc.

0f32b12

fix: Properly store data; allow existing analysis to return prompt

5b44dba

fix: Rename generator func from get_existing_analysis

57b4ffc

feat: Use new util funcs in execute_tracer_analysis

82314db

fix: Add API version when selecting tracer

72d9ece

fix: Properly load tracer output

e0b5273

feat: Add error handling

cd4e83f

fix: Properly reference chart data

3ece09b

test: Add tests

3c2cd01

chore: Changelog and lint

38a75b0

anth-volk force-pushed the policy2code_widget branch from 0afdb32 to 38a75b0 Compare October 7, 2024 19:59

chore: Format

1f1cf2f

anth-volk marked this pull request as draft October 7, 2024 20:08

anth-volk mentioned this pull request Oct 7, 2024

Add info button next to each benefit PolicyEngine/policyengine-app#2010

Merged

test: Rewrite simulation_analysis tests

c452d2a

anth-volk mentioned this pull request Oct 7, 2024

Return more accurate status codes in get_analysis #1845

Closed

fix: Make household_id optional

0cf9fcb

anth-volk marked this pull request as ready for review October 8, 2024 17:46

anth-volk requested a review from nikhilwoodruff October 8, 2024 17:47

nikhilwoodruff approved these changes Oct 9, 2024

View reviewed changes

changelog_entry.yaml Show resolved Hide resolved

fix: Rename analysis endpoints to match RESTful requirements

4b47616

fix: Update test to use new endpoint

4b1bae4

anth-volk merged commit 5862b17 into master Oct 9, 2024
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Policy2code widget #1782

Policy2code widget #1782

leehengpan commented Sep 11, 2024 •

edited by anth-volk

Loading

anth-volk left a comment

PavelMakarchuk commented Sep 15, 2024

anth-volk left a comment

anth-volk Sep 19, 2024

anth-volk Sep 19, 2024

anth-volk Sep 19, 2024

anth-volk Sep 19, 2024

anth-volk left a comment

anth-volk Oct 2, 2024

leehengpan Oct 2, 2024

anth-volk Oct 2, 2024

anth-volk Oct 2, 2024

leehengpan Oct 2, 2024

anth-volk Oct 2, 2024

leehengpan Oct 2, 2024

anth-volk Oct 2, 2024

nikhilwoodruff left a comment

anth-volk commented Oct 9, 2024

anth-volk commented Oct 9, 2024

		# write a recursive function here that, when there is an adds and/or a subtracts, calls get_all_variables on that next tier downward, until eventually you hit some marker of there being no more levels.


		def get_all_variables(

Policy2code widget #1782

Policy2code widget #1782

Conversation

leehengpan commented Sep 11, 2024 • edited by anth-volk Loading

anth-volk left a comment

Choose a reason for hiding this comment

PavelMakarchuk commented Sep 15, 2024

anth-volk left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

anth-volk left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nikhilwoodruff left a comment

Choose a reason for hiding this comment

anth-volk commented Oct 9, 2024

anth-volk commented Oct 9, 2024

leehengpan commented Sep 11, 2024 •

edited by anth-volk

Loading