hcl2template: rework parsing logic #12607

lbajolet-hashicorp · 2023-08-24T18:49:58Z

In previous versions of Packer, the parser would load a HCL file (in either HCL or JSON format), and process it in phases. This process meant that if the file was not valid in the first place, we'd have some duplicate errors first.

Another thing is that sources and build blocks were not being parsed at the beginning of the parsing phase at all, but were discovered later in the process, when the data sources have been executed and variables have been evaluated.

This commit changes the parsing to happen all at once, so we parse the files with the parser, which will fail should one file be malformatted, then we visit the body once using the top-level schema for a Packer build template, handling misplaced blocks once in the process.

Then, since the builds and sources may contain some dynamic blocks that need to be expanded once their context is ready (i.e. the variables/datasources they depend on), we can expand the build block for them, and populate their respective objects.

This last step should also be adapted for datasources later, as they may need to support dynamic blocks, and be expanded lazily.

nywilken

Overall this looks good to me. I left a few suggestions to start.

hcl2template/types.build.go

hcl2template/parser.go

hcl2template/types.build.go

nywilken · 2023-10-03T15:28:59Z

hcl2template/types.build.go

+		return nil
+	}
+
+	build.ready = true


Should this be set to true after all has been decoded? I see there are a few early returns if there are errors. In that case does this attribute represent that the build block has been fully decoded or that we've called finalizeDocode on it already?

At first I took it to mean that the build block is good to go and has been fully decoded. But I wonder how we capture the case where build.decoded is true but it decoded with error. If Packer fails then this case is probably moot.

If the build block fails to be decoded, it means some of its dependencies wasn't properly executed before, so the build should fail at that point. Here, we can consider it decoded after running it once, as it's not supposed to be executed multiple times.
If the decoding fails, so will Packer.

nywilken · 2023-10-03T16:04:19Z

hcl2template/types.packer_config_test.go

+						Name: "ubuntu-1204",
+					},
+				},
+				Builds: Builds{


It is not entirely clear to me why we need to initialize Builds and Sources with the rework. Is it because the calls to finalizeDecode requires valid blocks?

Yeah, since the decode functions are part of the source or build blocks, we need them initialised with a reference to the block they're coming from.
This changes the output for the parsing tests, but fundamentally this doesn't change how Packer behaves from a user standpoint.
Though maybe if there's a structural issue with the source or build block, it will get caught as early as the parsing, and not after the datasources are executed.

Edit: tested this, and no, even with this change, if a build block is fundamentally invalid, we still execute them first. This might be fixable though, I'll see what I can do for this.

hcl2template/parser.go

nywilken · 2023-10-03T16:12:58Z

hcl2template/plugin.go

+		// Since the build's contents may not have been dynamically
+		// expanded when we first loaded the config from the template
+		// file, we decode it now.
+		diags = append(diags, build.finalizeDecode(cfg)...)


I feel like there is some important context missing here. In the decodeFile you have a comment for the BuildBlock and SourceBlock cases that reads "The final decoding step (which requires an up-to-date context) will be done when we need it." but it is not clear form this line that the time is now.

Also correct me if I'm wrong but a call to finalilzeDecode requires that the buildBlock has been decoded at least once. So calling build.finalizeDecode on an zero value BuildBlock would do what?

It might be my understanding but there feels like there is a dependency chain that is not captured in the comments or in the type itself.

So the build block is not completely decoded at this point, we only partially decode its top-level contents, and we certainly didn't try to expand the potential dynamic blocks inside it.
Same for the source blocks.

Regarding this comment, it's something I did not change, and that was how things worked before too. This initializeBlocks function basically expands the dynamic blocks, and filling-in the blanks in the structures that Packer uses.
At that point we do the decoding for all those, like provisioners, post-processors, and the sources themselves.

That being said, the may indicates that we could try before, but in the current implementation, this is very much not the case, we only get basic information from the blocks, and we need to call the finalizeDecode function on all the build and source blocks.

I'll change the phrasing here to remove this confusion, but does my explanation clarifies things for you?

Come to think of it, I feel that the confusion stems from the vocabulary not being clear, so I'll leave this:

decode (parser): lift the configuration from the HCL file and start building the components for the Packer config. At this point, we have not evaluated anything, nor should we. This step's purpose is only to parse the HCL, check if it is valid, and that there are no unexpected blocks/attributes in the config (for those we do know in advance what the format is).

evaluateVariables/executeDatasources: there, we decode (as gohcl.Decode) the configuration for the entity, and send it to the component responsible, which effectively executes it. For variables there's no execution here, only expressions to evaluate, hence the difference in terminology.

finalizeDecode (later): there, we expand the dynamic blocks where applicable, and we do as for datasources, we decode (as in gohcl.Decode), and send those configs to the plugins for validation.

build: finally, once everything passed the validation and we have *packer.CoreBuild objects, we can then execute the builds.

With this in mind, I think the decode name is not representative of what is done during parsing. What would you think of some alternatives like lift or load? This would make it less likely to have a conflict in naming, and the finalizeDecode functions can then becode decode only, as it's what they essentially do.

I struggle in finding an appropriate name for this step of preparation of the structures, so feel free to suggest anything else, I'm eager to see what you think may fit the purpose better.

In previous versions of Packer, the parser would load a HCL file (in either HCL or JSON format), and process it in phases. This process meant that if the file was not valid in the first place, we'd have some duplicate errors first. Another thing is that sources and build blocks were not being parsed at the beginning of the parsing phase at all, but were discovered later in the process, when the data sources have been executed and variables have been evaluated. This commit changes the parsing to happen all at once, so we parse the files with the parser, which will fail should one file be malformatted, then we visit the body once using the top-level schema for a Packer build template, handling misplaced blocks once in the process. Then, since the builds and sources may contain some dynamic blocks that need to be expanded once their context is ready (i.e. the variables/datasources they depend on), we can expand the build block for them, and populate their respective objects. This last step should also be adapted for datasources later, as they may need to support dynamic blocks, and be expanded lazily.

vercel · 2023-10-03T19:14:06Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Ignored Deployment

Name	Status	Preview	Comments	Updated (UTC)
packer	⬜️ Ignored (Inspect)	Visit Preview		Oct 3, 2023 7:14pm

lbajolet-hashicorp requested a review from a team as a code owner August 24, 2023 18:49

lbajolet-hashicorp mentioned this pull request Aug 24, 2023

Datasource logic cleanup #12608

Merged

lbajolet-hashicorp mentioned this pull request Sep 1, 2023

Datasource eval breakdown #12615

Closed

nywilken reviewed Sep 13, 2023

View reviewed changes

hcl2template/types.build.go Outdated Show resolved Hide resolved

hcl2template/parser.go Show resolved Hide resolved

hcl2template/parser.go Outdated Show resolved Hide resolved

lbajolet-hashicorp force-pushed the parser_rework branch 2 times, most recently from 8759aba to 8e61cde Compare September 19, 2023 20:58

nywilken reviewed Oct 3, 2023

View reviewed changes

lbajolet-hashicorp force-pushed the parser_rework branch from 8e61cde to c1c1577 Compare October 3, 2023 19:14

hc-github-team-packer mentioned this pull request Oct 5, 2023

Backport of Datasource logic cleanup into release/1.9.x #12645

Merged

nywilken added this to the 1.12.0 milestone May 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hcl2template: rework parsing logic #12607

hcl2template: rework parsing logic #12607

lbajolet-hashicorp commented Aug 24, 2023

nywilken left a comment

nywilken Oct 3, 2023

lbajolet-hashicorp Oct 3, 2023

nywilken Oct 3, 2023

lbajolet-hashicorp Oct 3, 2023 •

edited

Loading

nywilken Oct 3, 2023

lbajolet-hashicorp Oct 3, 2023

lbajolet-hashicorp Oct 3, 2023

vercel bot commented Oct 3, 2023 •

edited

Loading

hcl2template: rework parsing logic #12607

Are you sure you want to change the base?

hcl2template: rework parsing logic #12607

Conversation

lbajolet-hashicorp commented Aug 24, 2023

nywilken left a comment

Choose a reason for hiding this comment

nywilken Oct 3, 2023

Choose a reason for hiding this comment

lbajolet-hashicorp Oct 3, 2023

Choose a reason for hiding this comment

nywilken Oct 3, 2023

Choose a reason for hiding this comment

lbajolet-hashicorp Oct 3, 2023 • edited Loading

Choose a reason for hiding this comment

nywilken Oct 3, 2023

Choose a reason for hiding this comment

lbajolet-hashicorp Oct 3, 2023

Choose a reason for hiding this comment

lbajolet-hashicorp Oct 3, 2023

Choose a reason for hiding this comment

vercel bot commented Oct 3, 2023 • edited Loading

lbajolet-hashicorp Oct 3, 2023 •

edited

Loading

vercel bot commented Oct 3, 2023 •

edited

Loading