TraceQL: Nested set intrinsics #3497

joe-elliott · 2024-03-15T19:48:56Z

Expose nested set intrinsics in the TraceQL language. This allows calling applications to request structural details about spans.
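For readers unfamiliar with the nested set model, here is a minimal sketch of how left/right/parent values can be assigned to a span tree and then used to answer structural questions. The `node` type and the `number` and `isDescendant` helpers are hypothetical and are not taken from this PR. The sketch assumes that the parent value stores the parent's left number and that the root gets -1, which matches the `nestedSetParent = -1` root-span query in the tests.

```go
package main

import "fmt"

// node is a hypothetical span tree node used only for this illustration.
type node struct {
	name     string
	children []*node

	// Nested set values, filled in by number().
	left, right, parent int32
}

// number walks the tree depth-first, assigning left on the way down and
// right on the way back up. parent is the left value of the parent node
// (an assumption of this sketch); the root is called with parent = -1.
func number(n *node, parent int32, counter *int32) {
	*counter++
	n.left = *counter
	n.parent = parent
	for _, c := range n.children {
		number(c, n.left, counter)
	}
	*counter++
	n.right = *counter
}

// isDescendant reports whether d's interval lies strictly inside a's.
func isDescendant(d, a *node) bool {
	return a.left < d.left && d.right < a.right
}

func main() {
	db := &node{name: "db"}
	auth := &node{name: "auth"}
	api := &node{name: "api", children: []*node{db}}
	root := &node{name: "root", children: []*node{auth, api}}

	var counter int32
	number(root, -1, &counter)

	for _, n := range []*node{root, auth, api, db} {
		fmt.Printf("%-5s left=%d right=%d parent=%d\n", n.name, n.left, n.right, n.parent)
	}
	fmt.Println("db under root:", isDescendant(db, root))
	fmt.Println("db under auth:", isDescendant(db, auth))
}
```

With values like these, a descendant check becomes a pair of integer comparisons, which is why exposing them as intrinsics makes structural queries cheap to express.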

mdisibio

Agree, there are some really promising queries that can be answered by accessing these columns directly. Looks very straightforward and 99% LGTM. Can you take a look at a couple of questions? Probably ok but want to check.

mdisibio · 2024-03-20T18:49:16Z

tempodb/encoding/vparquet3/block_traceql.go

@@ -1209,6 +1224,35 @@ func createSpanIterator(makeIter makeIterFn, primaryIter parquetquery.Iterator,
 		case traceql.IntrinsicStructuralSibling:
 			selectColumnIfNotAlready(columnPathSpanParentID)
 			continue
+
+		case traceql.IntrinsicNestedSetLeft:
+			nestedSetLeftExplicit = true


Can you check the behavior in this area, and see if these preds work ok with the nils added in the cases above? I'm thinking about a query that invokes both e.g. { nestedSetLeft = 1 } >> { } | select(nestedSetRight).

this works b/c the columns are stored in the columnPredicates and columnSelectAs maps.

if the explicit intrinsics hit first then they are added to the map and then the logic in selectColumnIfNotAlready prevents them from being overwritten with an OpNone condition.

if the structural intrinsics hit first then they are overwritten when the explicit intrinsics come along.

i did note and fix an issue in this area. if the structural intrinsic were to be added after the nested set intrinsic then it would have seen a non-empty predicate and not added the nil predicate. the check now tests specifically for a nil predicate.

https://github.com/grafana/tempo/pull/3497/files#diff-7f3f53f3d2f4e723f54f23778f62dfa089a61ad39f1c90ca4bf63a335aacf3a5R1144
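To make the ordering concern in this thread concrete, here is a rough sketch, not Tempo's actual code. The `predicate` and `columnPredicates` types and their methods are hypothetical. The sketch assumes that a nil predicate means "read the column unfiltered" and that several predicates can be recorded per column; it appends rather than overwrites, which is a simplification. The point it illustrates is the fix described above: the structural path checks for an existing nil predicate rather than for any predicate at all.

```go
package main

import "fmt"

// predicate is a hypothetical stand-in for a column filter.
// A nil predicate means "read the column without filtering".
type predicate func(v int32) bool

// columnPredicates maps a column path to the predicates requested for it.
type columnPredicates map[string][]predicate

// add records an explicit predicate (or nil) for a column.
func (c columnPredicates) add(path string, p predicate) {
	c[path] = append(c[path], p)
}

// hasNil reports whether the column already has an unfiltered read.
func (c columnPredicates) hasNil(path string) bool {
	for _, p := range c[path] {
		if p == nil {
			return true
		}
	}
	return false
}

// selectIfNotAlready requests an unfiltered read unless one already exists.
// Checking only for "any predicate present" would skip the nil predicate
// when an explicit intrinsic had been registered first.
func (c columnPredicates) selectIfNotAlready(path string) {
	if !c.hasNil(path) {
		c.add(path, nil)
	}
}

func main() {
	const left = "NestedSetLeft" // hypothetical column path

	eqOne := func(v int32) bool { return v == 1 }

	// Explicit intrinsic first, structural second.
	a := columnPredicates{}
	a.add(left, eqOne)
	a.selectIfNotAlready(left)

	// Structural first, explicit second.
	b := columnPredicates{}
	b.selectIfNotAlready(left)
	b.add(left, eqOne)

	fmt.Println(len(a[left]), a.hasNil(left))
	fmt.Println(len(b[left]), b.hasNil(left))
}
```

In this simplified model both orderings end up with the explicit predicate and one unfiltered read on the column, so a query that both filters on and structurally uses the same column sees all the values it needs.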

mdisibio · 2024-03-20T18:50:51Z

tempodb/tempodb_search_test.go

+		},
+		// fun way to get the root span
+		{
+			req: &tempopb.SearchRequest{Query: "{ nestedSetParent = -1 } | select(name)"},


Can you add a case for when we dual-purpose read these columns? Similar to the other comment, a query like: {nestedSetParent = -1} >> {} | select(name)

so there is kind of a bug here, but it exists for all spanset operators. for instance this:

{ span.foo = "bar" } >> {}

Will return the value of the attribute "foo" on every matched span even though the condition is on the LHS of the operator. I'm not sure if this is a bug or not.

Agree it sounds like a bug. But pre-existing so ok for this PR, if I understand you correctly.

mdisibio · 2024-03-20T18:59:23Z

tempodb/encoding/vparquet3/block_traceql.go

@@ -141,6 +141,16 @@ func (s *span) AttributeFor(a traceql.Attribute) (traceql.Static, bool) {
 	}

 	if a.Intrinsic != traceql.IntrinsicNone {
+		if a.Intrinsic == traceql.IntrinsicNestedSetLeft {


Do you know if these extra checks are enough to show up in benchmarks, or would a switch be ok here? Wondering at what point this method needs another overhaul. For instance there is duplicate logic for .foo fallback to check resource-level and then span-level, between here and engine.

i will check benches but would guess these are not visible. don't mind swapping to a switch either for aesthetic or perf reasons if that's preferred.

are switches faster than a series of ifs?
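One way to answer the if-versus-switch question empirically is a small standalone benchmark. The sketch below is an assumption-laden toy: `intrinsic`, `span`, `lookupIf` and `lookupSwitch` are hypothetical stand-ins, not the real `AttributeFor` method. As far as I know the Go compiler is free to lower a dense integer switch into a binary search or jump table, while an if chain is tested in order; with only three or four cases the difference is often too small to measure, so the numbers should be read with care.

```go
package main

import (
	"fmt"
	"testing"
)

// intrinsic is a hypothetical enum standing in for traceql.Intrinsic.
type intrinsic int

const (
	intrinsicNone intrinsic = iota
	intrinsicNestedSetLeft
	intrinsicNestedSetRight
	intrinsicNestedSetParent
)

// span holds only the fields needed for this illustration.
type span struct {
	left, right, parent int32
}

// lookupIf resolves an intrinsic with a chain of ifs.
func lookupIf(s *span, i intrinsic) (int32, bool) {
	if i == intrinsicNestedSetLeft {
		return s.left, true
	}
	if i == intrinsicNestedSetRight {
		return s.right, true
	}
	if i == intrinsicNestedSetParent {
		return s.parent, true
	}
	return 0, false
}

// lookupSwitch resolves the same intrinsics with a switch.
func lookupSwitch(s *span, i intrinsic) (int32, bool) {
	switch i {
	case intrinsicNestedSetLeft:
		return s.left, true
	case intrinsicNestedSetRight:
		return s.right, true
	case intrinsicNestedSetParent:
		return s.parent, true
	}
	return 0, false
}

// sink keeps the compiler from discarding the lookups.
var sink int32

// bench times f over a rotating set of intrinsics. Calling through a
// function value adds overhead, so only the relative numbers matter.
func bench(f func(*span, intrinsic) (int32, bool)) testing.BenchmarkResult {
	s := &span{left: 1, right: 8, parent: -1}
	return testing.Benchmark(func(b *testing.B) {
		for n := 0; n < b.N; n++ {
			v, _ := f(s, intrinsic(n%4))
			sink += v
		}
	})
}

func main() {
	fmt.Println("if chain:", bench(lookupIf))
	fmt.Println("switch:  ", bench(lookupSwitch))
}
```

A microbenchmark like this only shows the cost of the dispatch itself; whether it is visible in the real query path still needs the existing benches.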

Signed-off-by: Joe Elliott <[email protected]>

joe-elliott requested review from annanay25, mdisibio, mapno, yvrhdn, zalegrala, electron0zero, ie-pham and stoewer as code owners March 15, 2024 19:48

mdisibio reviewed Mar 20, 2024

joe-elliott force-pushed the nested-set-queries branch from 21f3f62 to 48b159b Compare March 21, 2024 14:05

joe-elliott added 7 commits March 21, 2024 10:06

add nested set

55227a1

Signed-off-by: Joe Elliott <[email protected]>

only return nested set params if explicitly requested

320f078

Signed-off-by: Joe Elliott <[email protected]>

tests, tests, tests !

03bb60e

Signed-off-by: Joe Elliott <[email protected]>

lint

e23f963

Signed-off-by: Joe Elliott <[email protected]>

changelog

1f03aa4

Signed-off-by: Joe Elliott <[email protected]>

added/fixed tests

f3a0609

Signed-off-by: Joe Elliott <[email protected]>

fix predicate adding

104cf8d

Signed-off-by: Joe Elliott <[email protected]>

joe-elliott force-pushed the nested-set-queries branch from 48b159b to 104cf8d Compare March 21, 2024 14:06

mdisibio approved these changes Mar 21, 2024

joe-elliott merged commit 3d2850d into grafana:main Mar 21, 2024
14 checks passed

stoewer added a commit that referenced this pull request Apr 12, 2024

Backport: Nested set intrinsics (#3497)

1db4a1c

stoewer added a commit to stoewer/tempo that referenced this pull request Apr 16, 2024

Backport: TraceQL nested set intrinsics (grafana#3497)

0587418
