Find Errant GTIDs #6296

PrismaPhonic · 2020-06-10T07:23:49Z

This PR adds a function called FindErrantGTIDs, which finds the errant GTIDs for a given replica, provided the slave statuses of all other replicas are supplied as well.

This includes:

A Difference method on Mysql56GTIDSet, which can be used to find the difference between the receiver GTIDSet and the supplied GTIDSet.
A FindErrantGTIDs method on SlaveStatus, which can tell us the receiver's errant GTIDs when we've supplied a comprehensive list of all known SlaveStatus values for replicas of the same shard. I chose to make this a method on SlaveStatus rather than a standalone function because I believe it makes it clearer that we are finding the errant GTIDs of the receiver SlaveStatus.

Related Issue: #6206

shlomi-noach

Hi @PrismaPhonic! Saw your request for preliminary reviews for this draft.
Thought I'd share a bit of insight since I similarly developed errant GTID detection in orchestrator (work spread over openark/orchestrator#607, openark/orchestrator#617, openark/orchestrator#707 and probably more).

I apologize in advance that this turns a bit lengthy.

The first thing to consider is what the reference (base, or other) GTID set is and what it means.

Naively, that would be the master of the replica being examined: compare GTIDs with the immediate master and get the errant value.
This unfortunately is only half correct, since the immediate master may itself be a replica. The immediate master may itself have errant GTIDs, thereby making all of its own replicas (including the one we're looking at) have errant GTIDs. But as compared with its direct master, maybe our replica does not have an errant GTID. It is a matter of definition whether our replica does or does not have errant GTIDs. That may depend on what operations we wish to perform. Some refactoring operations involving our replica may make the problem worse, others won't.
Compare with the "real" master of the topology? This is more correct, because the master is the source of truth. But then, we may want to understand where the errant transaction originated: on our replica, or on its immediate master or any of its ancestors? That brings us back to the previous point.

Whichever we decide, this leads us to the question of "when and how" we collected the executed_gtid_set for our replica and for the base.
Say we first sample the base. For simplicity, let's assume it's the direct master. If we happen to first sample the master and then the replica, it's possible that between the two samples new transactions were applied on the master and propagated to the replica. In that case, by the time we read the GTID set from the replica, it appears to have an errant transaction, although in reality nothing is wrong.

This is why, in evaluating the diff, we must disregard any part of the GTID set that contains the UUID of the replica's master or any of its ancestors. It is safe to skip a UUID that belongs to an ancestor because there is no way for our replica to have an errant transaction with such a UUID. The transaction must have originated from an ancestor, therefore it is applied on the ancestor.

So all this lengthy preface is to note that the other GTIDSet should come with context, and that a difference between two GTID sets does not by itself imply an errant transaction.

And all of the above may be premature, since the current code merely runs a Difference method; but I thought I'd throw this out as a heads-up.

Asking out of complete ignorance: in orchestrator my choice was to not evaluate the difference in code, but instead delegate it to MySQL by executing GTID_SUBTRACT(). The line of thought was that if I wanted to run an operation on some replica and was able to read its executed_gtid_set, then I'd also have access to run GTID_SUBTRACT(). I'm merely wondering whether the same line of thought can be applied here and whether it makes sense at all to contact MySQL to evaluate the diff. Obviously, the downside is the need to connect to a MySQL server.

PrismaPhonic · 2020-06-11T07:39:03Z

Hi @shlomi-noach!

Thank you very much for your in-depth reply. Since it's pretty late over here I'll give a brief reply for now and try to expand tomorrow if necessary.

This is just a pure difference function. The intention of the FindErrantGTIDs function (to be added next) is to throw out GTIDs with the UUID of the current master, which can be determined for each replica via slave status. I'm curious how orchestrator knows definitively who all valid ancestors are?
We could potentially defer to GTID_SUBTRACT(); however, that involves more calls to MySQL. We already have to deal with the potential failure of not getting back some slave statuses. This would involve a number of calls to GTID_SUBTRACT() for each replica, and that dependence feels like a more fragile approach to me. As for executed_gtid_set, we are actually planning to do something a bit different in this case. We are going to take a union of the retrieved_gtid_set and the executed_gtid_set (because retrieved_gtid_set can be cleared for a number of reasons outside of our control), to ensure that we are making the best choice possible when considering which replica to fail over to. It's that union that we ideally want to check for errant GTIDs. I'm guessing that wouldn't make much of a difference in terms of calling GTID_SUBTRACT().

Thanks for the great reply! If you have some time for a video chat later this week, or early next week I would love to pick your brain more about this.

shlomi-noach · 2020-06-11T08:40:36Z

> The intention of the FindErrantGTIDs function (to be added next) is to throw out GTIDs with the UUID of the current master, which can be determined for each replica via slave status.

That makes sense. Also, in the context of a failover, I guess you will only be looking at 1st tier replicas? That simplifies things.

> I'm curious how orchestrator knows definitively who all valid ancestors are?

That's basically what orchestrator does first: map the topology and be able to answer questions about relations (e.g. it can print out the tree, or you can inquire about parent/children).

> that dependence feels like a more fragile approach to me.

Makes sense.

> take a union of the retrieved_gtid_set and the executed_gtid_set

Sounds good! One thing to note is whether you necessarily intend to wait for the replica to consume its relay logs (hence, its retrieved_gtid_set). Users of orchestrator have represented four different and contradictory approaches: wait forever; wait until timeout, then fail; wait until timeout, then promote; promote immediately regardless of data loss. I'm not sure whether Vitess needs to be opinionated or not.

systay · 2020-06-13T14:02:35Z

go/mysql/mysql56_gtid_set.go

+
+	// Make a fresh, empty set to hold the new value.
+	// This function is not supposed to modify the original set.
+	differenceSet := make(Mysql56GTIDSet)


If this is something that is used in the hot path, and will contain more than just a few items, you might want to consider using make with a given size, to avoid resizing the hash map.

I was thinking about that, but we don't know yet what the size of the differenceSet will be. Are you thinking that we should just make it the same size as the receiver to cover all potential SIDs we might find that have valid diffs?

systay

⭐ Really nice, well-commented code

shlomi-noach

The rewrite looks good!

shlomi-noach · 2020-06-14T06:16:10Z

go/mysql/mysql56_gtid_set_test.go

+		sid1: []interval{{20, 30}, {35, 39}, {40, 53}, {55, 75}},
+		sid2: []interval{{1, 7}, {20, 50}, {60, 70}},
+		sid4: []interval{{1, 30}},
+	}


Can you please add the following two test cases:

1. Where the result range is a single value or single-value range (e.g. {1, 7} - {2, 6})

2. Where the result range is empty (e.g. {2, 10}, {20, 30} - {1, 40})

Case 2 is technically covered by {20, 30} - {20, 30}, although that's a very trivial case. I'll include yours. Great suggestions!

Added both, and pushed up. Tests still passing :-)

…Ds method on SlaveStatus. I think this belongs here because it's clear that we are referring to ErrantGTIDs of the receiver, and it gives us access to the MasterUUID when comparing relay log positions. Also added testing to verify correctness of new method. Signed-off-by: Peter Farr <[email protected]>

deepthi

Concur with @systay, nicely written and well-commented.

deepthi · 2020-06-16T00:29:33Z

go/mysql/mysql56_gtid_set.go

+
+		// Found server id match between sets, so now we need to subtract each interval.
+		var diffIntervals []interval
+		advance := func() bool {


It might be possible to replace advance and advanceOther with a single func by passing the relevant []interval to it (and having it return the 0th element for later use).

I had wanted to do that at first. The reason I chose to use two different closures is that what we do for advance and advanceOther is different. When we advance, we always want to push a new interval onto diffIntervals. When we advanceOther, we do not. Unless I'm missing something, I don't think these can be merged into a single helper method.

That is why I suggested returning the 0th element. As of now you always call advance with intervals and advanceOther with otherIntervals. Depending on which one you are operating on, you can do different things.
For instance, the first use of a common advance func could look like this:

    if i, ok := advance(intervals); ok {
        diffIntervals = append(diffIntervals, i)
    } else {
        continue
    }
    if _, ok := advance(otherIntervals); !ok {
        differenceSet[sid] = intervals
        continue
    }

I'm not going to insist on this. If you feel the current version is more readable, that is fine.

Ahhhh I see what you're saying. Yes, we could do it this way, although then we would have this kind of code in every switch branch. To me it reads cleaner to contain this logic in advance and advanceOther, but this is certainly another way we could do it, and not a worse one. Likely just a matter of personal preference. Thanks for the feedback!

go/mysql/slave_status.go

PrismaPhonic · 2020-06-17T18:54:31Z

Thanks for all of the great feedback everyone! I've finished addressing all existing feedback.

PrismaPhonic added 2 commits June 10, 2020 00:20

Wrote a Difference method and added a unit test to check that it func…

807acd5

…tions. Unfortunately the data structure of other GTIDSets does not allow for us to represent a proper diff for those flavors. Signed-off-by: Peter Farr <[email protected]>

Clarifying comment.

bbb6148

Signed-off-by: Peter Farr <[email protected]>

PrismaPhonic requested review from enisoc and deepthi June 10, 2020 21:45

systay reviewed Jun 11, 2020

shlomi-noach reviewed Jun 11, 2020

enisoc reviewed Jun 11, 2020

PrismaPhonic added 4 commits June 11, 2020 22:39

Comment corrections.

fc586ef

Signed-off-by: Peter Farr <[email protected]>

Completely refactored logic, and added multi-split test cases that pass.

dbed330

Signed-off-by: Peter Farr <[email protected]>

Significant refactor to make code easier to follow, by defining the 6…

269689c

… possible combinations we might see when comparing two intervals. Expanded testing and filled out comments for clarity Signed-off-by: Peter Farr <[email protected]>

Make Difference take and return concrete types, since it will only be…

ec92f31

… used for Mysql56GTIDSet anyways. Signed-off-by: Peter Farr <[email protected]>

PrismaPhonic requested review from enisoc and systay June 12, 2020 23:48

systay reviewed Jun 13, 2020

systay reviewed Jun 13, 2020

systay approved these changes Jun 13, 2020

shlomi-noach reviewed Jun 14, 2020

PrismaPhonic added 2 commits June 15, 2020 13:44

Added extra tests requested, and clarified a comment now that we no l…

1c1cce0

…onger call the second stack s2, and instead just called it otherIntervals. Signed-off-by: Peter Farr <[email protected]>

deepthi reviewed Jun 16, 2020

Improve comment per review suggestions.

d315ef5

Signed-off-by: Peter Farr <[email protected]>

PrismaPhonic marked this pull request as ready for review June 16, 2020 01:20

PrismaPhonic requested a review from sougou as a code owner June 16, 2020 01:20

PrismaPhonic requested review from shlomi-noach and deepthi June 16, 2020 01:20

PrismaPhonic added 3 commits June 15, 2020 18:25

Just realized this was mutating our input. Fixed it to instead copy o…

1dd0f17

…ver to a new set rather than deleting master sid from existing set. Signed-off-by: Peter Farr <[email protected]>

Check set for nil as well per review suggestion.

e56a7c0

Signed-off-by: Peter Farr <[email protected]>

Let's follow the same pattern here so the way that we throw out the m…

703fe9b

…aster sid is consistent for both receiver and supplied SlaveStatus' Signed-off-by: Peter Farr <[email protected]>

deepthi merged commit aa909c5 into master Jun 17, 2020

deepthi deleted the find-errant-gtids branch July 8, 2020 20:31

deepthi added this to the v7.0 milestone Jul 27, 2020

ameetkotian mentioned this pull request Aug 20, 2020

Slack vitess 2020.08.19.r0 tinyspeck/vitess#180

Merged

deepthi mentioned this pull request Sep 6, 2024

FindErrantGTIDs: superset is not an errant GTID situation #16725

Merged
