stop/pause until reached the end of a transaction #1095

csuzhangxc · 2020-09-24T10:14:02Z

Feature Request

Is your feature request related to a problem? Please describe:

DM splits each upstream MySQL transaction into rows and re-aggregates the rows into new transactions, which it applies as batches.

When a task is stopped or paused with stop-task or pause-task, only part of the rows in an upstream MySQL transaction may have been committed to the downstream TiDB. In other words, the original transaction is left broken in TiDB.
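To make the problem concrete, here is a minimal sketch (not DM code) of how stopping after an arbitrary number of applied rows can land in the middle of an upstream transaction. The rowChange type and stopsMidTransaction function are hypothetical names made up for this illustration.

```go
package main

import "fmt"

// rowChange is a hypothetical stand-in for one row split out of an
// upstream transaction; it is not a DM type.
type rowChange struct {
	txnID   int  // upstream transaction the row came from
	lastRow bool // true for the final row of that transaction
}

// stopsMidTransaction reports whether stopping after `applied` rows leaves
// the most recently applied upstream transaction only partly replicated.
func stopsMidTransaction(rows []rowChange, applied int) bool {
	if applied <= 0 || applied > len(rows) {
		return false
	}
	return !rows[applied-1].lastRow
}

func main() {
	rows := []rowChange{
		{txnID: 1}, {txnID: 1, lastRow: true},
		{txnID: 2}, {txnID: 2}, {txnID: 2, lastRow: true},
	}
	for n := 0; n <= len(rows); n++ {
		fmt.Printf("stop after %d rows: broken=%v\n", n, stopsMidTransaction(rows, n))
	}
}
```

Only the stops that fall on a lastRow boundary keep the downstream consistent with some upstream commit point, which is what this request asks stop-task/pause-task to guarantee.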

Describe the feature you'd like:

For stop-task/pause-task, or a normal shutdown of the DM-worker process, delay stopping or pausing the task until the end of the current transaction has been reached.

Describe alternatives you've considered:

Teachability, Documentation, Adoption, Migration Strategy:


lance6716 · 2020-10-15T05:28:21Z

Another way is to revert to the last consistent point (maybe recorded in the checkpoint), if TiDB could flashback a table.

pingcap/tidb#20302

lichunzhu · 2021-07-14T03:10:36Z

Maybe we can refer to tidb-binlog's logic:

1. Don't send any SQL statements later than the current transaction.
2. Close the syncer only after all SQL statements have been replicated to the downstream.
3. Add a boolean column synced to the downstream checkpoint to record whether DM really stopped at a transaction boundary.

https://github.com/pingcap/tidb-binlog/blob/v4.0.13/drainer/syncer.go#L484
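A rough sketch of that idea, under heavy simplification: it is neither tidb-binlog nor DM code, and the event and checkpoint types and the replicateUntilBoundary function are hypothetical. Events after the stop request are still sent until the current transaction commits, everything later is dropped, and the checkpoint records whether the stop landed on a boundary.

```go
package main

import "fmt"

// event is a hypothetical simplified binlog event: either a row change or
// the commit (XID) of the current transaction.
type event struct {
	pos   uint32 // end position of the event in the binlog
	isXID bool   // true when the event commits the current transaction
}

// checkpoint models a downstream checkpoint record with the proposed flag.
type checkpoint struct {
	pos    uint32
	synced bool // true only when replication stopped on a transaction boundary
}

// replicateUntilBoundary sends events up to the first XID at or after index
// stopAt (where the stop was requested) and drops everything later.
func replicateUntilBoundary(events []event, stopAt int) (sent []event, cp checkpoint) {
	for i, ev := range events {
		sent = append(sent, ev)
		cp.pos = ev.pos
		cp.synced = ev.isXID
		if i >= stopAt && ev.isXID {
			break
		}
	}
	return sent, cp
}

func main() {
	events := []event{
		{pos: 10}, {pos: 20}, {pos: 30, isXID: true},
		{pos: 40}, {pos: 50, isXID: true},
	}
	// The stop request arrives while the first transaction is in flight.
	sent, cp := replicateUntilBoundary(events, 1)
	fmt.Printf("%d events sent, checkpoint=%+v\n", len(sent), cp)
}
```

If the event stream ends before the next XID arrives, synced stays false, which is the case the flag is meant to expose.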

okJiang · 2021-07-28T02:09:49Z

How about saving the position of the end of the transaction (the last XID event) directly every time saveTablePoint is called?

okJiang · 2021-07-28T02:28:07Z

How about saving the position of the end of the transaction (the last XID event) directly every time saveTablePoint is called?

Since this cannot guarantee consistency between upstream and downstream data, we chose instead to delay the syncer on stop/pause until the end of the transaction, and then call flushCheckPoint.

okJiang · 2021-07-28T06:16:13Z

Initial idea: delay the point where the syncer exits and closes the job channels.

dm/syncer/syncer.go

Line 3040 in 11fb5a8

s.closeJobChans()

Wait for the arrival of XIDEvent before closing the job channel.

Steps:

1. Before closing the job channel, set Syncer.waitXIDType = waiting, then wait here until Syncer.waitXIDType = waitComplete.
2. After Syncer.waitXIDType is set to waiting, Syncer keeps running normally until addJob encounters an XIDEvent.
3. After encountering an XIDEvent:
   a. set Syncer.waitXIDType = waitComplete
   b. stop addJob
   c. continue closing jobChan as in step 1

The jobs already in the job queue continue to be executed here (by adding executeSQLs()).

dm/syncer/syncer.go

Lines 1322 to 1326 in 11fb5a8

    
case sqlJob, ok := <-jobChan:
	metrics.QueueSizeGauge.WithLabelValues(s.cfg.Name, queueBucket, s.cfg.SourceID).Set(float64(len(jobChan)))
	if !ok {
		return
	}

waitXIDType:

type waitXIDType int

const (
    noWait waitXIDType = iota
    waiting
    waitComplete
)
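The steps above could be sketched roughly as follows. This is a standalone illustration, not the DM implementation: miniSyncer, newMiniSyncer, requestStop and closeIfDone are hypothetical names, jobs are plain strings, and the state is stored as int32 (rather than int as declared above) so that it can be used with sync/atomic.

```go
package main

import (
	"fmt"
	"sync"
	"sync/atomic"
)

// waitXIDType mirrors the proposed states; int32 is an assumption made
// here so the value can be read and written atomically.
type waitXIDType int32

const (
	noWait waitXIDType = iota
	waiting
	waitComplete
)

// miniSyncer is a hypothetical, heavily reduced stand-in for DM's Syncer.
type miniSyncer struct {
	waitXID   int32 // holds a waitXIDType value
	jobChan   chan string
	closeOnce sync.Once
}

func newMiniSyncer(buf int) *miniSyncer {
	return &miniSyncer{jobChan: make(chan string, buf)}
}

// requestStop is step 1: record that a stop is pending until the next XID.
func (s *miniSyncer) requestStop() {
	atomic.StoreInt32(&s.waitXID, int32(waiting))
}

// addJob enqueues a job and returns true, or returns false once the
// transaction boundary has been reached and new jobs must be dropped.
func (s *miniSyncer) addJob(job string, isXID bool) bool {
	if atomic.LoadInt32(&s.waitXID) == int32(waitComplete) {
		return false // step 3b: stop adding jobs
	}
	s.jobChan <- job
	if isXID {
		// Step 3a: has an effect only if a stop was requested.
		atomic.CompareAndSwapInt32(&s.waitXID, int32(waiting), int32(waitComplete))
	}
	return true
}

// closeIfDone is step 3c: close the job channel once the XID has been seen.
func (s *miniSyncer) closeIfDone() bool {
	if atomic.LoadInt32(&s.waitXID) != int32(waitComplete) {
		return false
	}
	s.closeOnce.Do(func() { close(s.jobChan) })
	return true
}

func main() {
	s := newMiniSyncer(8)
	s.addJob("insert a", false)
	s.requestStop()
	s.addJob("insert b", false)
	s.addJob("commit", true)
	fmt.Println("accepted after XID:", s.addJob("insert c", false))
	fmt.Println("closed:", s.closeIfDone())
	// Jobs queued before the close are still drained by the consumer.
	for job := range s.jobChan {
		fmt.Println("execute:", job)
	}
}
```

The compare-and-swap keeps an XID that arrives during normal replication (noWait) from changing the state, so only a pending stop request is ever completed.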

PTAL @lance6716 @lichunzhu

lance6716 · 2021-07-28T07:14:34Z

There are already so many syncer states: normal replication, sharding re-sync, handle-error with injected SQL, checking whether to turn off safe mode, and now your waiting for an XID.

Could you express the above states and their transitions in a clearer way? Maybe a state machine.

okJiang · 2021-07-28T07:17:01Z

There are already so many syncer states: normal replication, sharding re-sync, handle-error with injected SQL, checking whether to turn off safe mode, and now your waiting for an XID.

Could you express the above states and their transitions in a clearer way? Maybe a state machine.

I'm afraid I can't fully understand all the above states in a short time

csuzhangxc added the type/feature-request and help wanted labels Sep 24, 2020

okJiang mentioned this issue Jul 28, 2021

syncer: stop/pause until reached the end of a transaction #1928

Merged

ti-chi-bot closed this as completed in #1928 Aug 17, 2021

ti-chi-bot mentioned this issue Aug 17, 2021

syncer: stop/pause until reached the end of a transaction (#1928) #2000

Merged
