Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

br restore failed when injection network partition between one of tikv and other pods #54053

Closed
Lily2025 opened this issue Jun 17, 2024 · 1 comment · Fixed by #54084
Closed
Assignees
Labels
affects-6.5 affects-7.1 affects-7.5 affects-8.1 component/br This issue is related to BR of TiDB. severity/major type/bug The issue is confirmed as a bug.

Comments

@Lily2025
Copy link

Lily2025 commented Jun 17, 2024

Bug Report

Please answer these questions before submitting your issue. Thanks!

1. Minimal reproduce step (Required)

1、run br restore
2、inject network partition between one of tikv and other pods last for 3mins

2. What did you expect to see? (Required)

br restore can success

3. What did you see instead (Required)

br restore failed when injection network partition between one of tikv and other pods

"download file failed"] [files="{total=1,files=\"[4_44722_1354_b077a553d62445e72046f59e7616693bf929d6284e7895610f0ebea47059ec57_1638173205994_write.sst]\",totalKVs=410870,totalBytes=88747920,totalSize=40516977}"] [region="{ID=7888,startKey=7480000000000000FFFA5F728000000001FFE62B060000000000FA,endKey=7480000000000000FFFA5F728000000001FFEC97DA0000000000FA,epoch=\"conf_ver:23 version:845 \",peers=\"id:7889 store_id:1 ,id:7891 store_id:4 ,id:8040 store_id:8 \"}"] [startKey=7480000000000000FFFA5F728000000001FFE62B060000000000FA] [endKey=7480000000000000FFFA5F728000000001FFEC97DA0000000000FA] [error="rpc error: code = Unavailable desc = error reading from server: read tcp [10.233.109.46:38220](http://10.233.109.46:38220/)->[10.233.109.55:20160](http://10.233.109.55:20160/): read: connection timed out; rpc error: code = Unavailable desc = connection error: desc = \"transport: error while dialing: dial tcp [10.233.109.55:20160](http://10.233.109.55:20160/): i/o timeout\"; rpc error: code = Unavailable desc = connection error: desc = \"transport: error while dialing: dial tcp [10.233.109.55:20160](http://10.233.109.55:20160/): i/o timeout\"; rpc error: code = Unavailable desc = connection error: desc = \"transport: error while dialing: dial tcp [10.233.109.55:20160](http://10.233.109.55:20160/): i/o timeout\"; rpc error: code = Unavailable desc = connection error: desc = \"transport: error while dialing: dial tcp [10.233.109.55:20160](http://10.233.109.55:20160/): i/o timeout\"; rpc error: code = Unavailable desc = connection error: desc = \"transport: error while dialing: dial tcp [10.233.109.55:20160](http://10.233.109.55:20160/): i/o timeout\"; rpc error: code = Unavailable desc = connection error: desc = \"transport: error while dialing: dial tcp [10.233.109.55:20160](http://10.233.109.55:20160/): i/o timeout\"; rpc error: code = Unavailable desc = connection error: desc = \"transport: error while dialing: dial tcp [10.233.109.55:20160](http://10.233.109.55:20160/): i/o timeout\""] [stack="[github.com/pingcap/tidb/br/pkg/restore.(*FileImporter).ImportSSTFiles.func1\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/br/br/pkg/restore/import.go:523\ngithub.com/pingcap/tidb/br/pkg/utils.WithRetry.func1\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/br/br/pkg/utils/retry.go:216\ngithub.com/pingcap/tidb/br/pkg/utils.WithRetryV2[...]\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/br/br/pkg/utils/retry.go:234\ngithub.com/pingcap/tidb/br/pkg/utils.WithRetry\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/br/br/pkg/utils/retry.go:215\ngithub.com/pingcap/tidb/br/pkg/restore.(*FileImporter).ImportSSTFiles\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/br/br/pkg/restore/import.go:492\ngithub.com/pingcap/tidb/br/pkg/restore.(*Client).RestoreSSTFiles.func2\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/br/br/pkg/restore/client.go:1183\ngithub.com/pingcap/tidb/br/pkg/utils.(*WorkerPool).ApplyOnErrorGroup.func1\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/br/br/pkg/utils/worker.go:76\ngolang.org/x/sync/errgroup.(*Group).Go.func1\n\t/go/pkg/mod/golang.org/x/[email protected]/errgroup/errgroup.go:78](http://github.com/pingcap/tidb/br/pkg/restore.(*FileImporter).ImportSSTFiles.func1/n/t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/br/br/pkg/restore/import.go:523/ngithub.com/pingcap/tidb/br/pkg/utils.WithRetry.func1/n/t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/br/br/pkg/utils/retry.go:216/ngithub.com/pingcap/tidb/br/pkg/utils.WithRetryV2[...]/n/t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/br/br/pkg/utils/retry.go:234/ngithub.com/pingcap/tidb/br/pkg/utils.WithRetry/n/t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/br/br/pkg/utils/retry.go:215/ngithub.com/pingcap/tidb/br/pkg/restore.(*FileImporter).ImportSSTFiles/n/t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/br/br/pkg/restore/import.go:492/ngithub.com/pingcap/tidb/br/pkg/restore.(*Client).RestoreSSTFiles.func2/n/t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/br/br/pkg/restore/client.go:1183/ngithub.com/pingcap/tidb/br/pkg/utils.(*WorkerPool).ApplyOnErrorGroup.func1/n/t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/br/br/pkg/utils/worker.go:76/ngolang.org/x/sync/errgroup.(*Group).Go.func1/n/t/go/pkg/mod/golang.org/x/[email protected]/errgroup/errgroup.go:78)"]

4. What is your TiDB version? (Required)

./tidb-server -V
Release Version: v6.5.10
Edition: Community
Git Commit Hash: 2306250
Git Branch: heads/refs/tags/v6.5.10
UTC Build Time: 2024-06-12 08:25:43
GoVersion: go1.19.13
Race Enabled: false
TiKV Min Version: 6.2.0-alpha
Check Table Before Drop: false
Store: unistore
2024-06-13T06:58:24.395+0800

@Lily2025 Lily2025 added the type/bug The issue is confirmed as a bug. label Jun 17, 2024
@Lily2025
Copy link
Author

/type bug
/severity major
/assign Leavrth

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
affects-6.5 affects-7.1 affects-7.5 affects-8.1 component/br This issue is related to BR of TiDB. severity/major type/bug The issue is confirmed as a bug.
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants