Optimize DDL for CHAR(10) to VARCHAR(100) #40574

dveeden · 2023-01-13T09:27:01Z

Feature Request

Changing a column from a CHAR(n) to a VARCHAR(m) is now going through a full reorg. However the CHAR is guaranteed to fit in the VARCHAR as long as the charset remains the same and if m>=n.

Here the source is a CHAR and the target is a VARCHAR, but this is likely also true for any other combination of these types and might also be true for VARBINARY etc.

The improvement would be to do this as a metadata only change and only change the actual rows when they are written again.

The text was updated successfully, but these errors were encountered:

mjonss · 2023-01-17T12:23:49Z

It looks like the secondary indexes have different format for char and varchar (according to comment here).
We could still do this as metadata change if the column is not included in any index.

bb7133 · 2023-01-17T13:33:26Z

Thanks, @mjonss

Even for columns without a secondary index, we cannot convert VARCHAR to CHAR without data reorganization because for TiDB, the data in CHAR is potentially truncated by removing all trailing spaces.

But for CHAR to VARCHAR without any index, I think this can be optimized. /cc @zimulala @Benjamin2037

yiwen92 · 2023-01-18T07:54:48Z

It looks like the secondary indexes have different format for char and varchar (according to comment here). We could still do this as metadata change if the column is not included in any index.

What is the exactly difference between char and varchar for index encoding?
// 2. char -> varchar: the index value encoding of secondary index on clustered primary key tables is different. // These secondary indexes need to be rewritten.

dveeden added the type/feature-request Categorizes issue or PR as related to a new feature. label Jan 13, 2023

dveeden mentioned this issue Jan 17, 2023

Kafka sink: Optimize DML with column type changes that don't change data pingcap/tiflow#8095

Closed

bb7133 added type/enhancement The issue or PR belongs to an enhancement. and removed type/feature-request Categorizes issue or PR as related to a new feature. labels Jan 17, 2023

bb7133 added the help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines. label Jan 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize DDL for CHAR(10) to VARCHAR(100) #40574

Optimize DDL for CHAR(10) to VARCHAR(100) #40574

dveeden commented Jan 13, 2023

mjonss commented Jan 17, 2023

bb7133 commented Jan 17, 2023

yiwen92 commented Jan 18, 2023

Optimize DDL for CHAR(10) to VARCHAR(100) #40574

Optimize DDL for CHAR(10) to VARCHAR(100) #40574

Comments

dveeden commented Jan 13, 2023

Feature Request

mjonss commented Jan 17, 2023

bb7133 commented Jan 17, 2023

yiwen92 commented Jan 18, 2023