Skip to content

Latest commit

 

History

History
207 lines (133 loc) · 10.1 KB

serverless-export.md

File metadata and controls

207 lines (133 loc) · 10.1 KB
title summary
Export Data from TiDB Cloud Serverless
Learn how to export data from TiDB Cloud Serverless clusters.

Export Data from TiDB Cloud Serverless

TiDB Cloud Serverless Export (Beta) is a service that enables you to export data from a TiDB Cloud Serverless cluster to a local file or an external storage service. You can use the exported data for backup, migration, data analysis, or other purposes.

While you can also export data using tools such as mysqldump and TiDB Dumpling, TiDB Cloud Serverless Export offers a more convenient and efficient way to export data from a TiDB Cloud Serverless cluster. It brings the following benefits:

  • Convenience: the export service provides a simple and easy-to-use way to export data from a TiDB Cloud Serverless cluster, eliminating the need for additional tools or resources.
  • Isolation: the export service uses separate computing resources, ensuring isolation from the resources used by your online services.
  • Consistency: the export service ensures the consistency of the exported data without causing locks, which does not affect your online services.

Export locations

You can export data to a local file or Amazon S3.

Note:

If the size of the data to be exported is large (more than 100 GiB), it is recommended that you export it to an external storage.

A local file

To export data from a TiDB Cloud Serverless cluster to a local file, you need to export data using the TiDB Cloud console or using the TiDB Cloud CLI, and then download the exported data using the TiDB Cloud CLI.

Exporting data to a local file has the following limitations:

  • Downloading exported data using the TiDB Cloud console is not supported.
  • Exported data is saved in the stashing area of TiDB Cloud and will expire after two days. You need to download the exported data in time.
  • If the storage space of stashing area is full, you will not be able to export data to the local file.

Amazon S3

To export data to Amazon S3, you need to provide the following information:

  • URI: s3://<bucket-name>/<file-path>
  • One of the following access credentials:
    • An access key: make sure the access key has the s3:PutObject and s3:ListBucket permissions.
    • A role ARN: make sure the role ARN has the s3:PutObject and s3:ListBucket permissions.

For more information, see Configure External Storage Access for TiDB Cloud Serverless.

Export options

Data filtering

  • TiDB Cloud console supports exporting data with the selected databases and tables.

Data formats

You can export data in the following formats:

  • SQL: export data in SQL format.
  • CSV: export data in CSV format. You can specify the following options:
    • delimiter: specify the delimiter used in the exported data. The default delimiter is ".
    • separator: specify the character used to separate fields in the exported data. The default separator is ,.
    • header: specify whether to include a header row in the exported data. The default value is true.
    • null-value: specify the string that represents a NULL value in the exported data. The default value is \N.

The schema and data are exported according to the following naming conventions:

Item Not compressed Compressed
Database schema {database}-schema-create.sql {database}-schema-create.sql.{compression-type}
Table schema {database}.{table}-schema.sql {database}.{table}-schema.sql.{compression-type}
Data {database}.{table}.{0001}.{csv|sql} {database}.{table}.{0001}.{csv|sql}.{compression-type}

Data compression

You can compress the exported CSV and SQL data using the following algorithms:

  • gzip (default): compress the exported data with gzip.
  • snappy: compress the exported data with snappy.
  • zstd: compress the exported data with zstd.
  • none: do not compress the exported data.

Steps

Export data to a local file

  1. Log in to the TiDB Cloud console and navigate to the Clusters page of your project.

    Tip:

    If you have multiple projects, you can click in the lower-left corner and switch to another project.

  2. Click the name of your target cluster to go to its overview page, and then click Import in the left navigation pane.

  3. On the Import page, click Export Data to in the upper-right corner, then choose Local File from the drop-down list. Fill in the following parameters:

    • Task Name: enter a name for the export task. The default value is SNAPSHOT_{snapshot_time}.
    • Exported Data: choose the databases and tables you want to export.
    • Data Format: choose SQL File or CSV.
    • Compression: choose Gzip, Snappy, Zstd, or None.

    Tip:

    If your cluster has neither imported nor exported any data before, you need to click Click here to export data to... at the bottom of the page to export data.

  4. Click Export.

  5. After the export task is successful, you can copy the download command displayed in the export task detail, and then download the exported data by running the command in the TiDB Cloud CLI.

  1. Create an export task:

    ticloud serverless export create -c <cluster-id>

    You will get an export ID from the output.

  2. After the export task is successful, download the exported data to your local file:

    ticloud serverless export download -c <cluster-id> -e <export-id>

    For more information about the download command, see ticloud serverless export download.

Export data to Amazon S3

  1. Log in to the TiDB Cloud console and navigate to the Clusters page of your project.

    Tip:

    If you have multiple projects, you can click in the lower-left corner and switch to another project.

  2. Click the name of your target cluster to go to its overview page, and then click Import in the left navigation pane.

  3. On the Import page, click Export Data to in the upper-right corner, then choose Amazon S3 from the drop-down list. Fill in the following parameters:

    • Task Name: enter a name for the export task. The default value is SNAPSHOT_{snapshot_time}.
    • Exported Data: choose the databases and tables you want to export.
    • Data Format: choose SQL File or CSV.
    • Compression: choose Gzip, Snappy, Zstd, or None.
    • Folder URI: enter the URI of the Amazon S3 with the s3://<bucket-name>/<folder-path>/ format.
    • Bucket Access: choose one of the following access credentials and then fill in the credential information. If you do not have such information, see Configure External Storage Access for TiDB Cloud Serverless.
      • AWS Role ARN: enter the role ARN that has the s3:PutObject and s3:ListBucket permissions to access the bucket.
      • AWS Access Key: enter the access key ID and access key secret that have the s3:PutObject and s3:ListBucket permissions to access the bucket.
  4. Click Export.

ticloud serverless export create -c <cluster-id> --s3.bucket-uri <uri> --s3.access-key-id <access-key-id> --s3.secret-access-key <secret-access-key>
  • s3.bucket-uri: the Amazon S3 URI with the s3://<bucket-name>/<file-path> format.
  • s3.access-key-id: the access key ID of the user who has the permission to access the bucket.
  • s3.secret-access-key: the access key secret of the user who has the permission to access the bucket.

Cancel an export task

To cancel an ongoing export task, take the following steps:

  1. Log in to the TiDB Cloud console and navigate to the Clusters page of your project.

    Tip:

    If you have multiple projects, you can click in the lower-left corner and switch to another project.

  2. Click the name of your target cluster to go to its overview page, and then click Import in the left navigation pane.

  3. On the Import page, click Export to view the export task list.

  4. Choose the export task you want to cancel, and then click Action.

  5. Choose Cancel in the drop-down list. Note that you can only cancel the export task that is in the Running status.

ticloud serverless export cancel -c <cluster-id> -e <export-id>

Pricing

The export service is free during the beta period. You only need to pay for the Request Units (RUs) generated during the export process of successful or canceled tasks. For failed export tasks, you will not be charged.