Copy the data


Copies data to a database. Currently, only copying vector data to HBase is supported. The supported source data engines are UDB and GDB. UDB data can be a local file or a file registered with iServer; for registered UDB datasets, field indexes and vector pyramids can also be created. If a dataset with the same name is copied to the database more than once, each copy is stored as a new dataset by default. Copying data that contains sensitive fields such as "ID" is not currently supported.

To avoid failures when copying large amounts of data, the copy process has been optimized with batch copying: registered UDB datasets can be copied to the database in batches. The batching rules depend on the dataset type:

Point dataset:

    1. When the dataset contains fewer than 4 million records, it is not copied in batches.

    2. When the dataset contains more than 4 million records, it is copied in batches of 2 million. For example, 5 million points are copied in 3 batches.

Line dataset:

    1. When the dataset contains fewer than 2 million records, it is not copied in batches.

    2. When the dataset contains more than 2 million records, it is copied in batches of 1 million. For example, 3 million lines are copied in 3 batches.

Face dataset:

    1. When the dataset contains fewer than 1 million records, it is not copied in batches.

    2. When the dataset contains more than 1 million records, it is copied in batches of 500,000. For example, 1.5 million faces are copied in 3 batches.
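
The batching rules above can be sketched as a small calculation. This is only an illustration of the arithmetic, not the product's implementation: the names `BATCH_RULES` and `batch_count` are hypothetical, and treating a count exactly equal to the threshold as "not batched" is an assumption, since the rules only describe counts below and above the threshold.

```python
import math

# (threshold, batch size) per dataset type, taken from the rules above.
# Both values are record counts.
BATCH_RULES = {
    "point": (4_000_000, 2_000_000),
    "line": (2_000_000, 1_000_000),
    "face": (1_000_000, 500_000),
}


def batch_count(dataset_type: str, record_count: int) -> int:
    """Return how many batches a copy of `record_count` records would use."""
    threshold, batch_size = BATCH_RULES[dataset_type]
    # Assumption: a count exactly at the threshold is copied in one pass;
    # the documented rules do not cover this case.
    if record_count <= threshold:
        return 1
    # Above the threshold, the last batch may be only partially filled.
    return math.ceil(record_count / batch_size)


if __name__ == "__main__":
    for kind, count in [("point", 5_000_000), ("line", 3_000_000), ("face", 1_500_000)]:
        print(kind, count, batch_count(kind, count))
```

Run on the three examples given in the rules (5 million points, 3 million lines, 1.5 million faces), the sketch yields 3 batches in each case, matching the documented figures.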

When creating a copy data task, you need to set the following parameters: