Skip to content

Commit

Permalink
Update migrate-from-parquet-files-to-tidb.md
Browse files Browse the repository at this point in the history
  • Loading branch information
hfxsd authored Jan 19, 2024
1 parent 0a4d9d9 commit 43fab23
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion migrate-from-parquet-files-to-tidb.md
Original file line number Diff line number Diff line change
Expand Up @@ -41,7 +41,7 @@ Each table in Hive can be exported to parquet files by annotating `STORED AS PAR
DROP TABLE temp;
```

3. The parquet files exported from Hive might not have the `.parquet` suffix and cannot be correctly identified by TiDB Lightning. Therefore, before importing the files, you need to rename the exported files and add the `.parquet` suffix to change the full filename to a format that TiDB Lightning recognizes, for example, `${db_name}. ${table_name}.parquet`.
3. The parquet files exported from Hive might not have the `.parquet` suffix and cannot be correctly identified by TiDB Lightning. Therefore, before importing the files, you need to rename the exported files and add the `.parquet` suffix to change the full filename to a format that TiDB Lightning recognizes, for example, `${db_name}. ${table_name}.parquet`. For more formats, see [TiDB Lightning Data Sources](/tidb-lightning/tidb-lightning-data-source.md).

4. Put all the parquet files in a unified directory, for example, `/data/my_datasource/` or `s3://my-bucket/sql-backup`. TiDB Lightning will recursively search for all `.parquet` files in this directory and its subdirectories.

Expand Down

0 comments on commit 43fab23

Please sign in to comment.