本頁面由 Cloud Translation API 翻譯而成。

將查詢結果匯出至 Amazon S3

本文說明如何將針對 BigLake 資料表執行的查詢結果匯出至 Amazon Simple Storage Service (Amazon S3) 儲存桶。

如要瞭解 BigQuery 和 Amazon S3 之間的資料流動方式，請參閱「匯出資料時的資料流動」。

限制

如要查看適用於 Amazon S3 和 Blob 儲存體 BigLake 資料表的完整限制清單，請參閱「限制」。

事前準備

請確認您已備妥下列資源：

存取 Amazon S3 值區的連結。
Amazon S3 BigLake 資料表。
正確的 Amazon Web Services (AWS) 身分與存取權管理 (IAM) 政策：
- 您必須具備 PutObject 權限，才能將資料寫入 Amazon S3 值區。如需更多資訊，請參閱「為 BigQuery 建立 AWS 身分與存取權管理政策」。

如果您採用容量定價模式，請確認您已為專案啟用 BigQuery Reservation API。如要瞭解定價資訊，請參閱 BigQuery Omni 定價。

匯出查詢結果

無論現有內容為何，BigQuery Omni 都會寫入指定的 Amazon S3 位置。匯出查詢可以覆寫現有資料，或將查詢結果與現有資料混合。建議您將查詢結果匯出至空白的 Amazon S3 值區。

如要執行查詢，請選取下列任一選項：

SQL

在「Query editor」(查詢編輯器) 欄位中輸入 GoogleSQL 匯出查詢。GoogleSQL 是 Google Cloud 主控台的預設語法。

前往 Google Cloud 控制台的「BigQuery」頁面。

前往 BigQuery
在查詢編輯器中輸入以下陳述式：
```
   EXPORT DATA WITH CONNECTION `CONNECTION_REGION.CONNECTION_NAME`
   OPTIONS(uri="s3://BUCKET_NAME/PATH", format="FORMAT", ...)
   AS QUERY
```
更改下列內容：
- CONNECTION_REGION：建立連線的區域。
- CONNECTION_NAME：您建立的連線名稱，具有寫入 Amazon S3 值區的必要權限。
- BUCKET_NAME：您要寫入資料的 Amazon S3 值區。
- PATH：您要將匯出檔案寫入的路徑。路徑字串的葉目錄中必須包含一個萬用字元 *，例如 ../aa/*、../aa/b*c、../aa/*bc 和 ../aa/bc*。根據匯出檔案數量，BigQuery 會將 * 替換為 0000..N。BigQuery 會決定檔案數量和大小。如果 BigQuery 決定匯出兩個檔案，則第一個檔案的檔案名稱中的 * 會替換為 000000000000，第二個檔案的檔案名稱中的 * 則會替換為 000000000001。
- FORMAT：支援的格式為 JSON、AVRO、CSV 和 PARQUET。
- QUERY：用於分析儲存在 BigLake 資料表中的資料。
- 按一下「Run」。

如要進一步瞭解如何執行查詢，請參閱「執行互動式查詢」一文。

Java

在嘗試這個範例之前，請先按照 BigQuery 快速入門：使用用戶端程式庫中的 Java 設定說明進行操作。詳情請參閱 BigQuery Java API 參考說明文件。

如要向 BigQuery 進行驗證，請設定應用程式預設憑證。詳情請參閱「設定用戶端程式庫的驗證機制」。

import com.google.cloud.bigquery.BigQuery;
import com.google.cloud.bigquery.BigQueryException;
import com.google.cloud.bigquery.BigQueryOptions;
import com.google.cloud.bigquery.QueryJobConfiguration;
import com.google.cloud.bigquery.TableResult;

// Sample to export query results to Amazon S3 bucket
public class ExportQueryResultsToS3 {

  public static void main(String[] args) throws InterruptedException {
    // TODO(developer): Replace these variables before running the sample.
    String projectId = "MY_PROJECT_ID";
    String datasetName = "MY_DATASET_NAME";
    String externalTableName = "MY_EXTERNAL_TABLE_NAME";
    // connectionName should be in the format of connection_region.connection_name. e.g.
    // aws-us-east-1.s3-write-conn
    String connectionName = "MY_CONNECTION_REGION.MY_CONNECTION_NAME";
    // destinationUri must contain exactly one * anywhere in the leaf directory of the path string
    // e.g. ../aa/*, ../aa/b*c, ../aa/*bc, and ../aa/bc*
    // BigQuery replaces * with 0000..N depending on the number of files exported.
    // BigQuery determines the file count and sizes.
    String destinationUri = "s3://your-bucket-name/*";
    String format = "EXPORT_FORMAT";
    // Export result of query to find states starting with 'W'
    String query =
        String.format(
            "EXPORT DATA WITH CONNECTION `%s` OPTIONS(uri='%s', format='%s') "
              + "AS SELECT * FROM %s.%s.%s WHERE name LIKE 'W%%'",
            connectionName, destinationUri, format, projectId, datasetName, externalTableName);
    exportQueryResultsToS3(query);
  }

  public static void exportQueryResultsToS3(String query) throws InterruptedException {
    try {
      // Initialize client that will be used to send requests. This client only needs to be created
      // once, and can be reused for multiple requests.
      BigQuery bigquery = BigQueryOptions.getDefaultInstance().getService();

      TableResult results = bigquery.query(QueryJobConfiguration.of(query));

      results
          .iterateAll()
          .forEach(row -> row.forEach(val -> System.out.printf("%s,", val.toString())));

      System.out.println("Query results exported to Amazon S3 successfully.");
    } catch (BigQueryException e) {
      System.out.println("Query not performed \n" + e.toString());
    }
  }
}

疑難排解

如果您收到與 quota failure 相關的錯誤訊息，請檢查是否已為查詢保留容量。如要進一步瞭解時段預留功能，請參閱本文件的「事前準備」一節。

後續步驟

瞭解 BigQuery Omni。
瞭解如何匯出資料表資料。
瞭解如何查詢儲存在 Amazon S3 中的資料。
瞭解如何為 BigQuery Omni 設定 VPC Service Controls。