Join and swap two columns

This page explains how to join column values and swap column names when you prepare data in the Wrangler workspace of the Cloud Data Fusion Studio.

Join two columns

The Wrangler workspace supports joining two columns of the same or different data types. The JOIN operation's output is stored in a new column containing the joined fields from both columns. Wrangler doesn't support joining columns of the boolean and bytes data types with other columns.

To join two columns, follow these steps:

  1. Go to Wrangler workspace in Cloud Data Fusion.
  2. On the Data tab, select the checkbox by two column names.
  3. Click the arrow_drop_down expander arrow by one of the column names.
  4. Select Join two columns, and select an option—for example, Custom selection.
  5. Choose an order, delimiter, and new column name for the JOIN operation's output.
  6. Click Join.

Wrangler joins the columns and adds the merge directive to the recipe. When you run the data pipeline, the transformation is applied to all values in the column.

Swap two column names

The Wrangler workspace supports swapping (or interchanging) two column names. Only the column names interchange, the values in the column rows don't change.

To swap two column names, follow these steps:

  1. Go to Wrangler workspace in Cloud Data Fusion.
  2. On the Data tab, select the checkbox by two column names.
  3. Click the arrow_drop_down expander arrow by one of the column names.
  4. Select Swap two column names

Wrangler swaps the column names and adds the swap directive to the recipe.

What's next