# Understand the AML data model and requirements
At the core of AML AI is a detailed and up-to-date understanding
of parties of the bank and their activity, covering, in particular, the
following data:
- Transactional activity
- Account holdings
- Party demographics
- Risk investigation data
This page covers the creation and management of data used by
AML AI, including details of the data model, data schema, and
data requirements for AML. The schema itself, including details for the
individual fields, appears in the [AML input data model](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model)
([CSV file](/static/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model.csv)). A synthetic example dataset is also
available through the [Quickstart](/financial-services/anti-money-laundering/docs/train-models-to-detect-money-laundering).
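If you want to inspect the field-level schema programmatically, a minimal sketch like the following can download and print the published CSV. The full URL and the CSV's column headers are assumptions based on the link above; adjust them to match the actual file.

```python
import csv
import io
import urllib.request

# URL of the published schema CSV (assumed reachable without authentication).
SCHEMA_CSV_URL = (
    "https://cloud.google.com/static/financial-services/"
    "anti-money-laundering/docs/reference/schemas/aml-input-data-model.csv"
)

with urllib.request.urlopen(SCHEMA_CSV_URL) as response:
    text = response.read().decode("utf-8")

reader = csv.DictReader(io.StringIO(text))
print("Schema CSV columns:", reader.fieldnames)

# Print the first few field definitions; the column headers in each row
# depend on the published file.
for i, row in enumerate(reader):
    if i >= 5:
        break
    print(row)
```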
The following prerequisites are not covered on this page:

- Setup to use AML AI with an AML dataset (see [Set up a project and permissions](/financial-services/anti-money-laundering/docs/set-up-project-permissions))
- Security and compliance features (see pages under [Security and compliance features](/financial-services/anti-money-laundering/docs/concepts/security-and-compliance-features))

## Overview of data requirements
The AML data model accepts information on retail or commercial parties, their
accounts and transactions, and detailed information on risk cases related to
these parties. This section introduces important aspects of the data model that
are valid across the different entities.
The AML data model schema is arranged into three areas: core banking data, risk
investigation data, and supplementary data.
### Core banking data

- **Tables**: [Party](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model#party), [AccountPartyLink](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model#accountpartylink), [Transaction](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model#transaction)
- **Purpose**: Serves as a structured collection of data on your customers and their banking activity, used in detection of risk. All parties, accounts, and transactions to be monitored should be included. Provide either retail or commercial data in an AML AI dataset.

### Risk investigation data

- **Table**: [RiskCaseEvent](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model#riskcaseevent)
- **Purpose**:
  - Serves as a structured collection of data on risk investigation processes and parties previously identified as risky
  - Assists in the creation of training labels for AML risk models
### Supplementary data

- **Table**: [PartySupplementaryData](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model#partysupplementarydata)
- **Purpose**: Optional table that can contain additional information relevant to identifying money laundering risk that is not covered in the rest of the schema. You should start using AML AI without providing any supplementary data.
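As a quick sanity check before building a dataset, you might confirm that your BigQuery dataset contains a table for each part of the schema. The sketch below assumes snake_case table names that mirror the schema tables and uses placeholder project and dataset IDs; the actual names are up to you.

```python
from google.cloud import bigquery

# Assumed snake_case table names mirroring the schema tables; use whatever
# names your own BigQuery dataset actually contains.
EXPECTED_TABLES = {
    "party", "account_party_link", "transaction",  # core banking data
    "risk_case_event",                             # risk investigation data
    "party_supplementary_data",                    # supplementary data (optional)
}

client = bigquery.Client(project="my-project")     # placeholder project ID
found = {table.table_id for table in client.list_tables("my_aml_dataset")}  # placeholder dataset ID

missing = EXPECTED_TABLES - found
print("Missing tables:", sorted(missing) if missing else "none")
```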
### Table relationships

The following diagram describes the table relationships, primary keys, and
foreign keys.

**Note:** Since AML AI uses data over time, primary keys may include `validity_start_time` to allow accurate representation of data over time. For example, AML AI can capture when a customer was added to or removed from a joint account. For more details, see [understanding how data changes over time](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model#data-changes-over-time).
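To see these relationships in practice, the following sketch joins the core banking tables on the keys implied by the diagram. The project, dataset, table, and column names are placeholders or assumptions; confirm the exact field names against the AML input data model.

```python
from google.cloud import bigquery

client = bigquery.Client(project="my-project")  # placeholder project ID

# Count transactions per party by walking Party -> AccountPartyLink -> Transaction.
query = """
SELECT
  p.party_id,
  apl.account_id,
  COUNT(t.transaction_id) AS transaction_count
FROM `my-project.my_aml_dataset.party` AS p
JOIN `my-project.my_aml_dataset.account_party_link` AS apl
  ON apl.party_id = p.party_id
JOIN `my-project.my_aml_dataset.transaction` AS t
  ON t.account_id = apl.account_id
GROUP BY p.party_id, apl.account_id
"""

for row in client.query(query).result():
    print(row.party_id, row.account_id, row.transaction_count)
```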
## Errors

When you create a dataset, AML AI automatically performs data
validation checks. For information about these checks, the error messages, and
how to fix them, see
[Data validation errors](/financial-services/anti-money-laundering/docs/reference/data-validation-errors).
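AML AI's own checks run when the dataset is created, but you can catch some obvious issues earlier with a lightweight pre-check in BigQuery. The sketch below looks for NULL key columns; the key column assumed for each table and the project and dataset IDs are placeholders, so verify them against the schema reference.

```python
from google.cloud import bigquery

client = bigquery.Client(project="my-project")  # placeholder project ID

# Assumed key column per table; confirm against the AML input data model.
KEY_COLUMNS = {
    "party": "party_id",
    "account_party_link": "account_id",
    "transaction": "transaction_id",
    "risk_case_event": "risk_case_event_id",
}

for table, key in KEY_COLUMNS.items():
    query = f"""
        SELECT COUNT(*) AS null_keys
        FROM `my-project.my_aml_dataset.{table}`
        WHERE {key} IS NULL
    """
    result = list(client.query(query).result())[0]
    print(f"{table}: {result.null_keys} rows with NULL {key}")
```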
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-29 UTC."],[[["\u003cp\u003eThis page details the data model, schema, and requirements for AML AI, which relies on understanding a bank's parties and their activities.\u003c/p\u003e\n"],["\u003cp\u003eThe AML data model is divided into three main areas: core banking data (parties, accounts, transactions), risk investigation data (risk cases), and optional supplementary data.\u003c/p\u003e\n"],["\u003cp\u003eThe data model includes tables such as Party, AccountPartyLink, Transaction, RiskCaseEvent, and PartySupplementaryData, each serving a specific purpose in risk detection and model training.\u003c/p\u003e\n"],["\u003cp\u003eAML AI performs data validation checks when a dataset is created, with details on errors and fixes available in the Data Validation Errors section.\u003c/p\u003e\n"],["\u003cp\u003eData lineage is important, and it's recommended to take a snapshot of your BigQuery tables to preserve data integrity for AML AI operations.\u003c/p\u003e\n"]]],[],null,["# Understand the AML data model and requirements\n\nAt the core of AML AI is a detailed and up-to-date understanding\nof parties of the bank and their activity, covering, in particular, the\nfollowing data:\n\n- Transactional activity\n- Account holdings\n- Party demographics\n- Risk investigation data\n\nThis page covers the creation and management of data used by\nAML AI, including details of the data model, data schema, and\ndata requirements for AML. The schema itself, including details for the\nindividual fields, appears in the [AML input data model](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model)\n([CSV file](/static/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model.csv)). A synthetic example dataset is also\navailable through the [Quickstart](/financial-services/anti-money-laundering/docs/train-models-to-detect-money-laundering).\n\nThe following prerequisites are not covered on this page:\n\n- Setup to use AML AI with an AML dataset (see [Set up a project and permissions](/financial-services/anti-money-laundering/docs/set-up-project-permissions))\n- Security and compliance features (see pages under [Security and compliance features](/financial-services/anti-money-laundering/docs/concepts/security-and-compliance-features))\n\nOverview of data requirements\n-----------------------------\n\nThe AML data model accepts information on retail or commercial parties, their\naccounts and transactions, and detailed information on risk cases related to\nthese parties. 
This section introduces important aspects of the data model that\nare valid across the different entities.\n\nThe AML data model schema is arranged into three areas: core banking data, risk\ninvestigation data, and supplementary data.\n\n### Core banking data\n\n- **Tables** : [Party](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model#party), [AccountPartyLink](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model#accountpartylink), [Transaction](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model#transaction)\n- **Purpose**: Serves as a structured collection of data on your customers and their banking activity, used in detection of risk. All parties, accounts and transactions to be monitored should be included. Provide either retail or commercial data in an AML AI dataset\n\n### Risk investigation data\n\n- **Table** : [RiskCaseEvent](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model#riskcaseevent)\n- **Purpose** :\n - Serves as a structured collection of data on risk investigation processes and parties previously identified as risky\n - Assists in the creation of training labels for AML risk models\n\n### Supplementary data\n\n- **Table** : [PartySupplementaryData](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model#partysupplementarydata)\n- **Purpose**: Optional table that can contain additional information relevant to identifying money laundering risk that is not covered in the rest of the schema. You should start using AML AI without providing any supplementary data.\n\n### Table relationships\n\nThe following diagram describes the table relationships, primary keys, and\nforeign keys.\n| **Note:** Since AML AI uses data over time, primary keys may include `validity_start_time` to allow accurate representation of data over time. For example, AML AI can capture when a customer was added to or removed from a joint account. For more details, see [understanding how data changes over time](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model#data-changes-over-time).\n\nErrors\n------\n\nWhen you create a dataset, AML AI automatically performs data\nvalidation checks. For information about these checks, the error messages and\nhow to fix them, see\n[Data validation errors](/financial-services/anti-money-laundering/docs/reference/data-validation-errors).\n\nFor more information about the technical schema, see\n[AML input data model](/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model)\n([CSV file](/static/financial-services/anti-money-laundering/docs/reference/schemas/aml-input-data-model.csv)).\nTo understand the data duration requirements and scope, see\n[Understand data scope and duration](/financial-services/anti-money-laundering/docs/understand-data-scope-duration).\nWhen you have tables ready in BigQuery, you can use\nAML AI to\n[create and manage a dataset](/financial-services/anti-money-laundering/docs/create-and-manage-datasets).\n| **Note:** Most model governance policies define a requirement to track data lineage used across all ML operations from engine configuration, training, and evaluation. To ensure data remains unchanged, we recommend that you create a snapshot of your BigQuery tables. 
In the case that you don't snapshot data, AML AI operations read the BigQuery tables each time an operation uses the dataset, so changes to the dataset could impact tuning, training, backtesting, and predictions."]]
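One way to take such a snapshot, sketched here with placeholder project, dataset, and table names, is BigQuery's `CREATE SNAPSHOT TABLE` DDL driven from the Python client:

```python
from google.cloud import bigquery

client = bigquery.Client(project="my-project")  # placeholder project ID

# Snapshot one source table so the data AML AI reads stays fixed.
# The expiration option is optional; omit it to keep the snapshot indefinitely.
ddl = """
CREATE SNAPSHOT TABLE `my-project.my_aml_dataset.party_20250829`
CLONE `my-project.my_aml_dataset.party`
OPTIONS (
  expiration_timestamp = TIMESTAMP_ADD(CURRENT_TIMESTAMP(), INTERVAL 90 DAY)
)
"""

client.query(ddl).result()  # runs the DDL and waits for completion
print("Snapshot created.")
```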