このページは Cloud Translation API によって翻訳されました。

パブリックインデックスをクエリして最近傍を取得する

インデックスを作成してデプロイしたら、クエリを実行して最近傍を取得できます。

以下に、k 最近傍探索アルゴリズム（k-NN）を使用して上位の最近傍を見つける一致クエリの例を示します。

パブリックエンドポイントのクエリ例

Vertex AI SDK for Python

Vertex AI SDK for Python のインストールまたは更新の方法については、Vertex AI SDK for Python をインストールするをご覧ください。詳細については、Vertex AI SDK for Python API のリファレンスドキュメントをご覧ください。

def vector_search_find_neighbors(
    project: str,
    location: str,
    index_endpoint_name: str,
    deployed_index_id: str,
    queries: List[List[float]],
    num_neighbors: int,
) -> List[
    List[aiplatform.matching_engine.matching_engine_index_endpoint.MatchNeighbor]
]:
    """Query the vector search index.

    Args:
        project (str): Required. Project ID
        location (str): Required. The region name
        index_endpoint_name (str): Required. Index endpoint to run the query
        against.
        deployed_index_id (str): Required. The ID of the DeployedIndex to run
        the queries against.
        queries (List[List[float]]): Required. A list of queries. Each query is
        a list of floats, representing a single embedding.
        num_neighbors (int): Required. The number of neighbors to return.

    Returns:
        List[List[aiplatform.matching_engine.matching_engine_index_endpoint.MatchNeighbor]] - A list of nearest neighbors for each query.
    """
    # Initialize the Vertex AI client
    aiplatform.init(project=project, location=location)

    # Create the index endpoint instance from an existing endpoint.
    my_index_endpoint = aiplatform.MatchingEngineIndexEndpoint(
        index_endpoint_name=index_endpoint_name
    )

    # Query the index endpoint for the nearest neighbors.
    return my_index_endpoint.find_neighbors(
        deployed_index_id=deployed_index_id,
        queries=queries,
        num_neighbors=num_neighbors,
    )

コマンドライン

下記の publicEndpointDomainName は、デプロイにあります。形式は <number>.<region>-<number>.vdb.vertexai.goog です。


  $ curl -X POST -H "Content-Type: application/json" -H "Authorization: Bearer `gcloud auth print-access-token`" https://1957880287.us-central1-181224308459.vdb.vertexai.goog/v1/projects/181224308459/locations/us-central1/indexEndpoints/3370566089086861312:findNeighbors -d '{deployed_index_id: "test_index_public1", queries: [{datapoint: {datapoint_id: "0", feature_vector: [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]}, neighbor_count: 5}]}'

この curl の例は、http(s) クライアントから呼び出す方法を示していますが、パブリックエンドポイントは RESTful と grpc_cli のデュアルプロトコルをサポートしています。


  $ curl -X POST -H "Content-Type: application/json" -H "Authorization: Bearer `gcloud auth print-access-token`" https://1957880287.us-central1-181224308459.vdb.vertexai.goog/v1/projects/${PROJECT_ID}/locations/us-central1/indexEndpoints/${INDEX_ENDPOINT_ID}:readIndexDatapoints -d '{deployed_index_id:"test_index_public1", ids: ["606431", "896688"]}'

この curl の例は、トークンと数値の制限を使用してクエリを実行する方法を示しています。


  $ curl -X POST -H "Content-Type: application/json" -H "Authorization: Bearer `gcloud auth print-access-token`"  https://${PUBLIC_ENDPOINT_DOMAIN}/v1/projects/${PROJECT_ID}/locations/${LOCATION}/indexEndpoints/${INDEX_ENDPOINT_ID}:findNeighbors -d '{deployed_index_id:"${DEPLOYED_INDEX_ID}", queries: [{datapoint: {datapoint_id:"x", feature_vector: [1, 1], "sparse_embedding": {"values": [111.0,111.1,111.2], "dimensions": [10,20,30]}, numeric_restricts: [{namespace: "int-ns", value_int: -2, op: "GREATER"}, {namespace: "int-ns", value_int: 4, op: "LESS_EQUAL"}, {namespace: "int-ns", value_int: 0, op: "NOT_EQUAL"}], restricts: [{namespace: "color", allow_list: ["red"]}]}}]}'

コンソール

パブリックエンドポイントにデプロイされたインデックスをコンソールからクエリするには、次の手順を実施します。

コンソールの [Vertex AI] セクションで、[デプロイと使用] セクションに移動します。 Google Cloud [ベクトル検索] を選択します。
[ベクトル検索] に移動
クエリを実行するインデックスを選択します。[インデックスの情報] ページが開きます。
[デプロイされたインデックス] セクションまで下にスクロールし、クエリを実行するデプロイされたインデックスを選択します。[デプロイされるインデックスの情報] ページが開きます。
[インデックスのクエリ] セクションで、密エンベディング値、スパースエンベディング値、ハイブリッドエンベディング値（密エンベディングとスパースエンベディング）、または特定のデータポイントでクエリを実行するかどうかを選択します。
選択したクエリの種類に合わせてクエリパラメータを入力します。たとえば、密エンベディングでクエリを実行する場合は、クエリするエンベディングベクトルを入力します。
指定された curl コマンドを使用して、または Cloud Shell を使用して実行し、クエリを実行します。
Cloud Shell を使用する場合は、[Cloud Shell で実行] を選択します。
Cloud Shell で実行します。
結果として最近傍が返されます。

ハイブリッドクエリ

ハイブリッド検索では、高密度エンベディングとスパースエンベディングの両方を使用して、キーワード検索とセマンティック検索を組み合わせて検索します。

Vertex AI SDK for Python

def vector_search_find_neighbors_hybrid_queries(
    project: str,
    location: str,
    index_endpoint_name: str,
    deployed_index_id: str,
    num_neighbors: int,
) -> List[
    List[aiplatform.matching_engine.matching_engine_index_endpoint.MatchNeighbor]
]:
    """Query the vector search index using example hybrid queries.

    Args:
        project (str): Required. Project ID
        location (str): Required. The region name
        index_endpoint_name (str): Required. Index endpoint to run the query
        against.
        deployed_index_id (str): Required. The ID of the DeployedIndex to run
        the queries against.
        num_neighbors (int): Required. The number of neighbors to return.

    Returns:
        List[List[aiplatform.matching_engine.matching_engine_index_endpoint.MatchNeighbor]] - A list of nearest neighbors for each query.
    """
    # Initialize the Vertex AI client
    aiplatform.init(project=project, location=location)

    # Create the index endpoint instance from an existing endpoint.
    my_index_endpoint = aiplatform.MatchingEngineIndexEndpoint(
        index_endpoint_name=index_endpoint_name
    )

    # Query hybrid datapoints, sparse-only datapoints, and dense-only datapoints.
    hybrid_queries = [
        aiplatform.matching_engine.matching_engine_index_endpoint.HybridQuery(
            dense_embedding=[1, 2, 3],
            sparse_embedding_dimensions=[10, 20, 30],
            sparse_embedding_values=[1.0, 1.0, 1.0],
            rrf_ranking_alpha=0.5,
        ),
        aiplatform.matching_engine.matching_engine_index_endpoint.HybridQuery(
            dense_embedding=[1, 2, 3],
            sparse_embedding_dimensions=[10, 20, 30],
            sparse_embedding_values=[0.1, 0.2, 0.3],
        ),
        aiplatform.matching_engine.matching_engine_index_endpoint.HybridQuery(
            sparse_embedding_dimensions=[10, 20, 30],
            sparse_embedding_values=[0.1, 0.2, 0.3],
        ),
        aiplatform.matching_engine.matching_engine_index_endpoint.HybridQuery(
            dense_embedding=[1, 2, 3]
        ),
    ]

    return my_index_endpoint.find_neighbors(
        deployed_index_id=deployed_index_id,
        queries=hybrid_queries,
        num_neighbors=num_neighbors,
    )

フィルタリングとクラウディングを使用したクエリ

ベクトルの一致をフィルタリングすると、最近傍の結果を特定のカテゴリに制限できます。フィルタを使用して、検索結果から除外するカテゴリを指定することもできます。

クラウディングネイバーごとの上限を使用すると、インデックスデータ内の単一のクラウディングタグから返される結果の数を制限することで、結果の多様性を高めることができます。

Vertex AI SDK for Python

def vector_search_find_neighbors_filtering_crowding(
    project: str,
    location: str,
    index_endpoint_name: str,
    deployed_index_id: str,
    queries: List[List[float]],
    num_neighbors: int,
    filter: List[aiplatform.matching_engine.matching_engine_index_endpoint.Namespace],
    numeric_filter: List[
        aiplatform.matching_engine.matching_engine_index_endpoint.NumericNamespace
    ],
    per_crowding_attribute_neighbor_count: int,
) -> List[
    List[aiplatform.matching_engine.matching_engine_index_endpoint.MatchNeighbor]
]:
    """Query the vector search index with filtering and crowding.

    Args:
        project (str): Required. Project ID
        location (str): Required. The region name
        index_endpoint_name (str): Required. Index endpoint to run the query
        against.
        deployed_index_id (str): Required. The ID of the DeployedIndex to run
        the queries against.
        queries (List[List[float]]): Required. A list of queries. Each query is
        a list of floats, representing a single embedding.
        num_neighbors (int): Required. The number of neighbors to return.
        filter (List[Namespace]): Required. A list of Namespaces for filtering
        the matching results. For example,
        [Namespace("color", ["red"], []), Namespace("shape", [], ["square"])]
        will match datapoints that satisfy "red color" but not include
        datapoints with "square shape".
        numeric_filter (List[NumericNamespace]): Required. A list of
        NumericNamespaces for filtering the matching results. For example,
        [NumericNamespace(name="cost", value_int=5, op="GREATER")] will limit
        the matching results to datapoints with cost greater than 5.
        per_crowding_attribute_neighbor_count (int): Required. The maximum
        number of returned matches with the same crowding tag.

    Returns:
        List[List[aiplatform.matching_engine.matching_engine_index_endpoint.MatchNeighbor]] - A list of nearest neighbors for each query.
    """
    # Initialize the Vertex AI client
    aiplatform.init(project=project, location=location)

    # Create the index endpoint instance from an existing endpoint.
    my_index_endpoint = aiplatform.MatchingEngineIndexEndpoint(
        index_endpoint_name=index_endpoint_name
    )

    # Query the index endpoint for the nearest neighbors.
    return my_index_endpoint.find_neighbors(
        deployed_index_id=deployed_index_id,
        queries=queries,
        num_neighbors=num_neighbors,
        filter=filter,
        numeric_filter=numeric_filter,
        per_crowding_attribute_neighbor_count=per_crowding_attribute_neighbor_count,
    )

パフォーマンスに影響するクエリ時間の設定

ベクトル検索を使用する場合、クエリ時のパラメータはレイテンシ、可用性、コストに影響する可能性があります。このガイダンスはほとんどのケースに適用されます。ただし、必ず構成をテストして、ユースケースに適していることを確認してください。

パラメータの定義については、インデックス構成パラメータをご覧ください。

パラメータ概要パフォーマンスへの影響

パラメータ	概要	パフォーマンスへの影響
`approximateNeighborsCount`	各シャードから取得するおおよその結果数をアルゴリズムに指定します。 `approximateNeighborsCount` の値は常に `setNeighborsCount` の値より大きくする必要があります。`setNeighborsCount` の値が小さい場合は、`approximateNeighborsCount` の値を 10 倍にすることをおすすめします。`setNeighborsCount` 値が大きい場合は、小さい乗数を使用できます。このフィールドに対応する REST API 名は `approximate_neighbor_count` です。	`approximateNeighborsCount` の値を大きくすると、次のようにパフォーマンスに影響する可能性があります。再現率: 増加レイテンシ: 増加する可能性がある可用性: 影響なし費用: 検索時に処理されるデータが増えるため、費用が増加する可能性がある `approximateNeighborsCount` の値を小さくすると、次のようにパフォーマンスに影響する可能性があります。再現率: 減少レイテンシ: 低下する可能性がある可用性: 影響なし費用: 検索時に処理されるデータが少ないため、費用が削減される
`setNeighborCount`	クエリで返す結果の数を指定します。このフィールドに対応する REST API 名は `neighbor_count` です。	300 以下の値は、ほとんどのユースケースでパフォーマンスを維持できます。値が大きい場合は、特定のユースケースでテストします。
`fractionLeafNodesToSearch`	最近傍を検索する際に参照するリーフノードの割合を制御します。これは、リーフノードあたりのエンベディングが多いほど、リーフあたりのデータが多く検査される `leafNodeEmbeddingCount` に関連しています。このフィールドに対応する REST API 名は `fraction_leaf_nodes_to_search_override` です。	`fractionLeafNodesToSearch` の値を大きくすると、次のようにパフォーマンスに影響する可能性があります。再現率: 増加レイテンシ: 増加可用性: 影響なし費用: レイテンシが長くなると、マシンリソースの占有量が増えるため、費用が増加する可能性がある `fractionLeafNodesToSearch` の値を小さくすると、次のようにパフォーマンスに影響する可能性があります。再現率: 減少レイテンシ: 短縮可用性: 影響なし費用: レイテンシが短いほどマシンリソースを占有しなくなるため、費用が削減される

approximateNeighborsCount

各シャードから取得するおおよその結果数をアルゴリズムに指定します。

approximateNeighborsCount の値は常に setNeighborsCount の値より大きくする必要があります。setNeighborsCount の値が小さい場合は、approximateNeighborsCount の値を 10 倍にすることをおすすめします。setNeighborsCount 値が大きい場合は、小さい乗数を使用できます。

このフィールドに対応する REST API 名は approximate_neighbor_count です。

approximateNeighborsCount の値を大きくすると、次のようにパフォーマンスに影響する可能性があります。

再現率: 増加
レイテンシ: 増加する可能性がある
可用性: 影響なし
費用: 検索時に処理されるデータが増えるため、費用が増加する可能性がある

approximateNeighborsCount の値を小さくすると、次のようにパフォーマンスに影響する可能性があります。

再現率: 減少
レイテンシ: 低下する可能性がある
可用性: 影響なし
費用: 検索時に処理されるデータが少ないため、費用が削減される

setNeighborCount

クエリで返す結果の数を指定します。

このフィールドに対応する REST API 名は neighbor_count です。

300 以下の値は、ほとんどのユースケースでパフォーマンスを維持できます。値が大きい場合は、特定のユースケースでテストします。

fractionLeafNodesToSearch

最近傍を検索する際に参照するリーフノードの割合を制御します。これは、リーフノードあたりのエンベディングが多いほど、リーフあたりのデータが多く検査される leafNodeEmbeddingCount に関連しています。

このフィールドに対応する REST API 名は fraction_leaf_nodes_to_search_override です。

fractionLeafNodesToSearch の値を大きくすると、次のようにパフォーマンスに影響する可能性があります。

再現率: 増加
レイテンシ: 増加
可用性: 影響なし
費用: レイテンシが長くなると、マシンリソースの占有量が増えるため、費用が増加する可能性がある

fractionLeafNodesToSearch の値を小さくすると、次のようにパフォーマンスに影響する可能性があります。

再現率: 減少
レイテンシ: 短縮
可用性: 影響なし
費用: レイテンシが短いほどマシンリソースを占有しなくなるため、費用が削減される

次のステップ

インデックスの作成方法、パブリックエンドポイントへのデプロイ方法、クエリの方法に関するエンドツーエンドの例については、公式ノートブック Using Vector Search and Vertex AI Embeddings for Text for StackOverflow Questions をご覧ください。

パブリック インデックスをクエリして最近傍を取得する コレクションでコンテンツを整理 必要に応じて、コンテンツの保存と分類を行います。

パブリック エンドポイントのクエリ例

Vertex AI SDK for Python

コマンドライン

コンソール

ハイブリッド クエリ

Vertex AI SDK for Python

フィルタリングとクラウディングを使用したクエリ

Vertex AI SDK for Python

パフォーマンスに影響するクエリ時間の設定

次のステップ

パブリックインデックスをクエリして最近傍を取得する

パブリックエンドポイントのクエリ例

ハイブリッドクエリ