Demo: Generate multimodal content

This demo shows you how to generate multimodal content by passing multimodal input, such as text and an image to a Gemini model, by using the Vertex AI SDK for ABAP.

Before you begin

Before you run this demo, make sure that you or your administrators have completed the following prerequisites:

Generate multimodal content

To generate multimodal content, perform the following steps:

  1. In SAP GUI, execute the transaction code /GOOG/SDK_IMG.

    Alternatively, execute the transaction code SPRO, and then click SAP Reference IMG.

  2. Go to ABAP SDK for Google Cloud > Demos.

  3. Open Vertex AI SDK: Demo: Invoke Text and Multimodal Gemini Models.

  4. Select Generate multimodal content.

  5. In the Client key field, enter the client key for authentication.

  6. In the Model key field, enter the model key, which is configured in the model generation parameters.

  7. Specify values for the following fields as appropriate:

    • MIME Type: Specify the IANA standard MIME type of the source data. The default value is application/pdf.
    • GCS URI of file: To add an image or other media to your request, specify the URI of the file stored in a Cloud Storage bucket.
    • Upload media: To add an image or other media to your request, upload the file from your local system.

    You can either specify the URI of a file stored in a Cloud Storage bucket or upload a file from your local system.

  8. In the Enter system instructions field, provide system instructions.

  9. In the Enter prompt field, provide your input prompt.

  10. Run the program. The Response from LLM area shows the generated multimodal content.