Starting April 29, 2025, Gemini 1.5 Pro and Gemini 1.5 Flash models are not available in projects that have no prior usage of these models, including new projects. For details, see Model versions and lifecycle.
Stay organized with collections
Save and categorize content based on your preferences.
Extract a specific retail product with the reference image in the video with timestamps.
You can query a model directly and test the results returned when using
different parameter values with the Cloud console, or by calling the
Vertex AI API directly.
You are a video data analyst with expertise in commercial products. Please do not hallucinate. You can just output nothing if there are no positive findings. Do not output findings for products not matching the reference image.
Task description: You are given a retail product's image and a video to label. Your task is to examine every video frame and find out all the occurrences of this reference product in each frame. Explain your finding for each occurrence.
Output specification: You should provide the output in a strictly valid JSON format same as the following example.
[{"timestamp":"Timestamp of the event in mm:ss format.","reason":"How does the product appear in the video frame."},{"timestamp":"Timestamp of the event in mm:ss format.","reason":"How does the product appear in the video frame."},]
Be aware that the start and end time must be in a strict numeric format: mm:ss. Do not output anything after the JSON content. If there are no matching logos, you should not output anything. Please do not hallucinate.
Your answer (as a JSON LIST):
Response
[{"timestamp":"00:05","reason":"The product is shown in a partial, out-of-focus view."},{"timestamp":"00:08","reason":"The product is shown in a partial, out-of-focus view."},{"timestamp":"00:13","reason":"The product is shown in a partial, out-of-focus view."},{"timestamp":"00:16","reason":"The product is shown in a partial, out-of-focus view."},{"timestamp":"00:18","reason":"The product is shown in a partial, out-of-focus view."},{"timestamp":"00:29","reason":"The product is featured in a close-up shot, showcasing its design and functionality."},{"timestamp":"00:33","reason":"The product is featured in a close-up shot, showcasing its design and functionality."},{"timestamp":"00:36","reason":"The product is featured in a close-up shot, showcasing its design and functionality."},{"timestamp":"00:43","reason":"The product is featured in a close-up shot, showcasing its design and functionality."}]
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-09-18 UTC."],[],[],null,[]]