Skip to content

Conversation

@popovaan
Copy link
Contributor

@popovaan popovaan commented Nov 19, 2025

Description

Python and C++ samples for VLM video input.

CVS-175408

Checklist:

  • Tests have been updated or added to cover the new code.
  • This patch fully addresses the ticket. - This PR doesn't cover C sample, need a separate ticket for it.
  • I have made corresponding changes to the documentation.

@Wovchena Wovchena requested a review from Copilot November 20, 2025 06:29
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR adds a Python sample demonstrating video-to-text functionality for Vision Language Models (VLMs). The sample enables users to input video files and interact with VLMs through a chat interface.

  • Adds new video_to_text_chat.py sample for VLM video input processing
  • Updates test configuration to include a tiny random LLaVA-NeXT-Video model and sample video file
  • Updates documentation to describe the new video-to-text sample usage

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.

File Description
tests/python_tests/samples/conftest.py Adds test model configuration for LLaVA-NeXT-Video and sample video file resource
samples/python/visual_language_chat/video_to_text_chat.py New sample implementing video-to-text chat functionality using VLM pipeline
samples/python/visual_language_chat/README.md Updates documentation to describe the new video-to-text sample and its usage
samples/deployment-requirements.txt Adds opencv-python dependency required for video processing

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@github-actions github-actions bot added the category: cmake / build Cmake scripts label Nov 24, 2025
@github-actions github-actions bot added the category: GHA CI based on Github actions label Nov 24, 2025
@popovaan popovaan changed the title Python sample for VLM video input. Samples for VLM video input. Nov 24, 2025
@github-actions github-actions bot removed the category: GHA CI based on Github actions label Nov 24, 2025
@popovaan popovaan requested a review from Wovchena November 25, 2025 15:55
@popovaan popovaan marked this pull request as ready for review November 25, 2025 15:55
@github-actions github-actions bot added the category: GHA CI based on Github actions label Nov 26, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants