Now you are able to feed graphic on the VLM as condition of generations! This is different from image2video in which the graphic grow to be the primary frame in the video. IP2V uses impression like a Component of the prompt, to extract the strategy and style on the picture.This was promptly seen, and also the design and style spread. By the top wit