Now you can feed impression on the VLM as issue of generations! This is different from image2video in which the picture turn out to be the initial frame on the video. IP2V uses picture for a A part of the prompt, to extract the principle and elegance of the graphic. https://henryi219els5.bloggazzo.com/profile