I put this to the test and found it to be true ... With the Gen-3 Alpha model, users can input text or images to produce unique video clips. You can set the image input as the start, middle ...