Gabi Dobocan
blog.telepat.io · Nov 17, 2024
StoryTeller: Revolutionizing Long Video Descriptions
Arxiv: https://arxiv.org/abs/2411.07076v1
PDF: https://arxiv.org/pdf/2411.07076v1.pdf
Authors: Ruicheng Le, Yuchen Zhang, Hanchong Zhang, Jianchao Wu, Yuan Lin, Yichen He
Published: 2024-11-11
Understanding videos, especially long-form content, is ...audio-visual processing