Gabi Dobocanblog.telepat.io·Nov 24, 2024Unpacking Multimodal Language Models in VQA: Llava’s InterpretabilityArxiv: https://arxiv.org/abs/2411.10950v1 PDF: https://arxiv.org/pdf/2411.10950v1.pdf Authors: Sophia Ananiadou, Zeping Yu Published: 2024-11-17 Understanding Llava's Contribution to Visual Question Answering The paper, "Understanding Multimodal LLM...CLIP Add a thoughtful commentNo comments yetBe the first to start the conversation.