DJ's Little Website

Multimodal Commonsense Knowledge Distillation for Visual Question Answering (Student Abstract)

Paper title: Multimodal Commonsense Knowledge Distillation for Visual Question Answering (Student Abstract). Published in: 2025 AAAI Conference on Artificial Intelligen

dj-admin, published on 2025-06-18

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper title: Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models. Published in: 2025 The IEEE / CVF Computer Vision and Pattern Recog

dj-admin, published on 2025-05-16

Grounded Chain-of-Thought for Multimodal Large Language Models

Paper title: Grounded Chain-of-Thought for Multimodal Large Language Models. Published in: 2025 The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR)

dj-admin, published on 2025-05-16