Image and Vision Computing
Multi-modal spatial relational attention networks for visual question answering
Publication date: December 2023
Source: Image and Vision Computing, Volume 140
Author(s): Haibo Yao, Lipeng Wang, Chengtao Cai, Yuxin Sun, Zhi Zhang, Yongkang Luo