Index95タイトルA multimodal interpretable visual question answering model introducing image caption processor出典2022 IEEE 11th Global Conference on Consumer Electronics (GCCE 2022), pp.805-806