Compared with the general Visual Question Answering (VQA), Medical VQA is more challenging. Medical images contain more complex information than general images. Aiming at this point, we propose the IIF module that can improve the model's ability to obtain visual feature. In addition, we design QAM to help the model analyze the question better. On the VQA-RAD dataset, the accuracy of our model improved to 66.4% on the opened-ended questions and 80.1% on the closed-ended questions, outperforming other relevant models. The results on the VQA-MED 2019 dataset also verify the effectiveness of our model.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.