Multimodal grid features and cell pointers for scene text visual question answering

标题
Multimodal grid features and cell pointers for scene text visual question answering
作者
关键词
Deep learning, Scene text, Visual question answering, Multi-modal learning, MSC, 41A05, 41A10, 65D05, 65D17
出版物
PATTERN RECOGNITION LETTERS
Volume 150, Issue -, Pages 242-249
出版商
Elsevier BV
发表日期
2021-07-20
DOI
10.1016/j.patrec.2021.06.026

向作者/读者发起求助以获取更多资源

Reprint

联系作者

Discover Peeref hubs

Discuss science. Find collaborators. Network.

Join a conversation

Find the ideal target journal for your manuscript

Explore over 38,000 international journals covering a vast array of academic fields.

Search