Title
Multimodal grid features and cell pointers for scene text visual question answering
Authors
Keywords
Deep learning, Scene text, Visual question answering, Multi-modal learning; MSC: 41A05, 41A10, 65D05, 65D17
Journal
Pattern Recognition Letters
Volume 150, Pages 242-249
Publisher
Elsevier BV
Online
2021-07-20
DOI
10.1016/j.patrec.2021.06.026
