Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models

标题
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
作者
关键词
Computer vision, Language, Region phrase correspondence, Datasets, Crowdsourcing
出版物
INTERNATIONAL JOURNAL OF COMPUTER VISION
Volume 123, Issue 1, Pages 74-93
出版商
Springer Nature
发表日期
2016-10-22
DOI
10.1007/s11263-016-0965-7

向作者/读者发起求助以获取更多资源

Reprint

联系作者

Create your own webinar

Interested in hosting your own webinar? Check the schedule and propose your idea to the Peeref Content Team.

Create Now

Ask a Question. Answer a Question.

Quickly pose questions to the entire community. Debate answers and get clarity on the most important issues facing researchers.

Get Started