Weakly supervised learning, Phrase Grounding, Image-Caption Alignment

Align2ground: Weakly supervised phrase grounding guided by image-caption alignment

This paper addresses the problem of grounding free-form textual phrases by using weak supervision from image-caption pairs.