Phrase Extraction and Grounding (PEG)

1 paper with code • 0 benchmarks • 0 datasets

PEG requires a model to simultaneously extract phrases from text and locate the objects those phrases refer to in an image.
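
To make the task's output concrete, the sketch below shows one possible way to represent a PEG prediction: a phrase span in the caption paired with a bounding box in the image. The class name `GroundedPhrase`, the character-offset convention, the pixel box format, and all example values are assumptions made for illustration; they are not taken from any dataset or model.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class GroundedPhrase:
    """Hypothetical container for one PEG prediction (illustrative only)."""
    start: int  # character offset where the phrase begins (inclusive)
    end: int    # character offset where the phrase ends (exclusive)
    box: Tuple[float, float, float, float]  # assumed (x_min, y_min, x_max, y_max) in pixels

    def text(self, caption: str) -> str:
        # Recover the phrase string from the caption using the span.
        return caption[self.start:self.end]


# Made-up caption and boxes, used only to show the shape of the output.
caption = "A man rides a brown horse"
predictions: List[GroundedPhrase] = [
    GroundedPhrase(0, 5, (34.0, 20.0, 180.0, 310.0)),
    GroundedPhrase(12, 25, (10.0, 120.0, 400.0, 470.0)),
]

phrases = [p.text(caption) for p in predictions]
print(phrases)
```

Storing spans rather than phrase strings keeps each prediction tied to its exact position in the input text, which matters when the same words occur more than once in a caption.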

Most implemented papers

DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding

idea-research/dq-detr • 28 Nov 2022

As phrase extraction can be regarded as a 1D text segmentation problem, we formulate PEG as a dual detection problem and propose a novel DQ-DETR model, which introduces dual queries to probe different features from image and text for object prediction and phrase mask prediction.
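
The "1D text segmentation" view can be illustrated with a minimal sketch: given per-token scores forming a phrase mask, contiguous runs of high-scoring tokens are read off as phrase spans. This is not DQ-DETR's actual decoding procedure; the function `mask_to_spans`, the 0.5 threshold, and the example scores are all assumptions chosen for illustration.

```python
from typing import List, Tuple


def mask_to_spans(scores: List[float], threshold: float = 0.5) -> List[Tuple[int, int]]:
    """Turn a 1D token mask into (start, end) token spans, end exclusive.

    Hypothetical helper: thresholds each score and groups adjacent
    above-threshold tokens into one span.
    """
    spans: List[Tuple[int, int]] = []
    start = None
    for i, score in enumerate(scores):
        if score >= threshold:
            if start is None:
                start = i  # a new span opens here
        elif start is not None:
            spans.append((start, i))  # the open span closes before token i
            start = None
    if start is not None:
        spans.append((start, len(scores)))  # span runs to the end of the text
    return spans


# Made-up tokens and mask scores, as if produced for a single query.
tokens = ["A", "man", "rides", "a", "brown", "horse"]
scores = [0.1, 0.2, 0.1, 0.9, 0.8, 0.95]

spans = mask_to_spans(scores)
phrases = [" ".join(tokens[a:b]) for a, b in spans]
print(spans, phrases)
```

Treating the mask as a segmentation over token positions mirrors how a box is a region over pixel positions, which is the analogy the dual detection formulation relies on.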