Person Retrieval
25 papers with code • 1 benchmarks • 2 datasets
Latest papers
Co-Attention Aligned Mutual Cross-Attention for Cloth-Changing Person Re-Identification
In this paper, we first design a novel Shape Semantics Embedding (SSE) module to encode body shape semantic information, which is one of the essential clues to distinguish pedestrians when their clothes change.
Body Part-Based Representation Learning for Occluded Person Re-Identification
Firstly, individual body part appearance is not as discriminative as global appearance (two distinct IDs might have the same local appearance), this means standard ReID training objectives using identity labels are not adapted to local feature learning.
UPAR: Unified Pedestrian Attribute Recognition and Person Retrieval
It is based on four well-known person attribute recognition datasets: PA100K, PETA, RAPv2, and Market1501.
See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval
To explore the fine-grained alignment, we further propose two implicit semantic alignment paradigms: multi-level alignment (MLA) and bidirectional mask modeling (BMM).
Cross-Camera Trajectories Help Person Retrieval in a Camera Network
To address this issue, we propose a pedestrian retrieval framework based on cross-camera trajectory generation, which integrates both temporal and spatial information.
Part-based Pseudo Label Refinement for Unsupervised Person Re-identification
In this paper, we propose a novel Part-based Pseudo Label Refinement (PPLR) framework that reduces the label noise by employing the complementary relationship between global and part features.
DSSL: Deep Surroundings-person Separation Learning for Text-based Person Retrieval
Many previous methods on text-based person retrieval tasks are devoted to learning a latent common space mapping, with the purpose of extracting modality-invariant features from both visual and textual modality.
HAT: Hierarchical Aggregation Transformers for Person Re-identification
In this work, we take advantages of both CNNs and Transformers, and propose a novel learning framework named Hierarchical Aggregation Transformer (HAT) for image-based person Re-ID with high performance.
APES: Audiovisual Person Search in Untrimmed Video
To showcase the potential of our new dataset, we propose an audiovisual baseline and benchmark for person retrieval.
PeR-ViS: Person Retrieval in Video Surveillance using Semantic Description
Instead of using an image query, in this paper, we study the problem of person retrieval in video surveillance with a semantic description.