TY - JOUR
T1 - Visual Appearance and Soft Biometrics Fusion for Person Re-Identification Using Deep Learning
AU - Khan, Samee Ullah
AU - Khan, Noman
AU - Hussain, Tanveer
AU - Muhammad, Khan
AU - Hijji, Mohammad
AU - Ser, Javier Del
AU - Baik, Sung Wook
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023/3/22
Y1 - 2023/3/22
N2 - Learning descriptions of individual pedestrian is a common goal of both person re-identification (P-ReID) and attribute recognition methods, which are typically differentiated only in terms of their granularity. However, existing P-ReID methods only consider identification labels for individual pedestrian. In this article, we present a multi-scale pyramid attention ( MSPA ) model for P-ReID that jointly manipulates the complementarity between semantic attributes and visual appearance to address this limitation. The proposed MSPA method mainly comprises three steps. Initially, a backbone model followed by appearance and attribute networks is individually trained to perform P-ReID and pedestrian attribute classification tasks. The attribute network primarily focuses on suppressed image areas associated with soft biometric data while retaining the semantic context among attributes using a convolutional long short-term memory architecture. Additionally, the identification network extracts rich contextual features from an image at varying scales using a residual pyramid module. In the second step, the dual network features are fused, and MSPA is re-trained for the P-ReID task to further improve its complementary capabilities. Finally, we experimentally evaluated the proposed model on the two benchmark datasets Market-1501 and DukeMTMC-reID, and the results show that our approach achieved state-of-the-art performance.
AB - Learning descriptions of individual pedestrian is a common goal of both person re-identification (P-ReID) and attribute recognition methods, which are typically differentiated only in terms of their granularity. However, existing P-ReID methods only consider identification labels for individual pedestrian. In this article, we present a multi-scale pyramid attention ( MSPA ) model for P-ReID that jointly manipulates the complementarity between semantic attributes and visual appearance to address this limitation. The proposed MSPA method mainly comprises three steps. Initially, a backbone model followed by appearance and attribute networks is individually trained to perform P-ReID and pedestrian attribute classification tasks. The attribute network primarily focuses on suppressed image areas associated with soft biometric data while retaining the semantic context among attributes using a convolutional long short-term memory architecture. Additionally, the identification network extracts rich contextual features from an image at varying scales using a residual pyramid module. In the second step, the dual network features are fused, and MSPA is re-trained for the P-ReID task to further improve its complementary capabilities. Finally, we experimentally evaluated the proposed model on the two benchmark datasets Market-1501 and DukeMTMC-reID, and the results show that our approach achieved state-of-the-art performance.
KW - Deep learning
KW - multi-view surveillance data
KW - person re-identification
KW - soft biometric
UR - https://doi.org/10.1109/JSTSP.2023.3260627
UR - http://www.scopus.com/inward/record.url?scp=85151496540&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85151496540&partnerID=8YFLogxK
UR - https://www.mendeley.com/catalogue/6109d075-099b-37a0-a4e3-e35a64855cb0/
U2 - 10.1109/JSTSP.2023.3260627
DO - 10.1109/JSTSP.2023.3260627
M3 - Article (journal)
SN - 1941-0484
VL - 17
SP - 575
EP - 586
JO - IEEE Journal of Selected Topics in Signal Processing
JF - IEEE Journal of Selected Topics in Signal Processing
IS - 3
ER -