Abstract
Multiple people can have similar appearances and portions of images can be occluded or have viewpoint changes in real scenarios, causing the increased difficulty of person reidentification (Re-ID). To address these problems, we propose
a dual-channel person Re-ID algorithm that integrates person Re-ID image-text pairs into the classification network for end-to end learning. We construct an image channel and a text channel, and subsequently extract visual information and text information using a convolutional neural network and a simple recurrent units network, respectively. The text information is used to assist in the learning of visual information, consequently improving
the robustness of the visual information. In addition, the visual features are divided into two branches to calculate the global and local features. Global features focus on the overall appearance of a person, whereas local features provide more fine-grained detail. Text information is more accurate and reliable, and it is thus more robust to occlusion and viewpoint changes. Visual information complemented by text information can describe a person more
accurately and reliably. Extensive experiments demonstrate our method achieves state-of-the-art performance
a dual-channel person Re-ID algorithm that integrates person Re-ID image-text pairs into the classification network for end-to end learning. We construct an image channel and a text channel, and subsequently extract visual information and text information using a convolutional neural network and a simple recurrent units network, respectively. The text information is used to assist in the learning of visual information, consequently improving
the robustness of the visual information. In addition, the visual features are divided into two branches to calculate the global and local features. Global features focus on the overall appearance of a person, whereas local features provide more fine-grained detail. Text information is more accurate and reliable, and it is thus more robust to occlusion and viewpoint changes. Visual information complemented by text information can describe a person more
accurately and reliably. Extensive experiments demonstrate our method achieves state-of-the-art performance
Original language | American English |
---|---|
Article number | 2513216 |
Pages (from-to) | 1-16 |
Number of pages | 16 |
Journal | IEEE Transactions on Instrumentation and Measurement |
Volume | 72 |
Early online date | 21 Apr 2023 |
DOIs | |
Publication status | Published - 18 May 2023 |
Keywords
- Person re-identification
- Text feature
- Global and local visual features
- Convolutional neural network
- Recurrent units network
- person re-identification (Re-ID)
- text feature
- recurrent units network
- global and local visual features
- Convolutional neural network (CNN)
Research Centres
- Data and Complex Systems Research Centre