TY - GEN
T1 - Local and Global Contextual Features Fusion for Pedestrian Intention Prediction
AU - Azarmi, M.
AU - Rezaei, M.
AU - Hussain, T.
AU - Qian, C.
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023/10/5
Y1 - 2023/10/5
N2 - Autonomous vehicles (AVs) are becoming an indispensable part of future transportation. However, safety challenges and lack of reliability limit their real-world deployment. Towards boosting the appearance of AVs on the roads, the interaction of AVs with pedestrians including “prediction of the pedestrian crossing intention” deserves extensive research. This is a highly challenging task as involves multiple non-linear parameters. In this direction, we extract and analyse spatio-temporal visual features of both pedestrian and traffic contexts. The pedestrian features include body pose and local context features that represent the pedestrian’s behaviour. Additionally, to understand the global context, we utilise location, motion, and environmental information using scene parsing technology that represents the pedestrian’s surroundings, and may affect the pedestrian’s intention. Finally, these multi-modality features are intelligently fused for effective intention prediction learning. The experimental results of the proposed model on the JAAD dataset show a superior result on the combined AUC and F1-score compared to the state-of-the-art.
AB - Autonomous vehicles (AVs) are becoming an indispensable part of future transportation. However, safety challenges and lack of reliability limit their real-world deployment. Towards boosting the appearance of AVs on the roads, the interaction of AVs with pedestrians including “prediction of the pedestrian crossing intention” deserves extensive research. This is a highly challenging task as involves multiple non-linear parameters. In this direction, we extract and analyse spatio-temporal visual features of both pedestrian and traffic contexts. The pedestrian features include body pose and local context features that represent the pedestrian’s behaviour. Additionally, to understand the global context, we utilise location, motion, and environmental information using scene parsing technology that represents the pedestrian’s surroundings, and may affect the pedestrian’s intention. Finally, these multi-modality features are intelligently fused for effective intention prediction learning. The experimental results of the proposed model on the JAAD dataset show a superior result on the combined AUC and F1-score compared to the state-of-the-art.
KW - Pedestrian Crossing Intention
KW - Pose Estimation
KW - Semantic Segmentation
KW - Pedestrian Intent Prediction
KW - Autonomous Vehicles
KW - Computer Vision
KW - Human Action Prediction
UR - http://www.scopus.com/inward/record.url?eid=2-s2.0-85174447382&partnerID=MN8TOARS
UR - http://www.scopus.com/inward/record.url?scp=85174447382&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85174447382&partnerID=8YFLogxK
UR - https://www.mendeley.com/catalogue/8c784ce5-a166-3d5b-9c64-5b9eb231bb53/
U2 - 10.1007/978-3-031-43763-2_1
DO - 10.1007/978-3-031-43763-2_1
M3 - Conference proceeding (ISBN)
SN - 9783031437625
T3 - Communications in Computer and Information Science
SP - 1
EP - 13
BT - Artificial Intelligence and Smart Vehicles
A2 - Ghatee, Mehdi
A2 - Hashemi, S. Mehdi
PB - Springer
T2 - ICAISV 2023
Y2 - 24 May 2024 through 25 May 2024
ER -