Abstract
We introduce a novel approach for traffic scene analysis in driving videos by exploring spatio-temporal relationships captured by a temporal frame-to-frame (f2f) bipartite graph, eliminating the need for complex image-level high-dimensional feature extraction. Instead, we rely on object detectors that provide bounding box information. The proposed graph approach efficiently connects objects across frames where nodes represent essential object attributes, and edges signify interactions based on simple spatial metrics such as distance and angles between objects. A key innovation is the integration of dynamic edge attributes, computed using Multilayer Perceptrons (MLP) by exploring this spatial metric. These attributes enhance our Interaction-aware Graph Neural Networks (IA-GNNs) framework by adapting the PageRank-driven approximate personalized propagation of neural predictions (APPNP) scheme and graph attention mechanism in a novel way. This has significantly improved our model’s
ability to understand spatio-temporal interactions of multiple objects in traffic scenarios. We have rigorously evaluated our approach on two benchmark datasets, METEOR and INTERACTION, demonstrating its accuracy in analyzing traffic scenarios. This streamlined, graph-based strategy marks a significant shift towards more efficient and insightful traffic scene analysis using video data. Our source code is available at: https://github.com/Addy-1998/Bip_DTG.
ability to understand spatio-temporal interactions of multiple objects in traffic scenarios. We have rigorously evaluated our approach on two benchmark datasets, METEOR and INTERACTION, demonstrating its accuracy in analyzing traffic scenarios. This streamlined, graph-based strategy marks a significant shift towards more efficient and insightful traffic scene analysis using video data. Our source code is available at: https://github.com/Addy-1998/Bip_DTG.
Original language | English |
---|---|
Title of host publication | 2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings |
Publisher | IEEE Computer Society |
Pages | 908-914 |
Number of pages | 7 |
ISBN (Electronic) | 9798350349399 |
ISBN (Print) | 9798350349399 |
DOIs | |
Publication status | Published - 30 Oct 2024 |
Event | 31st IEEE International Conference on Image Processing, ICIP 2024 - Abu Dhabi, United Arab Emirates Duration: 27 Oct 2024 → 30 Oct 2024 |
Publication series
Name | Proceedings - International Conference on Image Processing, ICIP |
---|---|
ISSN (Print) | 1522-4880 |
Conference
Conference | 31st IEEE International Conference on Image Processing, ICIP 2024 |
---|---|
Country/Territory | United Arab Emirates |
City | Abu Dhabi |
Period | 27/10/24 → 30/10/24 |
Keywords
- bipartite graphs
- graph attention
- graph neural networks
- knowledge representation
- relational learning
- spatio-temporal relationships
- traffic scene analysis