Abstract
This paper presents a novel approach for Fine-Grained Visual Classification (FGVC) by exploring Graph Neural Networks (GNNs) to facilitate high-order feature interactions, with a specific focus on constructing both inter- and intra-region graphs. Unlike previous FGVC techniques that often isolate global and local features, our method combines both features seamlessly during learning via graphs. Inter-region graphs capture long-range dependencies to recognize global patterns, while intra-region graphs delve into finer details within specific regions of an object by exploring high-dimensional convolutional features. A key innovation is the use of shared GNNs with an attention mechanism coupled with the Approximate Personalized Propagation of Neural Predictions (APPNP) message-passing algorithm, enhancing information propagation efficiency for better discriminability and simplifying the model architecture for computational efficiency. Additionally, the introduction of residual connections improves performance and training stability. Comprehensive experiments showcase state-of-the-art results on benchmark FGVC datasets, affirming the efficacy of our approach. This work underscores the potential of GNN in modeling high-level feature interactions, distinguishing it from previous FGVC methods that typically focus on singular aspects of feature representation. Our source code is available at https://github.com/Arindam-1991/I2-HOFI.
Original language | English |
---|---|
Number of pages | 25 |
Journal | International Journal of Computer Vision |
DOIs | |
Publication status | Published - 20 Oct 2024 |
Keywords
- Computer Vision
- Fine-grained visual recognition
- Inter-region and intra-region graphs
- Convolutional neural networks
- Residual graph neural networks
- Graph attention networks
- High-order feature interaction
- Artificial Intelligence
Research Centres
- Centre for Intelligent Visual Computing Research
- Data and Complex Systems Research Centre
- Data Science STEM Research Centre