TY - JOUR
T1 - Interweaving Insights: High-Order Feature Interaction for Fine-Grained Visual Recognition
AU - SIKDAR, ARINDAM
AU - LIU, YONGHUAI
AU - Kedarisetty, Siddhardha
AU - Zhao, Yitian
AU - AHMED, AMR
AU - BEHERA, ARDHENDU
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/10/20
Y1 - 2024/10/20
N2 - This paper presents a novel approach for Fine-Grained Visual Classification (FGVC) by exploring Graph Neural Networks (GNNs) to facilitate high-order feature interactions, with a specific focus on constructing both inter- and intra-region graphs. Unlike previous FGVC techniques that often isolate global and local features, our method combines both features seamlessly during learning via graphs. Inter-region graphs capture long-range dependencies to recognize global patterns, while intra-region graphs delve into finer details within specific regions of an object by exploring high-dimensional convolutional features. A key innovation is the use of shared GNNs with an attention mechanism coupled with the Approximate Personalized Propagation of Neural Predictions (APPNP) message-passing algorithm, enhancing information propagation efficiency for better discriminability and simplifying the model architecture for computational efficiency. Additionally, the introduction of residual connections improves performance and training stability. Comprehensive experiments showcase state-of-the-art results on benchmark FGVC datasets, affirming the efficacy of our approach. This work underscores the potential of GNN in modeling high-level feature interactions, distinguishing it from previous FGVC methods that typically focus on singular aspects of feature representation. Our source code is available at https://github.com/Arindam-1991/I2-HOFI.
AB - This paper presents a novel approach for Fine-Grained Visual Classification (FGVC) by exploring Graph Neural Networks (GNNs) to facilitate high-order feature interactions, with a specific focus on constructing both inter- and intra-region graphs. Unlike previous FGVC techniques that often isolate global and local features, our method combines both features seamlessly during learning via graphs. Inter-region graphs capture long-range dependencies to recognize global patterns, while intra-region graphs delve into finer details within specific regions of an object by exploring high-dimensional convolutional features. A key innovation is the use of shared GNNs with an attention mechanism coupled with the Approximate Personalized Propagation of Neural Predictions (APPNP) message-passing algorithm, enhancing information propagation efficiency for better discriminability and simplifying the model architecture for computational efficiency. Additionally, the introduction of residual connections improves performance and training stability. Comprehensive experiments showcase state-of-the-art results on benchmark FGVC datasets, affirming the efficacy of our approach. This work underscores the potential of GNN in modeling high-level feature interactions, distinguishing it from previous FGVC methods that typically focus on singular aspects of feature representation. Our source code is available at https://github.com/Arindam-1991/I2-HOFI.
KW - Computer Vision
KW - Fine-grained visual recognition
KW - Inter-region and intra-region graphs
KW - Convolutional neural networks
KW - Residual graph neural networks
KW - Graph attention networks
KW - High-order feature interaction
KW - Artificial Intelligence
UR - http://www.scopus.com/inward/record.url?scp=85206975550&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85206975550&partnerID=8YFLogxK
U2 - 10.1007/s11263-024-02260-y
DO - 10.1007/s11263-024-02260-y
M3 - Article (journal)
SN - 0920-5691
JO - International Journal of Computer Vision
JF - International Journal of Computer Vision
ER -