Abstract
Multi-view clustering aims to learn discriminative
representations from multi-view data. Although existing methods
show impressive performance by leveraging contrastive learning
to tackle the representation gap between every two views,
they share the common limitation of not performing semantic
alignment from a global perspective, resulting in the undermining
of semantic patterns in multi-view data. This paper presents
CSOT, namely Common Semantics via Optimal Transport, to
boost contrastive multi-view clustering via semantic learning in
a common space that integrates all views. Through optimal
transport, the samples in multiple views are mapped to the
joint clusters which represent the multi-view semantic patterns
in the common space. With the semantic assignment derived
from the optimal transport plan, we design a semantic learning
module where the soft assignment vector works as a global
supervision to enforce the model to learn consistent semantics
among all views. Moreover, we propose a semantic-aware reweighting strategy to treat samples differently according to their
semantic significance, which improves the effectiveness of crossview contrastive representation learning. Extensive experimental
results demonstrate that CSOT achieves the state-of-the-art
clustering performance.
representations from multi-view data. Although existing methods
show impressive performance by leveraging contrastive learning
to tackle the representation gap between every two views,
they share the common limitation of not performing semantic
alignment from a global perspective, resulting in the undermining
of semantic patterns in multi-view data. This paper presents
CSOT, namely Common Semantics via Optimal Transport, to
boost contrastive multi-view clustering via semantic learning in
a common space that integrates all views. Through optimal
transport, the samples in multiple views are mapped to the
joint clusters which represent the multi-view semantic patterns
in the common space. With the semantic assignment derived
from the optimal transport plan, we design a semantic learning
module where the soft assignment vector works as a global
supervision to enforce the model to learn consistent semantics
among all views. Moreover, we propose a semantic-aware reweighting strategy to treat samples differently according to their
semantic significance, which improves the effectiveness of crossview contrastive representation learning. Extensive experimental
results demonstrate that CSOT achieves the state-of-the-art
clustering performance.
Original language | English |
---|---|
Journal | IEEE Transactions on Image Processing |
Publication status | Accepted/In press - 24 Jul 2024 |
Keywords
- Multi-view clustering
- semantic alignment
- optimal transport
- contrastive learning
Research Centres
- Centre for Intelligent Visual Computing Research
- Data and Complex Systems Research Centre