In this paper, we introduce a novel framework for automatic Semantic Video Annotation. As this framework detects possible events occurring in video clips, it forms the annotating base of a video search engine. To achieve this purpose, the system has to able to operate on uncontrolled wide-domain videos. Thus, all layers have to be based on generic features. The aim is to help bridge the “semantic gap“, which is the difference between the low-level visual features and the human's perception, by finding videos with similar visual events, then analyzing their free text annotation to find the best description for this new video using commonsense knowledgebases. Experiments were performed on wide-domain video clips from the TRECVID 2005 BBC rush standard database. Results from these experiments show promising integrity between those two layers in order to find expressing annotations for the input video. These results were evaluated based on retrieval performance.
|Title of host publication||ICSIPA09 - 2009 IEEE International Conference on Signal and Image Processing Applications, Conference Proceedings|
|Publication status||Published - 19 Nov 2009|
|Event||2009 IEEE International conference on Signal and Image Processing Applications - Kuala Lumpur, Malaysia|
Duration: 18 Nov 2009 → 19 Nov 2009
|Conference||2009 IEEE International conference on Signal and Image Processing Applications|
|Period||18/11/09 → 19/11/09|