Pavel Kisilev, Daniel Freedman, et al.
ICPR 2012
In this paper, we describe a novel end-to-end automatic video labeling system, which accepts MPEG-1 sequences as input and generates MPEG-7 XML metadata files based on previously established anchor models. Seven modules were developed for the system: Shot Segmentation, Region Segmentation, Annotation, Feature Extraction, Model Learning, Classification, and XML Rendering. The system was evaluated in the NIST TREC-2002 video concept detection benchmark, where it achieved the best mean average precision among 18 worldwide participants.
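Because the headline result is reported as mean average precision, a small worked sketch of that metric may help. It assumes the common non-interpolated definition (precision averaged over the ranks at which relevant shots appear, then averaged over concepts). The benchmark's exact evaluation protocol is not given here, and the function names, concept names, and relevance lists below are illustrative, not taken from the paper.

```python
from typing import Dict, Optional, Sequence


def average_precision(ranked_relevance: Sequence[bool],
                      total_relevant: Optional[int] = None) -> float:
    """Non-interpolated average precision for one ranked result list.

    ranked_relevance[i] is True if the shot returned at rank i + 1 is relevant.
    total_relevant is the number of relevant shots in the collection; if it is
    omitted, the number of relevant shots found in the list is used instead.
    """
    hits = 0
    precision_sum = 0.0
    for rank, is_relevant in enumerate(ranked_relevance, start=1):
        if is_relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this rank
    denominator = hits if total_relevant is None else total_relevant
    return precision_sum / denominator if denominator else 0.0


def mean_average_precision(runs: Dict[str, Sequence[bool]]) -> float:
    """Mean of the per-concept average precision values."""
    if not runs:
        return 0.0
    return sum(average_precision(r) for r in runs.values()) / len(runs)


# Hypothetical ranked results for two illustrative concepts.
example_runs = {
    "outdoors": [True, False, True, False],
    "face": [False, True, False, False],
}
print(round(mean_average_precision(example_runs), 4))
```

Here the first concept scores (1/1 + 2/3) / 2 and the second scores 1/2, so the mean over the two concepts is 2/3.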
Michelle X. Zhou, Fei Wang, et al.
ICMEW 2013
Sudeep Sarkar, Kim L. Boyer
Computer Vision and Image Understanding
James E. Gentile, Nalini Ratha, et al.
BTAS 2009