Abstract
This paper presents a novel algorithm for anchor shot detection (ASD). ASD is a fundamental step in segmenting news video into stories, which is in turn a key issue in managing news-based digital libraries efficiently.
The proposed algorithm builds a set of audio/video templates of anchorperson shots in an unsupervised way, then classifies shots by comparing them to the templates. Audio similarity is evaluated by means of a new index and helps to achieve better performance than a video-only approach. The method was tested on a large database and compared with other state-of-the-art algorithms, and proved effective relative to them.
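The two-stage idea described above (build templates without labels, then classify shots by comparison with them) can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify the features, the audio similarity index, or the grouping rule. The sketch assumes that each shot is already summarised by a video feature vector and an audio feature vector, that anchor shots recur with near-identical features, and that a weighted sum of Euclidean distances stands in for the paper's similarity measure. All function names, parameters, and values (`shot_distance`, `build_templates`, `classify`, `radius`, `min_members`, `threshold`, `video_weight`) are hypothetical.

```python
from math import dist

# A shot is a pair (video_features, audio_features), each a tuple of floats.
# ASSUMPTION: this representation and the distance below are illustrative only.

def shot_distance(a, b, video_weight=0.5):
    """Weighted sum of Euclidean distances on the video and audio features."""
    video = dist(a[0], b[0])
    audio = dist(a[1], b[1])
    return video_weight * video + (1.0 - video_weight) * audio

def mean_vector(vectors):
    """Component-wise mean of equally sized tuples."""
    return tuple(sum(column) / len(vectors) for column in zip(*vectors))

def build_templates(shots, radius, min_members):
    """Greedy unsupervised grouping (a stand-in for the paper's template creation).

    A shot joins the first group whose first member lies within `radius`;
    otherwise it starts a new group. Groups with at least `min_members`
    shots are assumed to be recurring anchor shots and are averaged
    into one template each.
    """
    groups = []
    for shot in shots:
        for group in groups:
            if shot_distance(shot, group[0]) <= radius:
                group.append(shot)
                break
        else:
            groups.append([shot])
    return [
        (mean_vector([s[0] for s in g]), mean_vector([s[1] for s in g]))
        for g in groups
        if len(g) >= min_members
    ]

def classify(shots, templates, threshold):
    """Label a shot as an anchor shot if it lies within `threshold` of any template."""
    return [
        any(shot_distance(shot, t) <= threshold for t in templates)
        for shot in shots
    ]

# Toy data: shots 0, 2 and 4 share nearly identical features (the "anchor").
shots = [
    ((0.0, 0.0), (1.0,)),
    ((5.0, 5.0), (9.0,)),
    ((0.1, 0.0), (1.1,)),
    ((9.0, 1.0), (4.0,)),
    ((0.0, 0.1), (0.9,)),
]
templates = build_templates(shots, radius=0.5, min_members=2)
labels = classify(shots, templates, threshold=0.5)
print(labels)  # [True, False, True, False, True]
```

Setting `video_weight=1.0` ignores the audio features, which gives a simple way to compare a video-only variant with the combined one on the same data.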
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
D’Anna, L., Marrazzo, G., Percannella, G., Sansone, C., Vento, M. (2006). A Multi-stage Approach for Anchor Shot Detection. In: Yeung, DY., Kwok, J.T., Fred, A., Roli, F., de Ridder, D. (eds) Structural, Syntactic, and Statistical Pattern Recognition. SSPR /SPR 2006. Lecture Notes in Computer Science, vol 4109. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11815921_85
DOI: https://doi.org/10.1007/11815921_85
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37236-3
Online ISBN: 978-3-540-37241-7
eBook Packages: Computer Science (R0)