This paper presents a new approach for classifying individual video frames as being a `cartoon' or a `photographic image'. The task arose from experiments performed at the TREC-2002 video retrieval benchmark: `cartoons' are returned unexpectedly at high ranks even if the query gave only `photographic' image examples. Distinguishing between the two genres has proved difficult because of their large intra-class variation. In addition to image descriptors used in prior cartoon-classification work, we introduce novel descriptors like ones based on the pattern spectrum of parabolic size distributions derived from parabolic granulometries and the complexity of the image signal approximated by its compression ratio. We evaluate the effectiveness of the proposed feature set for classification (using Support Vector Machines) on a large set of keyframes from the TREC-2002 video track collection and a set of web images. The paper reports the identification error rates against the number of images used as training set. The system is compared with one that classifies Web images as photographs or graphics and its superior performance is evident.

IEEE
IEEE International Conference on Multimedia and Expo
Database Architectures

Ianeva, T., de Vries, A., & Röhrig, H. (2003). Detecting cartoons: a case study in automatic video-genre classification. In Proceedings of IEEE International Conference on Multimedia and Expo 2003 (ICME) (pp. 1449–1452). IEEE.