Rule-based scene extraction from video
2002
Abstract
Abstract Instead of clustering video shots into scenes using low level image features, in this paper, we propose a rule-based model to extract simple dialog or action scenes. Through analyzing video editing rules and observing temporal appearance patterns of shots in dialog scenes of movies, we deduce a set of rules to recognize dialog or action scenes. Based on these rides, a finite state machine is designed to extract dialog or action scenes from videos automatically.
FAQs
AI
What visual patterns aid the extraction of dialog and action scenes?
The research identifies specific visual patterns in dialog scenes, such as actor positioning and camera placement, revealing three primary shot types, labeled A, B, and C.
How does the Finite State Machine model assist in scene extraction?
The proposed Finite State Machine model extracts dialog scenes by processing video shot sequences, effectively identifying regular languages from the temporal patterns of shots.
What editing techniques enhance the construction of dialog scenes?
Dialog scenes typically consist of alternating shots (Type C or patterns AB/BA), ensuring clarity regarding actor involvement from the outset.
What were the precision and recall results for the different movies analyzed?
The model achieved high retrieval precision and recall, notably performing best on 'Patch Adams,' which contains solely dialog scenes.
How does shot length differentiate between dialog and action scenes?
The model differentiates action from dialog scenes by analyzing average shot lengths, with longer shots indicating action sequences.
References (9)
- REFERENCES
- M. M. Yeung and B. L. Yeo, "Time-constrained clus- tering for segmentation of video into story units," in Proceedings of 13th International Conference on Pat- tern Recognition, 1996, pp. 375-380.
- Y. Rui, T. S. Huang, and S. Mehrotra, "Exploring video structure beyond the shots," in Proceedings of IEEE In- ternational Conference on Multimedia Computing and Systems, 1992, pp. 237-240.
- A. Hanjalic, R. L. Lagendijk, and J. Biemond, "Auto- matically segmenting movies into logical story units," in Proceedings of International Conference on Visual Information Systems, 1999, pp. 229-236.
- D. Arijon, Grammar of the Film Language, Focal Press, 1976.
- S. D. Katz, Film Directing Shot by Shot Visualizing From Concept to Screen, Michael Wiese Productions, 1991.
- J. L. Hein, Theory of Computation: An introduction, Jones and Bartlett Publishers, 1996.
- A. Yoshitaka, T. Ishii, and A. Hirakawa, "Content- based retrieval of video data by the grammar of film," in Proceedings of IEEE Symposium on Visual Languages, 1997, pp. 310-317.
- R. Lienhart, S. Pfeiffer, and W. Effelsberg, "Scene de- termination based on video and audio features," in Pro- ceedings of International Conference on Visual Infor- mation Systems, 1999, pp. 685-690.