Rule-based scene extraction from video

M. Tamer Özsu

Outline

Rule-based scene extraction from video

M. Tamer Özsu

2002

Abstract

Abstract Instead of clustering video shots into scenes using low level image features, in this paper, we propose a rule-based model to extract simple dialog or action scenes. Through analyzing video editing rules and observing temporal appearance patterns of shots in dialog scenes of movies, we deduce a set of rules to recognize dialog or action scenes. Based on these rides, a finite state machine is designed to extract dialog or action scenes from videos automatically.

FAQs

What visual patterns aid the extraction of dialog and action scenes?add

The research identifies specific visual patterns in dialog scenes, such as actor positioning and camera placement, revealing three primary shot types, labeled A, B, and C.

How does the Finite State Machine model assist in scene extraction?add

The proposed Finite State Machine model extracts dialog scenes by processing video shot sequences, effectively identifying regular languages from the temporal patterns of shots.

What editing techniques enhance the construction of dialog scenes?add

Dialog scenes typically consist of alternating shots (Type C or patterns AB/BA), ensuring clarity regarding actor involvement from the outset.

What were the precision and recall results for the different movies analyzed?add

The model achieved high retrieval precision and recall, notably performing best on 'Patch Adams,' which contains solely dialog scenes.

How does shot length differentiate between dialog and action scenes?add

The model differentiates action from dialog scenes by analyzing average shot lengths, with longer shots indicating action sequences.

Figures (5)

Fig. 1. A FSM extracts VSS of dialog scenes between actor a and actor b

Table 4. Dialog scenes extracted by the FSM Table 3. The experiment data

a common actor and overlapping durations. Table 6 shows the performance of this approach in detecting multi-actor dialog scenes in the three movies under consideration.

Table 5. Action scenes extracted by the FSM

Table 6. The detected group conversion scenes

References (9)

REFERENCES
M. M. Yeung and B. L. Yeo, "Time-constrained clus- tering for segmentation of video into story units," in Proceedings of 13th International Conference on Pat- tern Recognition, 1996, pp. 375-380.
Y. Rui, T. S. Huang, and S. Mehrotra, "Exploring video structure beyond the shots," in Proceedings of IEEE In- ternational Conference on Multimedia Computing and Systems, 1992, pp. 237-240.
A. Hanjalic, R. L. Lagendijk, and J. Biemond, "Auto- matically segmenting movies into logical story units," in Proceedings of International Conference on Visual Information Systems, 1999, pp. 229-236.
D. Arijon, Grammar of the Film Language, Focal Press, 1976.
S. D. Katz, Film Directing Shot by Shot Visualizing From Concept to Screen, Michael Wiese Productions, 1991.
J. L. Hein, Theory of Computation: An introduction, Jones and Bartlett Publishers, 1996.
A. Yoshitaka, T. Ishii, and A. Hirakawa, "Content- based retrieval of video data by the grammar of film," in Proceedings of IEEE Symposium on Visual Languages, 1997, pp. 310-317.
R. Lienhart, S. Pfeiffer, and W. Effelsberg, "Scene de- termination based on video and audio features," in Pro- ceedings of International Conference on Visual Infor- mation Systems, 1999, pp. 685-690.

Rule-based scene extraction from video

Sign up for access to the world's latest research

Abstract

FAQs

Related papers

References (9)

Related papers