FAQ
What is shot break detection?
Roughly speaking, a shot break is the change from one camera shot
to another. In the research literature, the definitions change
depending upon the paper. Many researchers use the movie maker
terminology in which a scene is composed of one or more shots.
Why is this useful?
Shot break detection is useful for a wide variety of real world
applications. Video editors and producers usually work in terms
of shots and scenes, not individual frames. This work is useful
not only in finding shots and deleting shots within a current
movie, but also in locating shots and scenes from previous movies.
It also gives more search options to the home user. Suppose you
have recorded six hours of video from cable. This technology gives
you the ability to skip over shots with a button clock.
Why is this important?
Since the information in video databases can be measured
in thousands of gigabytes of uncompressed data, video analysis methods
which ignore the redundancy in video sequences can not be considered
practical methods. Thus, we integrate the video analysis
methods with video compression in order to exploit the redundancy
in video sequences.
What problems are addressed?
Two problems will be addressed: (1) Video analysis, which includes
segmentation, classification, and recognition, and (2) Computational
efficiency through integration with video compression methods.
Regarding the video analysis, a method based on 2D pixel motion flow
fields and the Karhunen-Loeve transform is proposed to determine the
location of scene cuts and the type of camera movement within the
scene. The 2D pixel motion can be found from methods which include
optical flow and correlation. The scene cuts can be detected by
analyzing the displaced frame difference or the smoothness of the pixel
motion field. The Karhunen-Loeve transform was chosen because it
extracts optimal linear features which describe the pixel motion
fields. These optimal linear features are denoted as eigenflow images
since they describe the set of pixel motion flow images which
correspond to a particular camera movement. Regarding face and object
classification, the Karhunen-Loeve transform has been used
successfully for recognition of near frontal views of human faces
|