Using audio time scale modification for video browsing
A. Amir, D. Ponceleon, et al.
HICSS 2000
This paper describes a new representation for the audio and visual information in a video signal. We reduce the dimensionality of the signals with singular-value decomposition (SVD) or mel-frequency cepstral coefficients (MFCC). We apply these transforms to word (word transcript, semantic space or latent semantic indexing), image (color histogram data) and audio (timbre) data. Using scale-space techniques, we find large jumps in a video's path through the reduced space, which are evidence for events. We use these techniques to analyze the temporal properties of the audio and image data in a video. This analysis creates a hierarchical segmentation of the video, or a table of contents, from both the audio and the image data.
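The jump-finding step can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`gaussian_kernel`, `smooth_path`, `largest_jump`) and the toy data are hypothetical, the path is assumed to be already reduced to a few dimensions (the SVD or MFCC step is omitted), and Gaussian smoothing with scale `sigma` stands in for the scale-space analysis.

```python
import math

def gaussian_kernel(sigma):
    """Normalized Gaussian weights covering about +/- 3 sigma."""
    radius = max(1, int(3 * sigma))
    weights = [math.exp(-0.5 * (k / sigma) ** 2) for k in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def smooth_path(path, sigma):
    """Smooth each coordinate of a path over time at scale sigma."""
    kernel = gaussian_kernel(sigma)
    radius = len(kernel) // 2
    n, dims = len(path), len(path[0])
    smoothed = []
    for t in range(n):
        point = [0.0] * dims
        for k, w in enumerate(kernel):
            src = min(max(t + k - radius, 0), n - 1)  # clamp at the ends
            for d in range(dims):
                point[d] += w * path[src][d]
        smoothed.append(point)
    return smoothed

def largest_jump(path, sigma):
    """Index t of the largest step between frames t and t + 1 after smoothing."""
    smoothed = smooth_path(path, sigma)
    jumps = [math.dist(smoothed[t], smoothed[t + 1]) for t in range(len(smoothed) - 1)]
    return max(range(len(jumps)), key=jumps.__getitem__)

# Toy path (assumed data): 20 frames near one point, then 20 frames near another.
path = [[0.0, 0.0]] * 20 + [[5.0, 5.0]] * 20

# The candidate event boundary is found at a fine and at a coarse scale.
fine = largest_jump(path, sigma=2.0)
coarse = largest_jump(path, sigma=4.0)
print(fine, coarse)
```

On this toy path both scales place the boundary between frames 19 and 20. In a hierarchical segmentation, boundaries that persist as `sigma` grows would correspond to higher levels of the table of contents.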