Paper
10 January 2003 Temporal audio segmentation using MPEG-7 descriptors
Author Affiliations +
Proceedings Volume 5021, Storage and Retrieval for Media Databases 2003; (2003) https://doi.org/10.1117/12.476256
Event: Electronic Imaging 2003, 2003, Santa Clara, CA, United States
Abstract
In this paper we present an audio segmentation technique by searching similar sections of a song. The search is performed on MPEG-7 low-level audio feature descriptors as a growing source of multimedia meta data. These descriptors are available every 10 ms of audio data. For each block the similarity to each other block is determined. The result of this operation is a matrix which contains off-diagonal stripes representing similar regions. At that point some postprocessing is necessary due to a very disturbed structure of the similarity matrix. Using the a-priori knowledge that we search off-diagonal stripes which must represent several seconds of audio data we implemented a filter to enhance the structure of the similarity matrix. The last step is to extract the off-diagonal stripes and match them into the time domain of the audio data.
© (2003) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jens Wellhausen and Holger Crysandt "Temporal audio segmentation using MPEG-7 descriptors", Proc. SPIE 5021, Storage and Retrieval for Media Databases 2003, (10 January 2003); https://doi.org/10.1117/12.476256
Lens.org Logo
CITATIONS
Cited by 5 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Multimedia

Databases

Detection and tracking algorithms

Feature extraction

Compact discs

Computer programming

Communication engineering

RELATED CONTENT

Color image retrieval based on refined edge histograms
Proceedings of SPIE (July 19 2013)
A peer to peer music sharing system based on query...
Proceedings of SPIE (September 10 2007)
Quality evaluation of watermarked audio tracks
Proceedings of SPIE (April 29 2002)
Music classification with MPEG-7
Proceedings of SPIE (January 10 2003)
Recognition of the basic terrain features based on CD TIN...
Proceedings of SPIE (December 29 2008)
Audio thumbnailing using MPEG-7 low-level audio descriptors
Proceedings of SPIE (November 26 2003)
Spatial encoding using differences of global features
Proceedings of SPIE (January 15 1997)

Back to Top