Header menu link for other important links
X
Inter and intra item segmentation of continuous audio recordings of carnatic music for archival
, Padi Sarala
Published in International Society for Music Information Retrieval
2013
Pages: 487 - 492
Abstract
The purpose of this paper is to segment carnatic music recordings into individual items for archival purposes using applauses. A concert in carnatic music is replete with applauses. These applauses may be inter-item or intra-item applauses. A property of an item in carnatic music, is that within every item, a small portion of the audio corresponds to the rendering of a composition which is rendered by the entire ensemble of lead performer and accompanying instruments. A concert is divided into segments using applauses and the location of the ensemble in every item is first obtained using Cent Filterbank Cepstral Coefficients (CFCC) combined with Gaussian Mixture Models (GMMs). Since constituent parts of an item are rendered in a single raga, raga information is used to merge adjacent segments belonging to the same item. Inter-item applauses are used to locate the end of an item in a concert. The results are evaluated for fifty live recordings with 990 applauses in total. The classification accuracy for inter and intra item applauses is 93%. Given a song list and the audio, the song list is mapped to the segmented audio of items, which are then stored in the database. © 2013 International Society for Music Information Retrieval.
About the journal
JournalProceedings of the 14th International Society for Music Information Retrieval Conference, ISMIR 2013
PublisherInternational Society for Music Information Retrieval
Open AccessNo
Concepts (7)
  •  related image
    Audio acoustics
  •  related image
    Information retrieval
  •  related image
    CEPSTRAL COEFFICIENTS
  •  related image
    Classification accuracy
  •  related image
    GAUSSIAN MIXTURE MODEL (GMMS)
  •  related image
    MUSIC RECORDING
  •  related image
    Audio recordings