An Improved Approach to Audio Segmentation and Classification in Broadcasting Industries
Author(s): Jingzhou Sun (School of Computer Science and Cybersecurity, Communication University of China, Beijing, China) and Yongbin Wang (School of Computer Science and Cybersecurity, Communication University of China, Beijing, China)
Copyright: 2019
Volume: 30
Issue: 2
Pages: 23
Source title: Journal of Database Management (JDM)
Editor(s)-in-Chief: Keng Siau (City University of Hong Kong, Hong Kong SAR)
DOI: 10.4018/JDM.2019040103
Abstract
Audio segmentation and classification are the basis of audio processing in broadcasting industries. This article proposes a Dual-CNN (Dual-Convolutional Neural Network) method in which a CNN can be pre-trained on unlabeled audio data to cope with the scarcity of labeled data. Auto-encoders (each comprising an encoder and a decoder) are used, hence the name “Dual.” First, the audio sampling points and the STFT (Short-Time Fourier Transform) spectrograms derived from them pass through their own CNNs. The extracted features are then fused. Finally, the merged features are sent to a fully connected network, and the classification results are produced via Softmax. As a segmentation-by-classification approach, the proposed solution also introduces a novel smoothing method (SEG-smoothing) to deliver the best segmentation result. A series of experiments verifies that the proposed approach to segmentation and classification outperforms alternative solutions.
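The last stages described in the abstract (feature fusion, a fully connected layer with Softmax, and turning smoothed frame labels into segments) can be illustrated with a minimal sketch. It is not the authors' implementation. The feature vectors stand in for the outputs of the two CNNs, fusion by concatenation is an assumption, and the weights, biases, function names, and frame length are hypothetical. The smoothing step is a generic sliding-window majority vote used as a stand-in, since the abstract does not specify how SEG-smoothing works.

```python
import math
from collections import Counter


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify_frame(wave_feats, spec_feats, weights, biases):
    """Fuse two feature vectors, apply one dense layer, then softmax.

    wave_feats / spec_feats are placeholders for the two CNN outputs;
    weights and biases are hypothetical dense-layer parameters.
    """
    fused = list(wave_feats) + list(spec_feats)  # assumed fusion: concatenation
    logits = [
        sum(w * x for w, x in zip(row, fused)) + b
        for row, b in zip(weights, biases)
    ]
    return softmax(logits)


def majority_smooth(labels, window=3):
    """Generic sliding-window majority vote (a stand-in, not SEG-smoothing)."""
    half = window // 2
    smoothed = []
    for i, current in enumerate(labels):
        counts = Counter(labels[max(0, i - half): i + half + 1])
        best = max(counts.values())
        # Keep the current label on ties so boundaries are not moved arbitrarily.
        if counts[current] == best:
            smoothed.append(current)
        else:
            smoothed.append(counts.most_common(1)[0][0])
    return smoothed


def labels_to_segments(labels, frame_sec):
    """Merge runs of identical frame labels into (start, end, label) segments."""
    segments = []
    start = 0
    for i in range(1, len(labels) + 1):
        if i == len(labels) or labels[i] != labels[start]:
            segments.append((start * frame_sec, i * frame_sec, labels[start]))
            start = i
    return segments


# One isolated "music" frame inside speech is treated as a classification glitch.
frame_labels = ["speech"] * 4 + ["music"] + ["speech"] * 3 + ["music"] * 5
segments = labels_to_segments(majority_smooth(frame_labels), frame_sec=0.5)
print(segments)  # [(0.0, 4.0, 'speech'), (4.0, 6.5, 'music')]
```

Without the smoothing step, the same frame labels would yield four segments instead of two, which is the kind of over-segmentation that a smoothing stage in a segmentation-by-classification pipeline is meant to suppress.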