IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

An Improved Approach to Audio Segmentation and Classification in Broadcasting Industries

An Improved Approach to Audio Segmentation and Classification in Broadcasting Industries
View Sample PDF
Author(s): Jingzhou Sun (School of Computer Science and Cybersecurity, Communication of China, Beijing, China)and Yongbin Wang (School of Computer Science and Cybersecurity, Communication of China, Beijing, China)
Copyright: 2019
Volume: 30
Issue: 2
Pages: 23
Source title: Journal of Database Management (JDM)
Editor(s)-in-Chief: Keng Siau (City University of Hong Kong, Hong Kong SAR)
DOI: 10.4018/JDM.2019040103

Purchase

View An Improved Approach to Audio Segmentation and Classification in Broadcasting Industries on the publisher's website for pricing and purchasing information.

Abstract

Audio segmentation and classification are the basis of audio processing in broadcasting industries. A Dual-CNN (Dual-Convolutional Neural Network) method is proposed in this article in which it is possible to pre-train a CNN with unlabeled audio data so as to deal with the scarcity of labeled data. Auto-encoders (including an encoder and a decoder) are utilized, thus the name “Dual.” In the first place, audio sampling points and the derived STFT (Short-Time Fourier Transform) spectrograms pass through their own CNNs. Fusion of the extracted features is then performed. Finally, the merged features are sent to a fully connected network and the classification results are produced via Softmax. Being one of the segmentation-by-classification approaches, our solution also presents a novel smoothing method (SEG-smoothing) in order to deliver the best result of segmentation. A series of experiments have been conducted and their result verifies that the proposed approach for segmentation and classification outperforms alternative solutions.

Related Content

Pasi Raatikainen, Samuli Pekkola, Maria Mäkelä. © 2024. 30 pages.
Zhongliang Li, Yaofeng Tu, Zongmin Ma. © 2024. 25 pages.
Zongmin Ma, Daiyi Li, Jiawen Lu, Ruizhe Ma, Li Yan. © 2024. 32 pages.
Lavlin Agrawal, Pavankumar Mulgund, Raj Sharman. © 2024. 37 pages.
Jizi Li, Xiaodie Wang, Justin Z. Zhang, Longyu Li. © 2024. 34 pages.
Amit Singh, Jay Prakash, Gaurav Kumar, Praphula Kumar Jain, Loknath Sai Ambati. © 2024. 25 pages.
Ruizhe Ma, Weiwei Zhou, Zongmin Ma. © 2024. 21 pages.
Body Bottom