IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Efficient Imbalanced Multimedia Concept Retrieval by Deep Learning on Spark Clusters

Efficient Imbalanced Multimedia Concept Retrieval by Deep Learning on Spark Clusters
View Sample PDF
Author(s): Yilin Yan (University of Miami, Department of Electrical and Computer Engineering, Coral Gables, FL, USA), Min Chen (University of Washington Bothell, Bothell, Computing and Software Systems, School of STEM, WA, USA), Saad Sadiq (University of Miami, Department of Electrical and Computer Engineering, Coral Gables, FL, USA)and Mei-Ling Shyu (University of Miami, Department of Electrical and Computer Engineering, Coral Gables, FL, USA)
Copyright: 2017
Volume: 8
Issue: 1
Pages: 20
Source title: International Journal of Multimedia Data Engineering and Management (IJMDEM)
Editor(s)-in-Chief: Chengcui Zhang (University of Alabama at Birmingham, USA)and Shu-Ching Chen (University of Missouri-Kansas City, United States)
DOI: 10.4018/IJMDEM.2017010101

Purchase

View Efficient Imbalanced Multimedia Concept Retrieval by Deep Learning on Spark Clusters on the publisher's website for pricing and purchasing information.

Abstract

The classification of imbalanced datasets has recently attracted significant attention due to its implications in several real-world use cases. The classifiers developed on datasets with skewed distributions tend to favor the majority classes and are biased against the minority class. Despite extensive research interests, imbalanced data classification remains a challenge in data mining research, especially for multimedia data. Our attempt to overcome this hurdle is to develop a convolutional neural network (CNN) based deep learning solution integrated with a bootstrapping technique. Considering that convolutional neural networks are very computationally expensive coupled with big training datasets, we propose to extract features from pre-trained convolutional neural network models and feed those features to another full connected neutral network. Spark implementation shows promising performance of our model in handling big datasets with respect to feasibility and scalability.

Related Content

Yasasi Abeysinghe, Bhanuka Mahanama, Gavindya Jayawardena, Yasith Jayawardana, Mohan Sunkara, Andrew T. Duchowski, Vikas Ashok, Sampath Jayarathna. © 2024. 20 pages.
Chengxuan Huang, Evan Brock, Dalei Wu, Yu Liang. © 2023. 23 pages.
Duleep Rathgamage Don, Jonathan Boardman, Sudhashree Sayenju, Ramazan Aygun, Yifan Zhang, Bill Franks, Sereres Johnston, George Lee, Dan Sullivan, Girish Modgil. © 2023. 17 pages.
Wei-An Teng, Su-Ling Yeh, Homer H. Chen. © 2023. 17 pages.
Hemanth Gudaparthi, Prudhviraj Naidu, Nan Niu. © 2022. 20 pages.
Anchen Sun, Yudong Tao, Mei-Ling Shyu, Angela Blizzard, William Andrew Rothenberg, Dainelys Garcia, Jason F. Jent. © 2022. 19 pages.
Suvojit Acharjee, Sheli Sinha Chaudhuri. © 2022. 16 pages.
Body Bottom