IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Robust Statistical Methods for Rapid Data Labelling

Robust Statistical Methods for Rapid Data Labelling
View Sample PDF
Author(s): Jamie Godwin (University of Durham, UK)and Peter Matthews (University of Durham, UK)
Copyright: 2014
Pages: 35
Source title: Data Mining and Analysis in the Engineering Field
Source Author(s)/Editor(s): Vishal Bhatnagar (Ambedkar Institute of Advanced Communication Technologies and Research, India)
DOI: 10.4018/978-1-4666-6086-1.ch007

Purchase

View Robust Statistical Methods for Rapid Data Labelling on the publisher's website for pricing and purchasing information.

Abstract

Labelling of data is an expensive, labour-intensive, and time consuming process and, as such, results in vast quantities of data being unexploited when performing analysis through data mining. This chapter presents a new paradigm using robust multivariate statistical methods to encapsulate normal operational behaviour—not failure behaviour—to autonomously derive unsupervised classifier labels for previously collected data in a rapid, cost-effective manner. This enables traditional machine learning to take place on a much richer dataset. Two case studies are presented in the mechanical engineering domain, namely, a wind turbine gearbox and a rolling element bearing. A statistically sound and robust methodology is contributed, allowing for rapid labelling of data to enable traditional data mining techniques. Model development is detailed, along with a comparative evaluation of the metrics. Robust derivatives are presented and their superiority is shown. Example “R” code is given in the appendix, allowing readers to employ the techniques discussed. High levels of agreement between the derived statistical approaches and the underlying condition of the components can be found, showing the practical nature and benefit of this approach.

Related Content

. © 2023. 34 pages.
. © 2023. 15 pages.
. © 2023. 15 pages.
. © 2023. 18 pages.
. © 2023. 24 pages.
. © 2023. 32 pages.
. © 2023. 21 pages.
Body Bottom