Please use this identifier to cite or link to this item: https://doi.org/10.21256/zhaw-18449
Full metadata record
DC FieldValueLanguage
dc.contributor.authorTempl, Matthias-
dc.contributor.authorGussenbauer, J.-
dc.contributor.authorFilzmoser, P.-
dc.date.accessioned2019-10-17T12:15:28Z-
dc.date.available2019-10-17T12:15:28Z-
dc.date.issued2019-
dc.identifier.issn0266-4763de_CH
dc.identifier.issn1360-0532de_CH
dc.identifier.urihttps://digitalcollection.zhaw.ch/handle/11475/18449-
dc.description.abstractOutlier detection can be seen as a pre-processing step for locating data points in a data sample, which do not conform to the majority of observations. Various techniques and methods for outlier detection can be found in the literature dealing with different types of data. However, many data sets are inflated by true zeros and, in addition, some components/variables might be of compositional nature. Important examples of such data sets are the Structural Earnings Survey, the Structural Business Statistics, the European Statistics on Income and Living Conditions, tax data or – as in this contribution – household expenditure data which are used, for example, to estimate the Purchase Power Parity of a country. In this work, robust univariate and multivariate outlier detection methods are compared by a complex simulation study that considers various challenges included in data sets, namely structural (true) zeros, missing values, and compositional variables. These circumstances make it difficult or impossible to flag true outliers and influential observations by well-known outlier detection methods. Our aim is to assess the performance of outlier detection methods in terms of their effectiveness to identify outliers when applied to challenging data sets such as the household expenditures data surveyed all over the world. Moreover, different methods are evaluated through a close-to-reality simulation study. Differences in performance of univariate and multivariate robust techniques for outlier detection and their shortcomings are reported. We found that robust multivariate methods outperform robust univariate methods. The best performing methods in finding the outliers and in providing a low false discovery rate were found to be the generalized S estimators (GSE), the BACON-EEM algorithm and a compositional method (CoDa-Cov). In addition, these methods performed also best when the outliers are imputed based on the corresponding outlier detection method and indicators are estimated from the data sets.de_CH
dc.language.isoende_CH
dc.publisherTaylor & Francisde_CH
dc.relation.ispartofJournal of Applied Statisticsde_CH
dc.rightshttp://creativecommons.org/licenses/by-nc-nd/4.0/de_CH
dc.subjectOutlier detectionde_CH
dc.subjectZerosde_CH
dc.subjectRobust methodde_CH
dc.subjectHousehold expenditurede_CH
dc.subject.ddc005: Computerprogrammierung, Programme und Datende_CH
dc.subject.ddc510: Mathematikde_CH
dc.titleEvaluation of robust outlier detection methods for zero-inflated complex datade_CH
dc.typeBeitrag in wissenschaftlicher Zeitschriftde_CH
dcterms.typeTextde_CH
zhaw.departementSchool of Engineeringde_CH
zhaw.organisationalunitInstitut für Datenanalyse und Prozessdesign (IDP)de_CH
dc.identifier.doi10.1080/02664763.2019.1671961de_CH
dc.identifier.doi10.21256/zhaw-18449-
zhaw.funding.euNode_CH
zhaw.issue7de_CH
zhaw.originated.zhawYesde_CH
zhaw.pages.end1167de_CH
zhaw.pages.start1144de_CH
zhaw.publication.statuspublishedVersionde_CH
zhaw.volume47de_CH
zhaw.publication.reviewPeer review (Publikation)de_CH
zhaw.author.additionalNode_CH
Appears in collections:Publikationen School of Engineering

Files in This Item:
File Description SizeFormat 
2020_Templ-etal_Robust-outlier-detection-complex-data.pdf2.15 MBAdobe PDFThumbnail
View/Open
Show simple item record
Templ, M., Gussenbauer, J., & Filzmoser, P. (2019). Evaluation of robust outlier detection methods for zero-inflated complex data. Journal of Applied Statistics, 47(7), 1144–1167. https://doi.org/10.1080/02664763.2019.1671961
Templ, M., Gussenbauer, J. and Filzmoser, P. (2019) ‘Evaluation of robust outlier detection methods for zero-inflated complex data’, Journal of Applied Statistics, 47(7), pp. 1144–1167. Available at: https://doi.org/10.1080/02664763.2019.1671961.
M. Templ, J. Gussenbauer, and P. Filzmoser, “Evaluation of robust outlier detection methods for zero-inflated complex data,” Journal of Applied Statistics, vol. 47, no. 7, pp. 1144–1167, 2019, doi: 10.1080/02664763.2019.1671961.
TEMPL, Matthias, J. GUSSENBAUER und P. FILZMOSER, 2019. Evaluation of robust outlier detection methods for zero-inflated complex data. Journal of Applied Statistics. 2019. Bd. 47, Nr. 7, S. 1144–1167. DOI 10.1080/02664763.2019.1671961
Templ, Matthias, J. Gussenbauer, and P. Filzmoser. 2019. “Evaluation of Robust Outlier Detection Methods for Zero-Inflated Complex Data.” Journal of Applied Statistics 47 (7): 1144–67. https://doi.org/10.1080/02664763.2019.1671961.
Templ, Matthias, et al. “Evaluation of Robust Outlier Detection Methods for Zero-Inflated Complex Data.” Journal of Applied Statistics, vol. 47, no. 7, 2019, pp. 1144–67, https://doi.org/10.1080/02664763.2019.1671961.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.