Logo BSU

Please use this identifier to cite or link to this item: https://elib.bsu.by/handle/123456789/233378
Title: Robust and sparse k-means clustering for high-dimensional data
Authors: Filzmoser, P.
Brodinova S.
Ortner, T.
Breitender, C.
Rohm, M.
Keywords: ЭБ БГУ::ЕСТЕСТВЕННЫЕ И ТОЧНЫЕ НАУКИ::Математика
ЭБ БГУ::ЕСТЕСТВЕННЫЕ И ТОЧНЫЕ НАУКИ::Кибернетика
Issue Date: 2019
Publisher: Minsk : BSU
Citation: Computer Data Analysis and Modeling: Stochastics and Data Science : Proc. of the Twelfth Intern. Conf., Minsk, Sept. 18-22, 2019. – Minsk : BSU, 2019. – P. 32.
Abstract: We introduce a robust k-means-based clustering method for high-dimensional data where not only outliers but also a large number of noise variables are very likely to be present [4]. Although Kondo et al. [2] already addressed such an application scenario, our approach goes even further. Firstly, the introduced method is designed to identify clusters, informative variables, and outliers simultaneously. Secondly, the proposed clustering technique additionally aims at optimizing required parameters, e.g. the number of clusters. This is a great advantage over most existing methods. Moreover, the robustness aspect is achieved through a robust initialization [3] and a proposed weighting function using the Local Outlier Factor [1]. The weighting function provides a valuable source of information about the outlyingness of each observation for a subsequent outlier detection. In order to reveal both clusters and informative variables properly, the approach uses a lasso-type penalty [5]. The method has thoroughly been tested on simulated as well as on real high-dimensional datasets. The conducted experiments demonstrated a great ability of the clustering method to identify clusters, outliers, and informative variables.
URI: http://elib.bsu.by/handle/123456789/233378
ISBN: 978-985-566-811-5
Appears in Collections:2019. Computer Data Analysis and Modeling : Stochastics and Data Science

Files in This Item:
File Description SizeFormat 
32.pdf189,51 kBAdobe PDFView/Open
Show full item record Google Scholar



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.