Robust and sparse k-means clustering for high-dimensional data

Filzmoser, P.; Brodinova, S.; Ortner, T.; Breitender, C.; Rohm, M.

Please use this identifier to cite or link to this document: https://elib.bsu.by/handle/123456789/233378

Full metadata record

DC Field	Value	Language
dc.contributor.author	Filzmoser, P.
dc.contributor.author	Brodinova, S.
dc.contributor.author	Ortner, T.
dc.contributor.author	Breitender, C.
dc.contributor.author	Rohm, M.
dc.date.accessioned	2019-10-29T12:06:18Z	-
dc.date.available	2019-10-29T12:06:18Z	-
dc.date.issued	2019
dc.identifier.citation	Computer Data Analysis and Modeling: Stochastics and Data Science : Proc. of the Twelfth Intern. Conf., Minsk, Sept. 18-22, 2019. – Minsk : BSU, 2019. – P. 32.
dc.identifier.isbn	978-985-566-811-5
dc.identifier.uri	http://elib.bsu.by/handle/123456789/233378	-
dc.description.abstract	We introduce a robust k-means-based clustering method for high-dimensional data in which both outliers and a large number of noise variables are likely to be present [4]. Although Kondo et al. [2] have already addressed this scenario, our approach goes further. First, the method is designed to identify clusters, informative variables, and outliers simultaneously. Second, it also aims to optimize the required parameters, e.g. the number of clusters, which is an advantage over most existing methods. Robustness is achieved through a robust initialization [3] and a proposed weighting function based on the Local Outlier Factor [1]. The weighting function also quantifies the outlyingness of each observation, which can be used for subsequent outlier detection. To reveal both clusters and informative variables, the approach uses a lasso-type penalty [5]. The method has been thoroughly tested on simulated and real high-dimensional datasets. The experiments demonstrated that it can identify clusters, outliers, and informative variables.
dc.language.iso	en
dc.publisher	Minsk : BSU
dc.subject	BSU Digital Library::NATURAL AND EXACT SCIENCES::Mathematics
dc.subject	BSU Digital Library::NATURAL AND EXACT SCIENCES::Cybernetics
dc.title	Robust and sparse k-means clustering for high-dimensional data
dc.type	conference paper
Appears in collections:	2019. Computer Data Analysis and Modeling : Stochastics and Data Science
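
The abstract names a lasso-type penalty [5] but does not state its form. As a rough illustration only, the sketch below assumes the formulation commonly used in sparse k-means, where non-negative variable weights w maximize sum_j w_j * a_j subject to ||w||_2 <= 1 and ||w||_1 <= s, and a_j is a per-variable separation score. Under that assumption the weights are obtained by soft-thresholding the scores. The function names, the bound `l1_bound`, and the example scores are hypothetical; this is not the authors' implementation and it omits the robust initialization and the Local Outlier Factor weighting.

```python
import math


def soft_threshold(values, delta):
    """Shrink each non-negative value toward zero by delta, clipping at zero."""
    return [max(v - delta, 0.0) for v in values]


def l2_normalize(values):
    """Scale a vector to unit Euclidean norm (an all-zero vector is returned unchanged)."""
    norm = math.sqrt(sum(v * v for v in values))
    return [v / norm for v in values] if norm > 0 else list(values)


def sparse_variable_weights(scores, l1_bound, tol=1e-10, max_iter=200):
    """Assumed lasso-type weight update: maximize sum_j w_j * scores_j
    subject to ||w||_2 <= 1, ||w||_1 <= l1_bound and w_j >= 0.

    Assumes the largest score is unique whenever l1_bound is close to 1.
    """
    if l1_bound < 1.0:
        raise ValueError("l1_bound must be at least 1")
    positive = [max(a, 0.0) for a in scores]
    if not any(positive):
        raise ValueError("at least one score must be positive")

    # If the unpenalized solution already satisfies the L1 bound, keep it.
    weights = l2_normalize(positive)
    if sum(weights) <= l1_bound:
        return weights

    # Otherwise bisect on the threshold until the L1 norm meets the bound.
    lo, hi = 0.0, max(positive)
    for _ in range(max_iter):
        mid = (lo + hi) / 2.0
        weights = l2_normalize(soft_threshold(positive, mid))
        if sum(weights) > l1_bound:
            lo = mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    return l2_normalize(soft_threshold(positive, hi))


# Hypothetical scores: two informative variables followed by three noise variables.
example_weights = sparse_variable_weights([4.0, 3.0, 0.5, 0.1, -0.2], l1_bound=1.2)
```

With a tight bound such as `l1_bound=1.2`, the low-scoring variables receive a weight of exactly zero, which is how a penalty of this type separates informative variables from noise variables.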

Full text of the document:

File	Description	Size	Format
32.pdf		189.51 kB	Adobe PDF

All documents in the Digital Library are protected by copyright, all rights reserved.