Entrepôts, Représentation et Ingénierie des Connaissances
Publications du laboratoire

Recherche approfondie

par Année
par Auteur
par Thème
par Type
- MaxMin Linear Initialization for Fuzzy C-Means hal link

Auteur(s): Oztürk A., Lallich S., Darmont J., Waksman Sylvie Yona

Conference: 14th International Conference on Machine Learning and Data Mining (MLDM 2018) (New York, US, 2018-07-14)
Actes de conférence: , vol. p. (2018)

Ref HAL: hal-01771204_v1
Exporter : BibTex | endNote

Clustering is an extensive research area in data science. The aim of clustering is to discover groups and to identify interesting patterns in datasets. Crisp (hard) clustering considers that each data point belongs to one and only one cluster. However, it is inadequate as some data points may belong to several clusters, as is the case in text categorization. Thus, we need more flexible clustering. Fuzzy clustering methods, where each data point can belong to several clusters, are an interesting alternative. Yet, seeding iterative fuzzy algorithms to achieve high quality clustering is an issue. In this paper, we propose a new linear and efficient initialization algorithm MaxMin Linear to deal with this problem. Then, we validate our theoretical results through extensive experiments on a variety of numerical real-world and artificial datasets. We also test several validity indices, including a new validity index that we propose, Transformed Standardized Fuzzy Difference (TSFD).