Bicocca Open Archive

Hierarchical-Hyperspherical Divisive Fuzzy C-Means (H2D-FCM) Clustering for Information Retrieval

Pasi, Gabriella; Bordogna, G.

2009

Abstract

This paper proposes an original soft hierarchical fuzzy clustering algorithm, Hierarchical Hyper-spherical Divisive Fuzzy C-Means (H2D-FCM). It generates a "soft" hierarchy in which a document can belong to several child clusters of a node, and the clusters at a given hierarchical level are more specific than those at the level above and more general than those at the level below. The algorithm is divisive and based on a modified bisecting K-Means: it applies a modified probabilistic Fuzzy C-Means algorithm to divide each node into child nodes. It determines the proper number of clusters to generate at the first level from an entropy measure, and decides whether a node can be split further from a "density" measure. The paper presents the algorithm and its evaluation on two standard collections. © 2009 IEEE.
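
The abstract does not give the modified update rules, so the following is only a rough sketch of the general idea, not the authors' H2D-FCM. It assumes the textbook probabilistic Fuzzy C-Means updates, applied to unit-length document vectors with cosine dissimilarity (one common reading of "hyperspherical"), and uses Bezdek's partition entropy as a stand-in for the unspecified entropy measure. All function names, the fuzzifier `m`, and the membership `threshold` are hypothetical choices made for illustration.

```python
import math

def normalize(v):
    """Scale a vector to unit length so documents lie on the unit hypersphere."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v] if n > 0 else list(v)

def cos_dissim(a, b):
    """Cosine dissimilarity 1 - a.b for unit vectors (half their squared Euclidean distance)."""
    return max(0.0, 1.0 - sum(x * y for x, y in zip(a, b)))

def memberships(docs, centroids, m=2.0, eps=1e-12):
    """Textbook probabilistic FCM membership update; each row sums to 1 (requires m > 1)."""
    U = []
    for d in docs:
        # eps avoids division by zero when a document coincides with a centroid
        inv = [(cos_dissim(d, c) + eps) ** (-1.0 / (m - 1.0)) for c in centroids]
        total = sum(inv)
        U.append([w / total for w in inv])
    return U

def update_centroids(docs, U, m=2.0):
    """Membership-weighted mean of the documents, projected back onto the unit sphere."""
    dim = len(docs[0])
    centroids = []
    for i in range(len(U[0])):
        acc = [0.0] * dim
        for d, u in zip(docs, U):
            w = u[i] ** m
            for j in range(dim):
                acc[j] += w * d[j]
        centroids.append(normalize(acc))
    return centroids

def fuzzy_c_means(docs, init_centroids, m=2.0, iters=50):
    """Alternate membership and centroid updates for a fixed number of iterations."""
    docs = [normalize(d) for d in docs]
    centroids = [normalize(c) for c in init_centroids]
    for _ in range(iters):
        U = memberships(docs, centroids, m)
        centroids = update_centroids(docs, U, m)
    return memberships(docs, centroids, m), centroids

def partition_entropy(U):
    """Bezdek's partition entropy; lower values indicate a crisper partition."""
    return -sum(u * math.log(u) for row in U for u in row if u > 0) / len(U)

def soft_children(U, threshold=0.3):
    """Soft assignment: a document joins every child cluster whose membership reaches the threshold."""
    return [[i for i, u in enumerate(row) if u >= threshold] for row in U]

# Toy "documents" in a two-term vocabulary; the last one mixes both topics.
docs = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9], [1.0, 1.0]]
U, centroids = fuzzy_c_means(docs, init_centroids=[[1.0, 0.0], [0.0, 1.0]])
children = soft_children(U, threshold=0.3)
print(children)
print(partition_entropy(U))
```

In this toy run the mixed document ends up in both child clusters, which is the "soft" behaviour the abstract describes. A divisive hierarchy would then recurse on each child cluster, with a stopping test playing the role of the "density" measure, which is not reproduced here.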

Contribution type: paper
Keywords: hierarchical, hyperspherical, divisive, fuzzy, means, fcm, clustering, information, retrieval
Content language: English
Conference name: 2009 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2009
Conference year: 2009
Proceedings title: WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology
Proceedings ISBN: 978-0-7695-3801-3
Publication date: 2009
Volume number: 1
First page: 614
Last page: 621
Article number: 5284910
DOI: https://dx.doi.org/10.1109/WI-IAT.2009.104
Full text: none
Citation: Pasi, G., Bordogna, G. (2009). Hierarchical-Hyperspherical Divisive Fuzzy C-Means (H2D-FCM) Clustering for Information Retrieval. In WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (pp. 614-621). Washington, D.C.: IEEE [10.1109/WI-IAT.2009.104].
Appears in types: 02 - Conference contribution

Files in this item:

There are no files associated with this item.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/20809
