Agglomerative Clustering Genetic Algorithm

This web site is an extension of work presented in Agglomerative Genetic Algorithm for Clustering in Social Networks. The paper can be found in GECCO ’09: Proceedings of the 11th annual conference on genetic and evolutionary computation

Abstract: Size and complexity of data repositories collaboratively created by Web users generate a need for new processing approaches. In this paper, we study the problem of detection of fine-grained communities of users in social networks, which can be defined as clustering with a large number of clusters. The practical size of social networks makes the traditional evolutionary based clustering approaches, which represent the entire clustering solution as one individual, hard to apply. We propose an Agglomerative Clustering Genetic Algorithm (ACGA): a population of clusters evolves from the initial state in which each cluster represents one user to a high quality clustering solution. Each step of the evolutionary process is performed locally, engaging only a small part of the social network limited to two clusters and their direct neighborhood. This makes the algorithm practically useful independently of the size of the network. Evaluation on two social network models indicates that ACGA is potentially able to detect communities with accuracy comparable or better than two typical centralized clustering algorithms even though ACGA works under much stricter conditions.

Additional resources:
JAVA applet - a web version of ACGA visualization.
Datasets - social network datasets generated by plain and hierarchical models.
Source code - I'm sorry but the code still needs some work (cleaning and commenting) before making it publically available. However, if you are interested in the code please send me an e-mail (address is available on the linked site)

go to main page
Created: 2007.09.09
Last update: 2009.07.29