A graph theoretical perspective for the unsupervised clustering of free text corpora