...or they may apply existing taxonomies – hierarchical knowledge representations – to classify documents, and extracted datamay be used for other forms of analysis. They apply statistical techniques to cluster documents according to discovered characteristics...
http://altaplana.com/WhatsNextForText.pdf