Comparing taxonomies for organising collections of documents

Samuel Fernando, Mark Hall, Eneko Agirre, Aitor Soroa, Paul Clough, Mark Stevenson

Research output: Contribution to conference - Paper - peer-review

11 Citations (Scopus)


There is a demand for taxonomies to organise large collections of documents into categories for browsing and exploration. This paper examines four existing taxonomies that have been manually created, along with two methods for deriving taxonomies automatically from data items. We use these taxonomies to organise items from a large online cultural heritage collection. We then present two human evaluations of the taxonomies. The first measures the cohesion of the taxonomies to determine how well they group together similar items under the same concept node. The second analyses the concept relations in the taxonomies. The results show that the manual taxonomies have high-quality, well-defined relations. However, the novel automatic method is found to generate very high cohesion.
Original language: English
Publication status: Published - 2012
Event: Proceedings of COLING 2012: Technical Papers - Mumbai, India
Duration: 8 Dec 2012 → 15 Dec 2012


Conference: Proceedings of COLING 2012: Technical Papers

