The Sheffield and Basque Country Universities Entry to CHiC: Using Random Walks and Similarity to Access Cultural Heritage

Eneko Agirre, Paul Clough, Samuel Fernando, Mark Hall, Arantxa Otegi, Mark Stevenson

Research output: Contribution to conferenceOther

3 Citations (Scopus)
6 Downloads (Pure)

Abstract

The Cultural Heritage in CLEF 2012 (CHiC) pilot evaluation included these tasks: ad-hoc retrieval, semantic enrichment and variability tasks. At CHiC 2012, the University of She�eld and the University of the Basque Country submitted a joint entry, attempting the three English monolingual tasks. For the ad-hoc task, the baseline approach used the Indri Search engine. Query expansion approaches used random walks using Personalised Page Rank over graphs constructed from Wikipedia and WordNet, and also by �nding similar articles within Wikipedia. For the semantic enrichment task, random walks using Personalised Page Rank were again used. Additionally links to Wikipedia were added and further approaches used this information to �nd enrichment terms. Finally for the variability task, TF-IDF scores were calculated from text and meta-data �elds. The �final results were selected using MMR (Maximal Marginal Relevance) and cosine similarity.
Original languageEnglish
Publication statusPublished - 2012
EventConference & Labs of the Evaluation Forum (CLEF) - Rome, Italy
Duration: 17 Sep 201220 Sep 2012

Conference

ConferenceConference & Labs of the Evaluation Forum (CLEF)
Country/TerritoryItaly
CityRome
Period17/09/1220/09/12

Fingerprint

Dive into the research topics of 'The Sheffield and Basque Country Universities Entry to CHiC: Using Random Walks and Similarity to Access Cultural Heritage'. Together they form a unique fingerprint.

Cite this