Keyness: Matching metrics to definitions

Costas Gabrielatos, Anna Marchi

Research output: Contribution to conference (Paper)

Abstract

In this paper we examine the definitions of two widely used, interrelated constructs in corpus linguistics, keyness and keywords, as presented in the literature and in corpus software manuals. In particular, we focus on (a) the consistency of the definitions given in different sources; (b) the metrics used to calculate the level of keyness; and (c) the compatibility between definitions and metrics. Our survey of studies employing keyword analysis indicates that the vast majority examine only a subset of keywords, almost always the top 100 as ranked by the metric used. This makes the choice of an appropriate metric central to any study using keyword analysis. In this pilot study, we first argue that an appropriate, and therefore useful, metric for keyness needs to be fully consistent with the definition of keyword. We then use two sets of comparisons between corpora of different sizes to test whether, and to what extent, the use of different metrics affects the ranking of keywords. More precisely, we look at the extent of overlap in the keyword rankings produced by different metrics, and we discuss the implications of basing a ranking-based analysis on one metric or another. Finally, we propose a new metric for keyness and demonstrate a simple way to calculate it, which supplements the keyword extraction in existing corpus software.
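The ranking comparison described in the abstract can be illustrated with a minimal sketch. The abstract does not name the metrics compared, so the two used here are assumptions chosen for illustration: the log-likelihood statistic as a statistical-significance measure and a simple ratio of normalised frequencies as an effect-size measure. The word counts and corpus sizes are invented toy values, and the function names are hypothetical; none of this is taken from the paper.

```python
import math

def log_likelihood(a, b, c, d):
    """Log-likelihood (G2) for a word occurring a times in a study corpus
    of c tokens and b times in a reference corpus of d tokens."""
    e1 = c * (a + b) / (c + d)  # expected count in the study corpus
    e2 = d * (a + b) / (c + d)  # expected count in the reference corpus
    total = 0.0
    if a > 0:
        total += a * math.log(a / e1)
    if b > 0:
        total += b * math.log(b / e2)
    return 2 * total

def frequency_ratio(a, b, c, d):
    """Ratio of normalised frequencies (study / reference).
    An illustrative effect-size measure, not the metric proposed in the paper."""
    if b == 0:
        return math.inf
    return (a / c) / (b / d)

def top_n(scores, n):
    """Return the n highest-scoring words, best first."""
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [word for word, _ in ranked[:n]]

def overlap(scores_x, scores_y, n):
    """Proportion of words shared by the two top-n lists."""
    return len(set(top_n(scores_x, n)) & set(top_n(scores_y, n))) / n

# Invented toy data: corpora of very different sizes.
STUDY_SIZE, REF_SIZE = 10_000, 1_000_000
study_counts = {"the": 600, "corpus": 40, "keyness": 8, "metric": 12}
ref_counts = {"the": 55_000, "corpus": 200, "keyness": 2, "metric": 300}

ll_scores = {
    w: log_likelihood(study_counts[w], ref_counts[w], STUDY_SIZE, REF_SIZE)
    for w in study_counts
}
ratio_scores = {
    w: frequency_ratio(study_counts[w], ref_counts[w], STUDY_SIZE, REF_SIZE)
    for w in study_counts
}

print("Ranked by log-likelihood:", top_n(ll_scores, 4))
print("Ranked by frequency ratio:", top_n(ratio_scores, 4))
print("Top-1 overlap:", overlap(ll_scores, ratio_scores, 1))
```

On this toy data the two measures put different words at the top of the list, which is the kind of divergence a top-N analysis would be sensitive to.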
Original language: English
Number of pages: 27
Publication status: Published - 5 Nov 2011
Event: Theoretical-methodological challenges in corpus approaches to discourse studies and some ways of addressing them - Portsmouth, United Kingdom
Duration: 5 Nov 2011 → …

Other

Other: Theoretical-methodological challenges in corpus approaches to discourse studies and some ways of addressing them
Country: United Kingdom
City: Portsmouth
Period: 5/11/11 → …

Keywords

  • keyword
  • keyness
  • keyword analysis
  • effect size
  • statistical significance
  • corpus linguistics

  • Research Output

    Keyness Analysis: nature, metrics and techniques

    Gabrielatos, C., 7 Feb 2018, Corpus Approaches To Discourse: A critical review. Taylor, C. & Marchi, A. (eds.). Oxford: Routledge, p. 225-258 (298 p.)

    Research output: Chapter in Book/Report/Conference proceeding (Chapter)

    Open Access
  • Activities

    Keyness Analysis: A critical overview

    Costas Gabrielatos (Invited speaker)

    22 Feb 2018

    Activity: Talk or presentation types: Invited talk

Cite this

Gabrielatos, C., & Marchi, A. (2011). Keyness: Matching metrics to definitions. Paper presented at Theoretical-methodological challenges in corpus approaches to discourse studies and some ways of addressing them, Portsmouth, United Kingdom.