Keyness Analysis: nature, metrics and techniques

Research output: Chapter in Book/Report/Conference proceedingChapter

13 Citations (Scopus)
13 Downloads (Pure)

Abstract

Keyness analysis is perhaps the most widely used technique within corpus approaches to (critical) discourse studies. This chapter will first define the nature of keyness, and outline the research foci that keyness analysis can be usefully employed to address (e.g. not only differences, but also similarities). It will then provide a brief historical overview of the use of frequency comparisons in corpus linguistics, from manual comparisons of the frequency of particular linguistic items to the automated comparison of the frequency of all words in two corpora (or sub-corpora). The above will also act as the springboard for the discussion of the core aspects of keyness analysis, and associated common misconceptions. 1. The unit of analysis. Most studies select the word as the unit of analysis (as indicated by the common use of the term ‘keyword’). However, this need not be the default selection: frequency comparisons can focus on any lexical or morphosyntactic unit, or particular semantic or pragmatic meanings – as long as the appropriate corpus annotation has been carried out. 2. The selection of the corpora to be compared. The discussion will focus on issues relating to the nature of ‘study’ and ‘reference’ corpus, and their relative size. 3. The issue of appropriate metrics. The discussion will focus on the difference between metrics of effect size and statistical significance, and the usefulness of combining the two types. It will also provide a comparison of the effect size metrics currently adopted by widely used corpus tools. 4. The selection of the key items that a study can usefully focus on. As an automated keyness analysis usually returns a much larger number of key items than is feasible to tackle, the approach to selection is of paramount importance, as it will determine the results and conclusions. These issues, and the associated methodological decisions and their implications, will be discussed with reference to a case study.
Original languageEnglish
Title of host publicationCorpus Approaches To Discourse: A critical review
EditorsCharlotte Taylor, Anna Marchi
Place of PublicationOxford
PublisherRoutledge
Pages225-258
Number of pages298
ISBN (Print)9781138895782
Publication statusPublished - 7 Feb 2018

Fingerprint

analysis
comparison
effect
decision
need

Keywords

  • corpus linguistics
  • keyness analysis
  • quantitative analysis

Cite this

Gabrielatos, C. (2018). Keyness Analysis: nature, metrics and techniques. In C. Taylor, & A. Marchi (Eds.), Corpus Approaches To Discourse: A critical review (pp. 225-258). Oxford: Routledge.
Gabrielatos, Costas. / Keyness Analysis: nature, metrics and techniques. Corpus Approaches To Discourse: A critical review. editor / Charlotte Taylor ; Anna Marchi. Oxford : Routledge, 2018. pp. 225-258
@inbook{7304f57f57124281ad6df37086a965eb,
title = "Keyness Analysis: nature, metrics and techniques",
abstract = "Keyness analysis is perhaps the most widely used technique within corpus approaches to (critical) discourse studies. This chapter will first define the nature of keyness, and outline the research foci that keyness analysis can be usefully employed to address (e.g. not only differences, but also similarities). It will then provide a brief historical overview of the use of frequency comparisons in corpus linguistics, from manual comparisons of the frequency of particular linguistic items to the automated comparison of the frequency of all words in two corpora (or sub-corpora). The above will also act as the springboard for the discussion of the core aspects of keyness analysis, and associated common misconceptions. 1. The unit of analysis. Most studies select the word as the unit of analysis (as indicated by the common use of the term ‘keyword’). However, this need not be the default selection: frequency comparisons can focus on any lexical or morphosyntactic unit, or particular semantic or pragmatic meanings – as long as the appropriate corpus annotation has been carried out. 2. The selection of the corpora to be compared. The discussion will focus on issues relating to the nature of ‘study’ and ‘reference’ corpus, and their relative size. 3. The issue of appropriate metrics. The discussion will focus on the difference between metrics of effect size and statistical significance, and the usefulness of combining the two types. It will also provide a comparison of the effect size metrics currently adopted by widely used corpus tools. 4. The selection of the key items that a study can usefully focus on. As an automated keyness analysis usually returns a much larger number of key items than is feasible to tackle, the approach to selection is of paramount importance, as it will determine the results and conclusions. These issues, and the associated methodological decisions and their implications, will be discussed with reference to a case study.",
keywords = "corpus linguistics, keyness analysis, quantitative analysis",
author = "Costas Gabrielatos",
year = "2018",
month = "2",
day = "7",
language = "English",
isbn = "9781138895782",
pages = "225--258",
editor = "Charlotte Taylor and Anna Marchi",
booktitle = "Corpus Approaches To Discourse: A critical review",
publisher = "Routledge",
address = "United Kingdom",

}

Gabrielatos, C 2018, Keyness Analysis: nature, metrics and techniques. in C Taylor & A Marchi (eds), Corpus Approaches To Discourse: A critical review. Routledge, Oxford, pp. 225-258.

Keyness Analysis: nature, metrics and techniques. / Gabrielatos, Costas.

Corpus Approaches To Discourse: A critical review. ed. / Charlotte Taylor; Anna Marchi. Oxford : Routledge, 2018. p. 225-258.

Research output: Chapter in Book/Report/Conference proceedingChapter

TY - CHAP

T1 - Keyness Analysis: nature, metrics and techniques

AU - Gabrielatos, Costas

PY - 2018/2/7

Y1 - 2018/2/7

N2 - Keyness analysis is perhaps the most widely used technique within corpus approaches to (critical) discourse studies. This chapter will first define the nature of keyness, and outline the research foci that keyness analysis can be usefully employed to address (e.g. not only differences, but also similarities). It will then provide a brief historical overview of the use of frequency comparisons in corpus linguistics, from manual comparisons of the frequency of particular linguistic items to the automated comparison of the frequency of all words in two corpora (or sub-corpora). The above will also act as the springboard for the discussion of the core aspects of keyness analysis, and associated common misconceptions. 1. The unit of analysis. Most studies select the word as the unit of analysis (as indicated by the common use of the term ‘keyword’). However, this need not be the default selection: frequency comparisons can focus on any lexical or morphosyntactic unit, or particular semantic or pragmatic meanings – as long as the appropriate corpus annotation has been carried out. 2. The selection of the corpora to be compared. The discussion will focus on issues relating to the nature of ‘study’ and ‘reference’ corpus, and their relative size. 3. The issue of appropriate metrics. The discussion will focus on the difference between metrics of effect size and statistical significance, and the usefulness of combining the two types. It will also provide a comparison of the effect size metrics currently adopted by widely used corpus tools. 4. The selection of the key items that a study can usefully focus on. As an automated keyness analysis usually returns a much larger number of key items than is feasible to tackle, the approach to selection is of paramount importance, as it will determine the results and conclusions. These issues, and the associated methodological decisions and their implications, will be discussed with reference to a case study.

AB - Keyness analysis is perhaps the most widely used technique within corpus approaches to (critical) discourse studies. This chapter will first define the nature of keyness, and outline the research foci that keyness analysis can be usefully employed to address (e.g. not only differences, but also similarities). It will then provide a brief historical overview of the use of frequency comparisons in corpus linguistics, from manual comparisons of the frequency of particular linguistic items to the automated comparison of the frequency of all words in two corpora (or sub-corpora). The above will also act as the springboard for the discussion of the core aspects of keyness analysis, and associated common misconceptions. 1. The unit of analysis. Most studies select the word as the unit of analysis (as indicated by the common use of the term ‘keyword’). However, this need not be the default selection: frequency comparisons can focus on any lexical or morphosyntactic unit, or particular semantic or pragmatic meanings – as long as the appropriate corpus annotation has been carried out. 2. The selection of the corpora to be compared. The discussion will focus on issues relating to the nature of ‘study’ and ‘reference’ corpus, and their relative size. 3. The issue of appropriate metrics. The discussion will focus on the difference between metrics of effect size and statistical significance, and the usefulness of combining the two types. It will also provide a comparison of the effect size metrics currently adopted by widely used corpus tools. 4. The selection of the key items that a study can usefully focus on. As an automated keyness analysis usually returns a much larger number of key items than is feasible to tackle, the approach to selection is of paramount importance, as it will determine the results and conclusions. These issues, and the associated methodological decisions and their implications, will be discussed with reference to a case study.

KW - corpus linguistics

KW - keyness analysis

KW - quantitative analysis

UR - https://www.routledge.com/

M3 - Chapter

SN - 9781138895782

SP - 225

EP - 258

BT - Corpus Approaches To Discourse: A critical review

A2 - Taylor, Charlotte

A2 - Marchi, Anna

PB - Routledge

CY - Oxford

ER -

Gabrielatos C. Keyness Analysis: nature, metrics and techniques. In Taylor C, Marchi A, editors, Corpus Approaches To Discourse: A critical review. Oxford: Routledge. 2018. p. 225-258