Comparing Frequency and Dispersion Keywords: Effects of Variations in Target and Reference Corpora

Journal article


Publication Details

Author list: PUNJAPORN POJANAPUNYA

Publication year: 2025

Volume number: 32

Issue number: 3

Start page: 1428

End page: 1446

Number of pages: 19


Abstract

Dispersion keyword analysis, which identifies words that occur in significantly more texts in the target corpus than in the reference corpus, has recently been introduced as a more effective alternative to traditional frequency keyword analysis. Previous research has applied this method to target corpora that usually consist of hundreds of texts, with a much larger corpus serving as the reference. However, questions remain about its applicability to cases involving fewer texts and to comparisons between smaller, specific corpora. This study compares the top 100 frequency keywords and dispersion keywords identified under several conditions that varied in the number of texts in the target corpus (24, 100, and 200 texts) and in the type of reference corpus used. Both methods identified unique and shared keywords; however, frequency keywords were found to be more frequent and more widely dispersed than dispersion keywords, not only within the target corpus but also in the reference corpus, whereas dispersion keywords were notably more relevant to the target corpus. The choice between the frequency and dispersion methods, and the relevance of frequency and dispersion keywords to research with differing focuses, are discussed.
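
The difference between the two keyword types can be illustrated with a minimal sketch. It assumes the log-likelihood statistic (G2) as the keyness measure, which the abstract does not specify, and uses a hypothetical toy corpus; the function names `g2` and `keyness` are illustrative only and are not taken from the article. Frequency keyness compares token counts against corpus sizes in tokens, while dispersion keyness compares the number of texts containing the word against the number of texts in each corpus.

```python
import math
from collections import Counter


def g2(hits_t, total_t, hits_r, total_r):
    """Signed log-likelihood (G2) for a 2x2 table (assumed keyness statistic).

    Positive values mean the item is proportionally more common in the target.
    Both totals must be greater than zero.
    """
    observed = [hits_t, total_t - hits_t, hits_r, total_r - hits_r]
    p = (hits_t + hits_r) / (total_t + total_r)  # pooled proportion
    expected = [total_t * p, total_t * (1 - p), total_r * p, total_r * (1 - p)]
    # Cells with an observed count of zero contribute nothing to the sum.
    score = 2 * sum(o * math.log(o / e) for o, e in zip(observed, expected) if o > 0)
    return score if hits_t / total_t > hits_r / total_r else -score


def keyness(target, reference):
    """Return {word: (frequency_keyness, dispersion_keyness)} for target words.

    `target` and `reference` are lists of texts; each text is a list of tokens.
    """
    tf_t = Counter(tok for text in target for tok in text)       # token counts
    tf_r = Counter(tok for text in reference for tok in text)
    df_t = Counter(tok for text in target for tok in set(text))  # text counts
    df_r = Counter(tok for text in reference for tok in set(text))
    n_tok_t, n_tok_r = sum(tf_t.values()), sum(tf_r.values())
    return {
        w: (
            g2(tf_t[w], n_tok_t, tf_r[w], n_tok_r),
            g2(df_t[w], len(target), df_r[w], len(reference)),
        )
        for w in tf_t
    }


# Hypothetical toy data: "burst" is frequent but confined to one target text,
# while "spread" is less frequent but occurs in every target text.
target = [
    ["burst"] * 6 + ["spread", "the"],
    ["spread", "the", "the"],
    ["spread", "the", "the"],
]
reference = [["the", "the", "of"], ["the", "of", "of"], ["the", "of", "the"]]

scores = keyness(target, reference)
for word, (freq_key, disp_key) in sorted(scores.items()):
    print(f"{word:7s} frequency={freq_key:6.2f} dispersion={disp_key:6.2f}")
```

In this toy data, "burst" ranks above "spread" on frequency keyness, while "spread" ranks above "burst" on dispersion keyness, because the dispersion measure ignores how often a word repeats within a single text.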


Last updated on 2025-02-10 at 12:00