Comparing Frequency and Dispersion Keywords: Effects of Variations in Target and Reference Corpora
Journal article
Publication Details
Author list: PUNJAPORN POJANAPUNYA
Publication year: 2025
Volume number: 32
Issue number: 3
Start page: 1428
End page: 1446
Number of pages: 19
Abstract
Dispersion keyword analysis, which identifies words that occur in significantly more texts in the target corpus than in the reference corpus, has recently been introduced as a more effective method than traditional frequency keyword analysis. Previous research has used this method to identify keywords within a target corpus, usually consisting of hundreds of texts, with a much larger corpus serving as the reference. However, questions remain regarding its applicability to cases involving fewer texts and to comparisons between smaller, specialized corpora. This study compares the top 100 frequency keywords and the top 100 dispersion keywords identified under several conditions, which varied in the number of texts in the target corpus (24, 100, and 200 texts) and in the type of reference corpus used. Both methods identified unique and shared keywords; however, frequency keywords were found to be more frequent and more widely dispersed than dispersion keywords, not only within the target corpus but also in the reference corpus, whereas dispersion keywords were notably more relevant to the target corpus. The choice between the frequency and dispersion methods, and the relevance of frequency and dispersion keywords to research with differing focuses, are discussed.
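The contrast between the two methods can be illustrated with a minimal sketch. It assumes the log-likelihood (G2) statistic as the keyness measure, which the abstract does not specify; the function names `g2` and `keyness` and the toy corpora are hypothetical and are not taken from the study. Frequency keyness compares token counts against corpus sizes in tokens, while dispersion keyness compares the number of texts containing the word against the number of texts in each corpus.

```python
import math


def g2(a, b, n1, n2):
    """Log-likelihood (G2) for a 2x2 table: a hits out of n1 vs b hits out of n2.

    Assumption: G2 is used here only for illustration; the abstract does not
    name the statistic used in the study.
    """
    total = n1 + n2
    hits, misses = a + b, (n1 - a) + (n2 - b)
    observed = [a, n1 - a, b, n2 - b]
    expected = [hits * n1 / total, misses * n1 / total,
                hits * n2 / total, misses * n2 / total]
    # Cells with an observed count of zero contribute nothing to the sum.
    return 2 * sum(o * math.log(o / e) for o, e in zip(observed, expected) if o > 0)


def keyness(target, reference, word):
    """Return (frequency keyness, dispersion keyness) for one word.

    Each corpus is a list of texts; each text is a list of tokens.
    """
    # Frequency: occurrences of the word relative to total tokens.
    freq = g2(sum(t.count(word) for t in target),
              sum(t.count(word) for t in reference),
              sum(len(t) for t in target),
              sum(len(t) for t in reference))
    # Dispersion: texts containing the word relative to total texts.
    disp = g2(sum(word in t for t in target),
              sum(word in t for t in reference),
              len(target), len(reference))
    return freq, disp


# Hypothetical toy corpora: "corpus" is frequent but confined to one text,
# while "keyword" is rarer but appears in every target text.
target = [["corpus"] * 6 + ["keyword"],
          ["keyword", "text", "text"],
          ["keyword", "text"]]
reference = [["text", "data"], ["text", "text"], ["data", "text"]]

for w in ("corpus", "keyword"):
    f, d = keyness(target, reference, w)
    print(f"{w}: frequency G2 = {f:.2f}, dispersion G2 = {d:.2f}")
```

In this toy case the word concentrated in a single text ranks higher by frequency, while the word spread across all target texts ranks higher by dispersion, which is the kind of divergence the two keyword lists are compared on.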