An analysis of latent semantic term self-correlation

Laurence A. F. Park, Kotagiri Ramamohanarao

Research output: Contribution to journalArticle

17 Citations (Scopus)

Abstract

Latent semantic analysis (LSA) is a generalized vector space method that uses dimension reduction to generate term correlations for use during the information retrieval process. We hypothesized that even though the dimension reduction establishes correlations between terms, the dimension reduction is causing a degradation in the correlation of a term to itself (self-correlation). In this article, we have proven that there is a direct relationship to the size of the LSA dimension reduction and the LSA self-correlation. We have also shown that by altering the LSA term self-correlations we gain a substantial increase in precision, while also reducing the computation required during the information retrieval process.
Original languageEnglish
Pages (from-to)0.334027777777778-0.357638888888889
Number of pages35
JournalACM Transactions on Information Systems
Volume27
Issue number2
DOIs
Publication statusPublished - 2009

Keywords

  • information retrieval
  • latent semantic indexing

Fingerprint

Dive into the research topics of 'An analysis of latent semantic term self-correlation'. Together they form a unique fingerprint.

Cite this