An analysis of latent semantic term self-correlation

Laurence A. F. Park, Kotagiri Ramamohanarao

    Research output: Contribution to journalArticle

    17 Citations (Scopus)

    Abstract

    Latent semantic analysis (LSA) is a generalized vector space method that uses dimension reduction to generate term correlations for use during the information retrieval process. We hypothesized that even though the dimension reduction establishes correlations between terms, the dimension reduction is causing a degradation in the correlation of a term to itself (self-correlation). In this article, we have proven that there is a direct relationship to the size of the LSA dimension reduction and the LSA self-correlation. We have also shown that by altering the LSA term self-correlations we gain a substantial increase in precision, while also reducing the computation required during the information retrieval process.
    Original languageEnglish
    Pages (from-to)0.334027777777778-0.357638888888889
    Number of pages35
    JournalACM Transactions on Information Systems
    Volume27
    Issue number2
    DOIs
    Publication statusPublished - 2009

    Keywords

    • information retrieval
    • latent semantic indexing

    Fingerprint

    Dive into the research topics of 'An analysis of latent semantic term self-correlation'. Together they form a unique fingerprint.

    Cite this