Second order probabilistic models for Within-Document Novelty detection in academic articles

Research output: Chapter in Book / Conference PaperConference Paperpeer-review

6 Citations (Scopus)

Abstract

It is becoming increasingly difficult to stay aware of the state-of-the- art in any research field due to the exponential increase in the number of academic publications. This problem effects authors and reviewers of submissions to academic journals and conferences, who must be able to identify which portions of an article are novel and which are not. Therefore, having a process to automatically judge the flow of novelty though a document would assist academics in their quest for truth. In this article, we propose the concept of Within Document Novelty Location, a method of identifying locations of novelty and non-novelty within a given document. In this preliminary investigation, we examine if a second order statistical model has any benefit, in terms of accuracy and confidence, over a simpler first order model. Experiments on 928 text sequences taken from three academic articles showed that the second order model provided a significant increase in novelty location accuracy for two of the three documents. There was no significant difference in accuracy for the remaining document, which is likely to be due to the absence of context analysis.
Original languageEnglish
Title of host publicationSIGIR'14 - Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 6 July-11 July, 2014, Gold coast, Queensland, Australia
PublisherAssociation for Computing Machinery
Pages1103-1106
Number of pages4
ISBN (Print)9781450322591
DOIs
Publication statusPublished - 2014
EventACM SIGIR Conference on Research and Development in Information Retrieval -
Duration: 6 Jul 2014 → …

Conference

ConferenceACM SIGIR Conference on Research and Development in Information Retrieval
Period6/07/14 → …

Keywords

  • Within-document Novelty Location
  • originality

Fingerprint

Dive into the research topics of 'Second order probabilistic models for Within-Document Novelty detection in academic articles'. Together they form a unique fingerprint.

Cite this