Informing the curious negotiator: Automatic news extraction from the internet

Debbie Zhang, Simeon J. Simoff

Research output: Chapter in Book / Conference PaperConference Paperpeer-review

11 Citations (Scopus)

Abstract

Information acquisition and validation play an important role in the decision making process during negotiation. In this chapter we briefly present the framework of a smart data mining system for providing contextual information extracted from the Internet to a negotiation agent. We then present one of its components in more details - an effective automated technique for extracting relevant articles from news web sites, so that they can be used further by the mining agents. Most current techniques experience difficulties in coping with changes in web site structure and formats. The proposed extraction process is completely automatic and independent of web site formats. Proposed technique identifies regularities in both format and content of news web sites. The algorithms are applicable to both single- and multi-document web sites. Since invalid URLs can cause errors in data extraction, we also present a method for the negotiation agent to estimate the validity of the extracted data based on the frequency of the relevant words in the news title. Once the news articles are extracted the next task is to construct sets of given articles. This chapter presents a new procedure for constructing news data sets on given topics. The extracted news data set is further utilised by the parties involved in negotiation. The information retrieved from the data set can support both human and automated negotiators.

Original languageEnglish
Title of host publicationData Mining
Subtitle of host publicationTheory, Methodology, Techniques, and Applications
PublisherSpringer Verlag
Pages176-191
Number of pages16
ISBN (Print)3540325476, 9783540325475
DOIs
Publication statusPublished - 2006
Externally publishedYes

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3755 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Fingerprint

Dive into the research topics of 'Informing the curious negotiator: Automatic news extraction from the internet'. Together they form a unique fingerprint.

Cite this