Building sparse support vector machines for multi-instance classification

Zhouyu Fu, Guojun Lu, Kai Ming Ting, Dengsheng Zhang

    Research output: Chapter in Book / Conference PaperConference Paperpeer-review

    3 Citations (Scopus)

    Abstract

    ![CDATA[We propose a direct approach to learning sparse Support Vector Machine (SVM) prediction models for Multi-Instance (MI) classification. The proposed sparse SVM is based on a "label-mean" formulation of MI classification which takes the average of predictions of individual instances for bag-level prediction. This leads to a convex optimization problem, which is essential for the tractability of the optimization problem arising from the sparse SVM formulation we derived subsequently, as well as the validity of the optimization strategy we employed to solve it. Based on the "label-mean" formulation, we can build sparse SVM models for MI classification and explicitly control their sparsities by enforcing the maximum number of expansions allowed in the prediction function. An effective optimization strategy is adopted to solve the formulated sparse learning problem which involves the learning of both the classifier and the expansion vectors. Experimental results on benchmark data sets have demonstrated that the proposed approach is effective in building very sparse SVM models while achieving comparable performance to the state-of-the-art MI classifiers.]]
    Original languageEnglish
    Title of host publicationMachine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2011, Athens, Greece, September 5-9, 2011: Proceedings, Part I
    PublisherSpringer
    Pages471-486
    Number of pages16
    ISBN (Print)9783642237799
    DOIs
    Publication statusPublished - 2011
    EventECML PKDD (Conference) -
    Duration: 5 Sept 2011 → …

    Publication series

    Name
    ISSN (Print)0302-9743

    Conference

    ConferenceECML PKDD (Conference)
    Period5/09/11 → …

    Fingerprint

    Dive into the research topics of 'Building sparse support vector machines for multi-instance classification'. Together they form a unique fingerprint.

    Cite this