This site has been permanently archived. This is a static copy provided by the University of Southampton.
%A Cyril Goutte
%T Fast & Confident Probabilistic Categorization
%X We describe NRC's submission to the Anomaly Detection/Text Mining competition organised at the Text Mining Workshop 2007. This submission relies on a straightforward implementation of the probabilistic categoriser described in (Gaussier et al., ECIR'02). This categoriser is adapted to handle multiple labelling and a piecewise-linear confidence estimation layer is added to provide an estimate of the labelling confidence. This technique achieves a score of 1.689 on the test data.
%D 2007
%K Text categorization, probabilistic model, confidence estimation, multi-label categorization, category description
%L cogprints5626