Goutte, Cyril (2007) Fast & Confident Probabilistic Categorization. [Conference Paper]
Full text available as:
|
PDF
121Kb |
Abstract
We describe NRC's submission to the Anomaly Detection/Text Mining competition organised at the Text Mining Workshop 2007. This submission relies on a straightforward implementation of the probabilistic categoriser described in (Gaussier et al., ECIR'02). This categoriser is adapted to handle multiple labelling and a piecewise-linear confidence estimation layer is added to provide an estimate of the labelling confidence. This technique achieves a score of 1.689 on the test data.
Item Type: | Conference Paper |
---|---|
Keywords: | Text categorization, probabilistic model, confidence estimation, multi-label categorization, category description |
Subjects: | Computer Science > Statistical Models Linguistics > Computational Linguistics Computer Science > Machine Learning |
ID Code: | 5626 |
Deposited By: | Goutte, Dr. Cyril |
Deposited On: | 28 Jul 2007 |
Last Modified: | 11 Mar 2011 08:56 |
References in Article
Select the SEEK icon to attempt to find the referenced article. If it does not appear to be in cogprints you will be forwarded to the paracite service. Poorly formated references will probably not work.
Metadata
- ASCII Citation
- Atom
- BibTeX
- Dublin Core
- EP3 XML
- EPrints Application Profile (experimental)
- EndNote
- HTML Citation
- ID Plus Text Citation
- JSON
- METS
- MODS
- MPEG-21 DIDL
- OpenURL ContextObject
- OpenURL ContextObject in Span
- RDF+N-Triples
- RDF+N3
- RDF+XML
- Refer
- Reference Manager
- Search Data Dump
- Simple Metadata
- YAML
Repository Staff Only: item control page