Combining Independent Modules in Lexical Multiple-Choice Problems

Turney, Peter D. and Littman, Michael L. and Bigham, Jeffrey and Shnayder, Victor (2004) Combining Independent Modules in Lexical Multiple-Choice Problems. [Book Chapter]

Full text available as:



Existing statistical approaches to natural language problems are very coarse approximations to the true complexity of language processing. As such, no single technique will be best for all problem instances. Many researchers are examining ensemble methods that combine the output of multiple modules to create more accurate solutions. This paper examines three merging rules for combining probability distributions: the familiar mixture rule, the logarithmic rule, and a novel product rule. These rules were applied with state-of-the-art results to two problems used to assess human mastery of lexical semantics -- synonym questions and analogy questions. All three merging rules result in ensembles that are more accurate than any of their component modules. The differences among the three rules are not statistically significant, but it is suggestive that the popular mixture rule is not the best rule for either of the two problems.

Item Type:Book Chapter
Subjects:Computer Science > Statistical Models
Computer Science > Language
Linguistics > Computational Linguistics
Linguistics > Semantics
Computer Science > Machine Learning
ID Code:4027
Deposited By: Turney, Peter
Deposited On:10 Jan 2005
Last Modified:11 Mar 2011 08:55

References in Article

Select the SEEK icon to attempt to find the referenced article. If it does not appear to be in cogprints you will be forwarded to the paracite service. Poorly formated references will probably not work.

Brill, Eric & Jun Wu. 1998. "Classifier Combination for Improved Lexical Disambiguation". Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL’98), vol. 1, 191-195. Montreal, Canada.

Chalmers, David J., Robert M. French, & Douglas R. Hofstadter. 1992. "High-level Perception, Representation and Analogy: A Critique of Artificial Intelligence Methodology". Journal of Experimental and Theoretical Artificial Intelligence 4:185-211.

Fellbaum, Christiane. 1998. WordNet: An Electronic Lexical Database. Cambridge, Mass.: MIT Press.

Florian, Radu & David Yarowsky. 2002. "Modeling Consensus: Classifier Combination for Word Sense Disambiguation". Proceedings of the Conference on Empirical Methods in Natural Language Processing, 25-32. Philadelphia.

Heskes, Tom. 1998. "Selecting Weighting Factors in Logarithmic Opinion Pools". Advances in Neural Information Processing Systems 10:266-272.

Hinton, Geoffrey E. 1999. "Products of Experts". Proceedings of the 9th International Conference on Artificial Neural Networks (ICANN 99), 1:1-6. Edinburgh, Scotland.

Jacobs, Robert A. 1995. "Methods for Combining Experts’ Probability Assessments". Neural Computation 7:5.867-888.

Jacobs, Robert A., Michael I. Jordan, Steve J. Nowlan & Geoffrey E. Hinton. 1991. "Adaptive Mixtures of Experts". Neural Computation 3:79-87.

Jarmasz, Mario & Stan Szpakowicz. 2003. "Roget’s Thesaurus and Semantic Similarity". Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-03), 212-219. Borovets, Bulgaria.

Lakoff, George & Mark Johnson. 1980. Metaphors We Live By. Chicago: University of Chicago Press.

Landauer, Thomas K. & Susan T. Dumais. 1997. "A Solution to Plato’s Problem: The Latent Semantic Analysis Theory of Acquisition, Induction and Representation of Knowledge". Psychological Review 104:2.211-240.

Littman, Michael L., Greg A. Keim & Noam Shazeer. 2002. "A Probabilistic Approach to Solving Crossword Puzzles". Artificial Intelligence 134:23-55.

Schapire, Robert E. 1999. "A Brief Introduction to Boosting". Proceedings of the 16th International Joint Conference on Artificial Intelligence (IJCAI-99), 1401-1406. Stockholm, Sweden.

Terra, Egidio & C. L.A. Clarke. 2003. "Frequency Estimates for Statistical Word Similarity Measures". Proceedings of the Human Language Technology and North American Chapter of Association of Computational Linguistics Conference 2003 (HLT/NAACL 2003), 244-251. Edmonton, Alberta, Canada.

Turney, Peter D. 2001. "Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL". Proceedings of the 12th European Conference on Machine Learning (ECML-2001), 491-502. Freiburg, Germany.

Turney, Peter D. & Michael L. Littman. 2003a. "Learning Analogies and Semantic Relations". Technical Report ERB-1103. Ottawa, Ontario, Canada: National Research Council, Institute for Information Technology.

Turney, Peter D. & Michael L. Littman. 2003b. "Measuring Praise and Criticism: Inference of Semantic Orientation from Association". ACMTransactions on Information Systems 21:4.315-346.

Turney, Peter D., Michael L. Littman, Jeffrey Bigham & Victor Shnayder. 2003. "Combining Independent Modules to SolveMultiple-Choice Synonym and Analogy Problems". Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP-03), 482-489. Borovets, Bulgaria.

Xu, Lei, Adam Krzyzak & Ching Y. Suen. 1992. "Methods of Combining Multiple Classifiers and Their Applications to Handwriting Recognition". IEEE Transactions on Systems, Man and Cybernetics 22:3.418-435.


Repository Staff Only: item control page