creators_name: Nadeau, David creators_name: Barrière, Caroline creators_name: George, Foster creators_id: pythonner creators_id: creators_id: editors_name: Hahn, Walther V. editors_name: Hutchins, John editors_name: Vertan, Cristina type: confpaper datestamp: 2005-11-12 lastmod: 2011-03-11 08:56:13 metadata_visibility: show title: BIKE: Bilingual Keyphrase Experiments ispublished: pub subjects: comp-sci-lang full_text_status: public keywords: statistical machine translation, lexical resources, keyphrase list. abstract: This paper presents a novel strategy for translating lists of keyphrases. Typical keyphrase lists appear in scientific articles, information retrieval systems and web page meta-data. Our system combines a statistical translation model trained on a bilingual corpus of scientific papers with sense-focused look-up in a large bilingual terminological resource. For the latter, we developed a novel technique that benefits from viewing the keyphrase list as contextual help for sense disambiguation. The optimal combination of modules was discovered by a genetic algorithm. Our work applies to the French / English language pair. date: 2005 date_type: published pagerange: 36-41 refereed: TRUE referencetext: Peter F. Brown, Stephen A. Della Pietra, Vincent J. Della Pietra, and Robert L. Mercer (1993). The Mathematics of Machine Translation: Parameter Estimation. Computational Linguistics 19(2), 1993. Yunbo Cao and Hang Li (2002). Base Noun Phrase Translation Using Web Data and the EM Algorithm. Coling02. Nigel Collier, Hideki Hirakawa, and Akira Kumano (1998). Machine Translation vs. Dictionary Term Translation - a Comparison for English-Japanese News Article Alignment. ColingACL98. P. Fung and K. McKeown (1997). A Technical Wordand Term-Translation Aid Using Noisy Parallel Corpora across Language Groups. Machine Translation. Philipp Koehn and Kevin Knight (2003). Feature-Rich Statistical Translation of Noun Phrases. ACL03. Philipp Koehn, Franz Josef Och, and Daniel Marcu (2003). Statistical Phrase-Based Translation. NAACL03. Julian Kupiec (1993). An Algorithm For Finding Noun Phrase Correspondences In Bilingual Corpora. ACL93. Robert C. Moore (2003). Learning Translations of Named Entity Phrases from Parallel Corpora. EACL03. Fatiha Sadat, Akira Maeda, Masatoshi Yoshikawa, and Shunsuke Uemura 2001. Query Expansion Techniques for the CLEF Bilingual Track. In Proceedings of the CLEF 2001 Workshop on Cross-language System Evaluation Campaign, pp. 99-104, Darmstadt, Germany. Nizar Hibash and Bonnie Dorr (2002). Handling Translation Divergences: Combining Statistical and Symbolic Techniques in Generation-Heavy Machine Translation. AMT02. Judith Klavans and Philipp Resnik (1994). The Balancing Act: Combining Symbolic and Statistical Approaches to Language. ACL Workshop Proceedings. David Nadeau, Mario Jarmasz, Caroline Barrière, George Foster and Claude St-Jacques (2004). Using COTS Search Engines and Custom Query Strategies at CLEF. Cross-Language Evaluation Forum CLEF 2004. citation: Nadeau, David and Barrière, Caroline and George, Foster (2005) BIKE: Bilingual Keyphrase Experiments. [Conference Paper] document_url: http://cogprints.org/4603/1/NRC-48286.pdf