---
abstract: "We propose to use a statistical phrase-based machine translation system in a post-editing task: the system takes as input raw machine translation output (from a commercial rule-based MT system), and produces post-edited target-language text. We report on experiments that were performed on data collected in precisely such a setting: pairs of raw MT output and their manually post-edited versions. In our evaluation, the output of our automatic post-editing (APE) system is not only better quality than the rule-based MT (both in terms of the BLEU and TER metrics), it is also better than the output of a state-of-the-art phrase-based MT system used in standalone translation mode. These results indicate that automatic post-editing constitutes a simple and efficient way of combining rule-based and statistical MT technologies.\n"
altloc:
  - http://www.aclweb.org/anthology/N/N07/N07-1064
chapter: ~
commentary: ~
commref: ~
confdates: April 2007
conference: 'Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics'
confloc: 'Rochester, USA'
contact_email: ~
creators_id: []
creators_name:
  - family: Simard
    given: Michel
    honourific: ''
    lineage: ''
  - family: Goutte
    given: Cyril
    honourific: ''
    lineage: ''
  - family: Isabelle
    given: Pierre
    honourific: ''
    lineage: ''
date: 2007
date_type: published
datestamp: 2007-07-28
department: ~
dir: disk0/00/00/56/27
edit_lock_since: ~
edit_lock_until: ~
edit_lock_user: ~
editors_id: []
editors_name: []
eprint_status: archive
eprintid: 5627
fileinfo: /style/images/fileicons/application_pdf.png;/5627/1/N07%2D1064.pdf
full_text_status: public
importid: ~
institution: ~
isbn: ~
ispublished: pub
issn: ~
item_issues_comment: []
item_issues_count: 0
item_issues_description: []
item_issues_id: []
item_issues_reported_by: []
item_issues_resolved_by: []
item_issues_status: []
item_issues_timestamp: []
item_issues_type: []
keywords: 'Machine Translation, Post-editing, Statistical MT, Phrase-based MT'
lastmod: 2011-03-11 08:56:56
latitude: ~
longitude: ~
metadata_visibility: show
note: ~
number: ~
pagerange: 508-515
pubdom: FALSE
publication: ~
publisher: ~
refereed: TRUE
referencetext: |
  Jeffrey Allen and Christofer Hogan. 2000. Toward the development of a post-editing module for Machine Translation raw output: a new productivity tool for processing controlled language. In Third International Controlled Language Applications Workshop (CLAW2000), Washington, USA.
  Jeffrey Allen. 2004. Case study: Implementing MT for the translation of pre-sales marketing and post-sales software deployment documentation. In Proceedings of AMTA-2004, pages 1-6, Washington, USA.
  Peter F. Brown, Stephen A. Della Pietra, Vincent J. Della Pietra, and Robert L. Mercer. 1993. The Mathematics of Statistical Machine Translation: Parameter Estimation. Computational Linguistics, 19(2):263-311.
  Jakob Elming. 2006. Transformation-based corrections of rule-based MT. In Proceedings of the EAMT 11th Annual Conference, Oslo, Norway.
  George Foster, Roland Kuhn, and Howard Johnson. 2006. Phrasetable Smoothing for Statistical Machine Translation. In Proceedings of EMNLP 2006, pages 53-61, Sydney, Australia.
  Kevin Knight and Ishwar Chander. 1994. Automated Postediting of Documents. In Proceedings of National Conference on Artificial Intelligence, pages 779-784, Seattle, USA.
  Philipp Koehn, Franz J. Och, and Daniel Marcu. 2003. Statistical Phrase-Based Translation. In Proceedings of HLT-NAACL 2003, pages 127-133, Edmonton, Canada.
  Philipp Koehn. 2004. Pharaoh: a Beam Search Decoder for Phrase-Based Statistical Machine Translation Models. In Proceedings of AMTA 2004, pages 115-124, Washington, USA.
  Daniel Marcu and William Wong. 2002. A Phrase-Based, Joint Probability Model for Statistical Machine Translation. In Proceedings of EMNLP 2002, Philadelphia, USA.
  Franz Josef Och. 2003. Minimum error rate training in Statistical Machine Translation. In Proceedings of ACL-2003, pages 160-167, Sapporo, Japan.
  Fatiha Sadat, Howard Johnson, Akakpo Agbago, George Foster, Roland Kuhn, Joel Martin, and Aaron Tikuisis. 2005. PORTAGE: A Phrase-Based Machine Translation System. In Proceedings of the ACL Workshop on Building and Using Parallel Texts, pages 129-132, Ann Arbor, USA.
  Matthew Snover, Bonnie Dorr, Richard Schwartz, Linnea Micciulla, and John Makhoul. 2006. A Study of Translation Edit Rate with Targeted Human Annotation. In Proceedings of AMTA-2006, Cambridge, USA.
relation_type: []
relation_uri: []
reportno: ~
rev_number: 12
series: ~
source: ~
status_changed: 2007-09-12 17:11:08
subjects:
  - ling-comput
  - comp-sci-mach-learn
  - comp-sci-art-intel
succeeds: ~
suggestions: ~
sword_depositor: ~
sword_slug: ~
thesistype: ~
title: Statistical Phrase-based Post-editing
type: confpaper
userid: 7131
volume: ~