creators_name: Bayraktar, Murat
creators_name: Say, Bilge
creators_name: Akman, Varol
type: journalp
datestamp: 1998-11-10
lastmod: 2011-03-11 08:53:44
metadata_visibility: show
title: An Analysis of English Punctuation: The Special Case of Comma
ispublished: pub
subjects: ling-comput
subjects: ling-syntax
full_text_status: public
keywords: punctuation, structural punctuation marks, comma, the Penn Treebank, the Wall Street Journal, corpus linguistics.
abstract: Punctuation has usually been ignored by researchers in computational linguistics over the years. Recently, it has been realized that a true understanding of written language will be impossible if punctuation marks are not taken into account. This paper contains the details of a computer-aided exercise to investigate English punctuation practice for the special case of comma (the most significant punctuation mark) in a parsed corpus. The study classifies the various "structural" uses of the comma according to the syntax-patterns in which a comma occurs. The corpus (Penn Treebank) consists of syntactically annotated sentences with no part-of-speech tag information about individual words.
date: 1998
date_type: published
publication: International Journal of Corpus Linguistics
volume: 3
number: 1
pagerange: 33-57
refereed: TRUE
citation: Bayraktar, Murat and Say, Bilge and Akman, Varol (1998) An Analysis of English Punctuation: The Special Case of Comma. [Journal (Paginated)]
document_url: http://cogprints.org/214/2/ijcl.ps