Manin, Dmitrii (2006) Experiments on predictability of word in context and information rate in natural language. [Journal (Paginated)]
Full text available as: PDF (171Kb), available under License Creative Commons Attribution Non-commercial.
Abstract
Based on data from a large-scale experiment with human subjects, we conclude that the logarithm of the probability of guessing a word in context (its unpredictability) depends linearly on word length. This result holds for both poetry and prose, even though with prose the subjects do not know the length of the omitted word. We hypothesize that this effect reflects a tendency of natural language to have an even information rate.
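The linear relation stated in the abstract can be illustrated with a minimal sketch. It assumes that unpredictability is measured as the negative base-2 logarithm of the guess probability (a sign and unit convention chosen here, not taken from the paper), and it uses made-up probabilities constructed to be exactly linear; none of the numbers below are results from the experiment. The helper names `unpredictability` and `fit_line` are hypothetical.

```python
import math


def unpredictability(p_guess):
    """Unpredictability in bits, assumed here to be -log2 of the guess probability."""
    return -math.log2(p_guess)


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


# Hypothetical guess probabilities by word length (NOT data from the paper),
# constructed so that unpredictability is exactly 0.5 + 1.0 * length bits.
lengths = [1, 2, 3, 4, 5, 6]
p_guess = [2 ** -(0.5 + 1.0 * n) for n in lengths]

intercept, slope = fit_line(lengths, [unpredictability(p) for p in p_guess])
print(f"intercept={intercept:.2f} bits, slope={slope:.2f} bits per letter")
```

Under this reading, the slope is the extra information each additional letter carries. A slope that stays roughly constant across word lengths is what an even information rate would look like in such a fit.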
Item Type: | Journal (Paginated) |
---|---|
Additional Information: | Text is somewhat extended compared to the published version. |
Keywords: | Natural language, information theory, information rate, entropy, experiment, word guessing |
Subjects: | Computer Science > Language; Linguistics > Computational Linguistics |
ID Code: | 5817 |
Deposited By: | Manin, Dmitrii |
Deposited On: | 13 Nov 2007 00:51 |
Last Modified: | 11 Mar 2011 08:57 |