TY - GEN
N1 - Text is somewhat extended compared to the published version.
ID - cogprints5817
UR - http://cogprints.org/5817/
A1 - Manin, Dmitrii
Y1 - 2006/12/26/
N2 - Based on data from a large-scale experiment with human subjects, we conclude that the logarithm of the probability of guessing a word in context (unpredictability) depends linearly on word length. This result holds for both poetry and prose, even though in prose the subjects do not know the length of the omitted word. We hypothesize that this effect reflects a tendency of natural language to have an even information rate.
PB - Keldysh Institute of Applied Mathematics (KIAM) RAS
KW - Natural language
KW - information theory
KW - information rate
KW - entropy
KW - experiment
KW - word guessing
TI - Experiments on predictability of word in context and information rate in natural language
SP - 229
AV - public
EP - 236
ER -