--- abstract: |- Studies of the categorical perception (CP) of sensory continua have a long and rich history in psychophysics. In 1977, Macmillan et al. introduced the use of signal detection theory to CP studies. Anderson et al. simultaneously proposed the first neural model for CP, yet this line of research has been less well explored. In this paper, we assess the ability of neural-network models of CP to predict the psychophysical performance of real observers with speech sounds and artificial/novel stimuli. We show that a variety of neural mechanisms is capable of gen-erating the characteristics of categorical perception. Hence, CP may not be a special mode of perception but an emergent property of any sufficiently powerful general learning system. altloc: - http://www.bib.ecs.soton.ac.uk/records/513 chapter: ~ commentary: ~ commref: ~ confdates: ~ conference: ~ confloc: ~ contact_email: ~ creators_id: [] creators_name: - family: Damper given: R.I. honourific: '' lineage: '' - family: Harnad given: S.R. honourific: '' lineage: '' date: 2000 date_type: published datestamp: 2001-06-19 department: ~ dir: disk0/00/00/16/20 edit_lock_since: ~ edit_lock_until: ~ edit_lock_user: ~ editors_id: [] editors_name: [] eprint_status: archive eprintid: 1620 fileinfo: /style/images/fileicons/application_pdf.png;/1620/3/paper.pdf full_text_status: public importid: ~ institution: ~ isbn: ~ ispublished: pub issn: ~ item_issues_comment: [] item_issues_count: 0 item_issues_description: [] item_issues_id: [] item_issues_reported_by: [] item_issues_resolved_by: [] item_issues_status: [] item_issues_timestamp: [] item_issues_type: [] keywords: 'categorical perception, neural networks' lastmod: 2011-03-11 08:54:42 latitude: ~ longitude: ~ metadata_visibility: show note: ~ number: 4 pagerange: 843-867 pubdom: FALSE publication: Perception and Psychophysics publisher: ~ refereed: TRUE referencetext: |- Abramson, A., & Lisker, L. (1970). Discrimination along the voicing continuum: Cross-language tests. In Proceedings of 6th international congress of phonetic sciences, prague, 1967 (pp. 569-573). Academia, Prague. Amari, S., & Takeuchi, A. (1978). A mathematical theory on for-mation of category detecting neurons. Biological Cybernetics, 29, 127-136. Anderson, J. A. (1968). A memory storage model utilizing spatial correlation functions. Kybernetik, 5, 113-119. Anderson, J. A. (1977). Neural models with cognitive implications. In D. LaBerge & S. J. Samuels (Eds.), Basic processes in read-ing: Perception and comprehension (pp. 27-90). Hillsdale, NJ: Lawrence Erlbaum. Anderson, J. A. (1995). An introduction to neural networks. Cam-bridge, MA: MIT Press. Anderson, J. A., & Silverstein, J. W. (1978). Reply to Grossberg. Psychological Review, 85, 597-603. Anderson, J. A., Silverstein, J. W., & Ritz, S. A. (1977). Vowel pre-processing with a neurally based model. In Proceedings of ieee international conference on acoustics, speech and signal processing, icassp?77 (pp. 265-269). Hartford, CN. Anderson, J. A., Silverstein, J. W., Ritz, S. A., & Jones, R. S. (1977). Distinctive features, categorical perception, and prob-ability learning: Some applications for a neural model. Psycho-logical Review, 84, 413-451. Arbib, M. A. (1995). Handbook of brain theory and neural net-works. Cambridge, MA: MIT Press. Baldi, P., & Hornik, K. (1989). Neural networks and principal com-ponent analysis: Learning from examples without local minima. Neural Networks, 2, 53-58. Beale, J. M., & Keil, F. C. (1995). Categorical effects in the per-ception of faces. Cognition, 57, 217-239. Bī egin, J., & Proulx, R. (1996). Categorization in unsupervised neu-ral networks: The Eidos model. IEEE Transactions on Neural Networks, 7, 147-154. Bornstein, M. H. (1987). Perceptual categories in vision and au-dition. In S. Harnad (Ed.), Categorical perception: the ground-work of cognition (pp. 287-300). Cambridge, UK: Cambridge University Press. Bourland, H., & Kamp, Y. (1988). Auto-association by multilayer perceptrons and singular value decomposition. Biological Cy-bernetics, 59, 291-294. Brady, S. A., & Darwin, C. J. (1978). Range effects in the percep-tion of voicing. Journal of the Acoustical Society of America, 63, 1556-1558. Braida, L. D., & Durlach, N. I. (1972). Intensity perception: II. Resolution in one-interval paradigms. Journal of the Acous-tical Society of America, 51, 483-502. Carpenter, G. A., Grossberg, S., & Rosen, D. B. (1991a). ART 2-A: An adaptive resonance algorithm for rapid category learning and recognition. Neural Networks, 4, 493-504. Carpenter, G. A., Grossberg, S., & Rosen, D. B. (1991b). Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system. Neural Networks, 4, 759-771. Damper, R. I. (1998). Auditory representations of speech sounds in a neural model: The role of peripheral processing. In Pro-ceedings of international joint conference on neural networks (ijcnn 98) (pp. 2196-2201). Anchorage, AL. Damper, R. I., Gunn, S. R., & Gore, M. O. (2000). Extracting pho-netic knowledge from learning systems: Perceptrons, support vector machines and linear discriminants. Applied Intelligence, 12, 43-62. Damper, R. I., Pont, M. J., & Elenius, K. (1990). Representation of initial stop consonants in a computational model of the dor-sal cochlear nucleus (Tech. Rep. No. STL-QPSR 4/90). Speech Transmission Laboratory Quarterly Progress and Status Report, Royal Institute of Technology (KTH), Stockholm. (Also pub-lished in W. A. Ainsworth (Ed.), Advances in Speech, Hearing and Language Processing, Vol. 3 (Part B), pp. 497-546. Green-wich, CT: JAI Press, 1996) Diehl, R. L., Elman, J. E., & McCusker, S. B. (1978). Contrast ef-fects on stop consonant identification. Journal of Experimental Psychology: Human Perception and Performance, 4, 599-609. Diehl, R. L., & Kluender, K. R. (1987). On the categorization of speech sounds. In S. Harnad (Ed.), Categorical perception: the groundwork of cognition (pp. 226-253). Cambridge, UK: Cambridge University Press. Dreyfus, H. L., & Dreyfus, S. E. (1988). Making a mind ver-sus modeling the brain: Artificial intelligence back at a branch-point. Daedalus, 117, 15-43. Eijkman, E. G. J. (1992). Neural nets tested by psychophysical methods. Neural Networks, 5, 153-162. Elman, J. L. (1979). Perceptual origins of the phoneme boundary effect and selective adaptation to speech: A signal detection the-ory analysis. Journal of the Acoustical Society of America, 65, 190-207. Elman, J. L. (1990). Finding structure in time. Cognitive Science, 14(2), 179-211. Elman, J. L., & McClelland, J. L. (1986). Exploiting lawful vari-ability in the speech wave. In J. S. Perkell & D. H. Klatt (Eds.), Invariance and variability in speech processes (pp. 360-380). Hillsdale, NJ: Lawrence Erlbaum. Elman, J. L., & Zipser, D. (1988). Learning the hidden structure of speech. Journal of the Acoustical Society of America, 83, 1615-1626. Finney, D. J. (1975). Probit analysis: A statistical treatment of the sigmoid response curve (3rd edition). Cambridge, UK: Cam-bridge University Press. Fry, D. B., Abramson, A. S., Eimas, P. D., & Liberman, A. M. (1962). The identification and discrimination of synthetic vow-els. Language and Speech, 5, 171-189. Fujisaki, H., & Kawashima, T. (1969). On the modes and mecha-nisms of speech perception. Annual Report of the Engineering Research Institute, Faculty of Engineering, University of Tokyo, 28, 67-73. Fujisaki, H., & Kawashima, T. (1970). Some experiments on speech perception and a model for the perceptual mechanism. Annual Report of the Engineering Research Institute, Faculty of Engi-neering, University of Tokyo, 29, 207-214. Fujisaki, H., & Kawashima, T. (1971). A model of the mecha-nisms for speech perception - quantitative analysis of categori-cal effects in discrimination. Annual Report of the Engineering Research Institute, Faculty of Engineering, University of Tokyo, 30, 59-68. Goldstone, R. L. (1994). Influences of categorization on perceptual discrimination. Journal of Experimental Psychology: General, 123, 178-200. Goldstone, R. L. (1998). Perceptual learning. Annual Review of Psychology, 49, 585-612. Goldstone, R. L., Steyvers, M., & Larimer, K. (1996). Categorical perception of novel dimensions. In Proceedings of the 18th an-nual conference of the cognitive science society (pp. 243-248). Hillsdale, NJ: Lawrence Erlbaum. Green, D. M., & Swets, J. (1966). Signal detection theory and psy-chophysics. New York: Wiley. ((1988 reprint edition published by Penisula Press, Los Altos, CA)) Greenwood, D. D. (1961). Critical bandwidth and the frequency coordinates on the basilar membrane. Journal of the Acoustical Society of America, 33, 780-801. Grossberg, S. (1968a). Some nonlinear networks capable of learn-ing a spatial pattern of arbitrary complexity. Proceedings of the National Academy of Sciences, USA, 59, 368-372. Grossberg, S. (1968b). Some physiological and biological con-sequences of psychological postulates. Proceedings of the Na-tional Academy of Sciences, USA, 60, 758-765. Grossberg, S. (1969). Embedding fields: A theory of learning with physiological implications. Journal of Mathematical Psychol-ogy, 6, 209-239. Grossberg, S. (1978). Do all neural models really look alike? A comment on Anderson, Silverstein, Ritz and Jones. Psycho-logical Review, 85, 592-596. Grossberg, S. (1986). The adaptive self-organization of serial or-der in behavior: Speech, language and motor control. In E. C. Schwab & H. C. Nusbaum (Eds.), Pattern recognition by hu-mans and machines. vol. 1: Speech perception (pp. 187-294). London, UK: Academic. Grossberg, S. (1987). Competitive learning: From interactive acti-vation to adaptive resonance. Cognitive Science, 11, 23-63. Guenter, F. H., & Gjaja, M. N. (1996). The perceptual magnet effect as an emergent property of neural map formation. Journal of the Acoustical Society of America, 100, 1111-1121. Hanson, S. J., & Burr, D. J. (1990). What connectionist models learn: Learning and representation in connectionist networks. Behavioral and Brain Sciences, 13, 471-518. Harnad, S. (1982). Metaphor and mental duality. In T. Simon & R. Scholes (Eds.), Language, mind and brain (pp. 189-211). Hillsdale, NJ: Lawrence Erlbaum. Harnad, S. (Ed.). (1987). Categorical perception: the groundwork of cognition. Cambridge, UK: Cambridge University Press. Harnad, S., Hanson, S. J., & Lubin, J. (1991). Categorical percep-tion and the evolution of supervised learning in neural nets. In D. W. Powers & L. Reeker (Eds.), Working papers of the aaai spring symposium on machine learning of natural language and ontology (pp. 65-74). Stanford, CA. Harnad, S., Hanson, S. J., & Lubin, J. (1995). Learned categorical perception in neural nets: Implications for symbol grounding. In V. Honavar & L. Uhr (Eds.), Symbol processors and con-nectionist network models in artificial intelligence and cognitive modeling: Steps towards principled integration (pp. 191-206). London: Academic Press. Hary, J. M., & Massaro, D. M. (1982). Categorical results do not imply categorical perception. Perception and Psychophysics, 32, 409-418. Healy, A. F., & Repp, B. H. (1982). Context independence and phonetic mediation in categorical perception. Journal of Ex-perimental Psychology: Human Perception and Performance, 8, 68-80. Hinton, G. E., & Anderson, J. A. (Eds.). (1981). Parallel models of associative memory. Hillsdale, NJ: Lawrence Erlbaum. Holland, J. H. (1998). Emergence: From chaos to order. Reading, MA: Addison-Wesley. Howell, P., Rosen, S., Laing, H., & Sackin, S. (1992). The role of F1 transitions in the perception of voicing in initial plosives (Tech. Rep. No. 6). Speech, Hearing and Language: Work in Progress, University College London. Iverson, P., & Kuhl, P. K. (1995). Mapping the perceptual magnet effect for speech using signal detection theory and multidimen-sional scaling. Journal of the Acoustical Society of America, 97, 553-562. Jordan, M. I. (1986). Attractor dynamics and parallelism in a con-nectionist sequential machine. In Proceedings of the 8th annual conference of the cognitive science society (pp. 531-546). Hills-dale, NJ: Lawrence Erlbaum. Kaplan, H. L., Macmillan, N. A., & Creelman, C. D. (1978). Tables of d n for variable standard discrimination paradigms. Behavioral Research Methods and Instrumentation, 10, 796-813. Kohonen, T. (1977). Associative memory: A system theoretic ap-proach. Berlin: Springer-Verlag. Kruschke, J. K. (1992). ALCOVE: An exemplar-based connec-tionist model of category learning. Psychological Review, 99, 22-44. Kuhl, P. K. (1987). The special-mechanisms debate in speech re-search: Categorization tests on animals and infants. In S. Har-nad (Ed.), Categorical perception: the groundwork of cognition. Cambridge, UK: Cambridge University Press. Kuhl, P. K. (1991). Human adults and human infants show a ?per-ceptual magnet effect? for the prototypes of speech categories, monkeys do not. Perception and Psychophysics, 50, 93-107. Kuhl, P. K., & Miller, J. D. (1978). Speech perception by the chinchilla: Identification functions for synthetic VOT stimuli. Journal of the Acoustical Society of America, 63, 905-917. Kuhl, P. K., & Padden, D. M. (1982). Enhanced discriminability at the phonetic boundaries for the voicing feature in macaques. Perception and Psychophysics, 32, 542-550. Kuhl, P. K., & Padden, D. M. (1983). Enhanced discriminability at the phonetic boundaries for the place feature in macaques. Journal of the Acoustical Society of America, 73, 1003-1010. Lacerda, F. (1998). An exemplar-based account of emergent pho-netic categories. Journal of the Acoustical Society of America, 103, 2980(A). Liberman, A. M. (1996). Speech: A special code. Cambridge, MA: MIT Press. Liberman, A. M., Cooper, F. S., Shankweiler, D. P., & Studdert-Kennedy, M. (1967). Perception of the speech code. Psycholog-ical Review, 74, 431-461. Liberman, A. M., Delattre, P. C., & Cooper, F. S. (1958). Some cues for the distinction between voiced and voiceless stops in initial position. Language and Speech, 1, 153-167. Liberman, A. M., Harris, K. S., Hoffman, H. S., & Griffith, B. C. (1957). The discrimination of speech sounds within and across phoneme boundaries. Journal of Experimental Psychology, 54, 358-368. Liberman, A. M., & Mattingly, I. (1985). The motor theory of speech perception revised. Cognition, 21, 1-36. Liberman, A. M., & Mattingly, I. (1989). A specialization for speech perception. Science, 243, 489-494. Lisker, L., & Abramson, A. (1964). A cross-language study of voicing in initial stops. Word, 20, 384-422. Lisker, L., & Abramson, A. (1970). The voicing dimension: Some experiments in comparative phonetics. In Proceedings of 6th international congress of phonetic sciences, prague, 1967 (pp. 563-567). Academia, Prague. Livingstone, K. R., Andrews, J. K., & Harnad, S. (1998). Cate-gorical perception effects induced by category learning. Journal of Experimental Psychology: Learning, Memory and Cognition, 123, 178-200. Lloyd, D. (1989). Simple minds. Cambridge, MA: Bradford Books/MIT Press. Lotto, A. J., Kluender, K. R., & Holt, L. L. (1998). Depolarizing the perceptual magnet effect. Journal of the Acoustical Society of America, 103, 3648-3655. Luce, R. D. (1959). Individual choice behavior. New York, NY: John Wiley. Luce, R. D. (1963). A threshold theory for simple detection exper-iments. Psychological Review, 70, 61-79. Macmillan, N. A. (1987). Beyond the categorical/continuous dis-tinction: A psychophysical approach to processing modes. In S. Harnad (Ed.), Categorical perception: the groundwork of cognition (pp. 53-85). Cambridge, UK: Cambridge University Press. Macmillan, N. A., Braida, L. D., & Goldberg, R. F. (1987). Central and peripheral effects in the perception of speech and nonspeech sounds. In M. E. H. Schouten (Ed.), The psychophysics of speech perception (pp. 28-45). Dordrecht, The Netherlands: Martinus Nijhoff. Macmillan, N. A., & Creelman, C. D. (1991). Detection theory: A user?s guide. New York, NY: Cambridge University Press. Macmillan, N. A., Goldberg, R. F., & Braida, L. D. (1988). Resolu-tion for speech sounds: Basic sensitivity and context memory on vowel and consonant continua. Journal of the Acoustical Society of America, 84, 1262-1280. Macmillan, N. A., Kaplan, H. L., & Creelman, C. D. (1977). The psychophysics of categorical perception. Psychological Review, 84, 452-471. Massaro, D. W. (1987a). Categorical partition: A fuzzy logical model of categorical behavior. In S. Harnad (Ed.), Categorical perception: the groundwork of cognition (pp. 254-283). Cam-bridge, UK: Cambridge University Press. Massaro, D. W. (1987b). Speech perception by ear and eye: A paradigm for psychological inquiry. Hillsdale, NJ: Lawrence Erlbaum Associates. Massaro, D. W., & Oden, G. C. (1980). Speech perception: A framework for research and theory. In N. Lass (Ed.), Speech and language, vol. 3: Advances in basic research and practice (pp. 129-165). New York, NY: Academic Press. McClelland, J. L., & Elman, J. L. (1986). The TRACE model of speech perception. Cognitive Psychology, 18, 1-86. Miller, G. A. (1956). The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychological Review, 63, 81-97. Miller, J. L. (1994). On the internal structure of phonetic categories: A progress report. Cognition, 50, 271-285. Nearey, T. M. (1997). Speech perception as pattern recognition. Journal of the Acoustical Society of America, 101, 3241-3254. Nossair, Z. B., & Zahorian, S. A. (1991). Dynamic spectral shape features as acoustic correlates for initial stop consonants. Jour-nal of the Acoustical Society of America, 89, 2978-2991. Pastore, R. E. (1987). Categorical perception: Some psychophys-ical models. In S. Harnad (Ed.), Categorical perception: the groundwork of cognition (pp. 29-52). Cambridge, UK: Cam-bridge University Press. Pastore, R. E. (1987a). Possible acoustic bases for the percep-tion of voicing contrasts. In M. E. H. Schouten (Ed.), The psy-chophysics of speech perception (pp. 188-198). Dordrecht, The Netherlands: Martinus Nijhoff. Pastore, R. E., Szczesiul, R., Wielgus, V., Nowikas, K., & Logan, R. (1984). Categorical perception, category boundary effects, and continuous perception: A reply to Hary and Massaro. Perception and Psychophysics, 35, 583-585. Pevtzow, R., & Harnad, S. (1997). Warping similarity space in category learning by human subjects: The role of task difficulty. In Proceedings of SimCat 1997: Interdisciplinary workshop on similarity and categorisation (pp. 189-195). Department of Ar-tificial Intelligence, University of Edinburgh. Pisoni, D. B., & Lazarus, J. H. (1974). Categorical and non-categorical modes of speech perception along the voicing con-tinuum. Journal of the Acoustical Society of America, 55, 328- 333. Pisoni, D. B., & Tash, J. (1974). Reaction time to comparisons within and across phonetic categories. Perception and Psy-chophysics, 15, 285-290. Pont, M. J., & Damper, R. I. (1991). A computational model of afferent neural activity from the cochlea to the dorsal acoustic stria. Journal of the Acoustical Society of America, 89, 1213- 1228. Port, R. F. (1990). Representation and recognition of temporal patterns. Connection Science, 2, 151-176. Quinlan, P. (1991). Connectionism and psychology: A Psycholog-ical perspective on new connectionist research. Hemel Hemp-stead, UK: Harvester Wheatsheaf. Repp, B. H. (1984). Categorical perception: Issues, methods and findings. In N. Lass (Ed.), Speech and language, vol. 10: Ad-vances in basic research and practice (pp. 244-335). Orlando, FL: Academic Press. Repp, B. H., Healy, A. F., & Crowder, R. G. (1979). Categories and context in the perception of isolated steady-state vowels. Jour-nal of Experimental Psychology: Human Perception and Perfor-mance, 5, 129-145. Repp, B. H., & Liberman, A. M. (1987). Phonetic category bound-aries are flexible. In S. Harnad (Ed.), Categorical perception: the groundwork of cognition (pp. 89-112). Cambridge, UK: Cambridge University Press. Rosen, S. M. (1979). Range and frequency effects in consonant categorization. Journal of Phonetics, 7, 393-402. Rumelhart, D. E., Hinton, G. E., & Williams, R. (1986). Learning representations by back-propagating errors. Nature, 323, 533- 536. Rumelhart, D. E., & McClelland, J. L. (Eds.). (1986). Parallel dis-tributed processing: Explorations in the microstructure of cogni-tion (2 volumes). Cambridge, MA: Bradford Books/MIT Press. Rumelhart, D. E., & Zipser, D. (1985). Feature discovery by com-petitive learning. Cognitive Science, 9, 75-112. Schyns, P. G. (1991). A modular neural network model of concept acquisition. Cognitive Science, 15, 461-508. Sinex, D. G., & McDonald, L. P. (1988). Average discharge rate representation of voice-onset time in the chinchilla auditory nerve. Journal of the Acoustical Society of America, 83, 1817- 1827. Steels, L. (1991). Towards a theory of emergent functionality. In J.-A. Meyer & S. W. Wilson (Eds.), From animals to animats: Proceedings of the 1st international conference on simulation of adaptive behavior (pp. 451-461). Cambridge, MA: Bradford Books/MIT Press. Stevenage, S. V. (1995). Which twin are you? A demonstration of induced category learning. British Journal of Psychology, 89, 39-57. Studdert-Kennedy, M., Liberman, A. M., Harris, K. S., & Cooper, F. S. (1970). Motor theory of speech perception: A reply to Lane?s critical review. Psychological Review, 77, 234-239. Takagi, N. (1995). Signal detection modeling of Japanese listen-ers? /r/-/l/ labeling behavior in a one-interval identification task. Journal of the Acoustical Society of America, 97, 563-574. Tijsseling, A., & Harnad, S. (1997). Warping similarity space in category learning by backprop nets. In Proceedings of SimCat 1997: Interdisciplinary workshop on similarity and categorisa-tion (pp. 263-269). Department of Artificial Intelligence, Uni-versity of Edinburgh. Treisman, M., Faulkner, A., Naish, P. L. N., & Rosner, B. S. (1995). Voice-onset time and tone-onset time: The role of criterion-setting mechanisms in categorical perception. Quarterly Journal of Experimental Psychology, 48A, 334-366. Treisman, M., & Williams, T. C. (1984). A theory of criterion setting with an application to sequential dependencies. Psycho-logical Review, 91, 68-111. Volaitis, L. E., & Miller, J. L. (1992). Phonetic prototypes: In-fluence of place of articulation and speaking rate on the internal structure of voicing categories. Journal of the Acoustical Society of America, 92, 723-735. Wood, C. C. (1976). Discriminability, response bias, and phoneme categories in discrimination of voice onset time. Journal of the Acoustical Society of America, 60, 1381-1389. Wood, C. C. (1978). Variations on a theme by Lashley: Lesion ex-periments with the neural model of Anderson, Silverstein, Ritz and Jones. Psychological Review, 85, 582-591. relation_type: [] relation_uri: [] reportno: ~ rev_number: 12 series: ~ source: ~ status_changed: 2007-09-12 16:39:06 subjects: - comp-sci-neural-nets - percep-cog-psy succeeds: ~ suggestions: ~ sword_depositor: ~ sword_slug: ~ thesistype: ~ title: Neural Network Models of Categorical Perception type: journalp userid: 63 volume: 62