A system design for human factors studies of speech-enabled Web browsing

Adams, L. J. and Damper, S. and Harnad, Stevan and Hall, W. (1999) A system design for human factors studies of speech-enabled Web browsing. [Conference Paper]

This paper describes the design of a system which will subsequently be used as the basis of a range of empirical studies aimed at discovering how best to harness speech recognition capabilities in multimodal multimedia computing. Initial work focuses on speech-enabled browsing of the World Wide Web, which was never designed for such use. System design is complete, and is being evaluated via usability testing.

Item Type: Conference Paper
Keywords: speech recognition, web navigation
Subjects: Computer Science > Speech
ID Code: 2135
Deposited By: Harnad, Stevan
Deposited On: 12 Mar 2002
Last Modified: 11 Mar 2011 08:54

