UMBC ebiquity

The Effect of Perceptual Structure on Multimodal Speech Recognition Interfaces

Authors: Michael A. Grasso, David Ebert, and Tim Finin

Date: January 01, 1998

Abstract: A framework of complementary behavior has been proposed which maintains that direct manipulation and speech interfaces have reciprocal strengths and weaknesses. This suggests that user interface performance and acceptance may increase by adopting a multimodal approach that combines speech and direct manipulation. This effort examined the hypothesis that the speed, accuracy, and acceptance of multimodal speech and direct manipulation interfaces will increase when the modalities match the perceptual structure of the input attributes. A software prototype which supported a typical biomedical data collection task was developed to test this hypothesis. A group of 20 clinical and veterinary pathologists evaluated the prototype in an experimental setting using repeated measures. The results of this experiment supported the hypothesis that the perceptual structure of an input task is an important consideration when designing a multimodal computer interface. Task completion time, the number of speech errors, and user acceptance improved when interface best matched the perceptual structure of the input attributes.

Type: TechReport

Organization: Computer Science and Electrical Engineering

Institution: University of Maryland, Baltimore County

Google Scholar: search

Number of downloads: 498


Available for download as

size: 38844 bytes