Key points are not available for this paper at this time.
Recent technological advances in connected-speech recognition and position sensing in space have encouraged the notion that voice and gesture inputs at the graphics interface can converge to provide a concerted, natural user modality.
Richard A. Bolt (Tue,) studied this question.