UWA Logo
  Faculty Home | School Home | Internal Page | Awesome Animations   
           
Home
About the School
Contact and People
Future Undergraduate Students
Prospective Postgraduates
Current Students
Current Postgraduates
Research
IT News
Awards
Industry Links and Prizes
School and IT Information
Other
Internal Information

Research Seminar - October 13, 2000

Seminar Announcement



Title: Visual solutions for the translation between sign language and spoken language.
Speaker: Dr Eunjung Holden
  Department of Computer Science & Software Engineering, UWA
Date: Friday 13th October, 2000
Time: 3.00pm
Venue: Seminar Room 1.24

Abstract

 

Deaf communities in Australia use a sign language called Auslan. Building a 2-way translation system between Auslan and spoken English requires research in various areas such as gesture recognition, speech recognition and graphical sign display. We have been working on vision-based solutions towards building such a system.

Three separate systems are presented at this talk. Firstly, a fine-grain hand motion recognition system will be introduced. This system detects Auslan basic hand shapes and motion from the visual input and recognises them as a sign. It uses a combination of a 3D hand tracker for motion sensing and an adaptive fuzzy expert system for classification. Secondly, a visual lipreading technique will be presented. This technique uses a combination of image processing and acoustic speech recognition techniques. From a sequence of images, the speaker's mouth contour is tracked and used to recognise the inner mouth appearance which is achieved by using Cepstral image analysis on mouth region images in order to produce Higher Order Local AutoCorrelation (HLAC) features. These features are used for lipreading. Thirdly, a 3D head tracker that detects a speaker's head orientation will be presented. In automatic lipreading, the speaker's head movement can affect the mouth shape appearing in the captured images independently of the true mouth shape. Since such distortion can lead to incorrect recognition, the head tracker extracts 6 degrees-of-freedom head orientation parameters and then corrects the inaccurate 2D mouth dimensions detected from the images.

The talk will also discuss how these techniques can be integrated towards building a translation system.

Top of Page