Regular Articles
Volume: 63 | Article ID: jist0603
A Study on Utilization of Three-Dimensional Sensor Lip Image for Developing a Pronunciation Recognition System
  DOI: 10.2352/J.ImagingSci.Technol.2019.63.5.050402  Published Online: September 2019
Abstract

Acoustic-based automatic speech recognition (ASR) is a mature technique that is widely used in numerous applications. However, acoustic-based ASR may not maintain its usual performance for disabled speakers whose faces have atypical eye or mouth geometrical characteristics. To address this problem, this article develops a pronunciation recognition system based on three-dimensional (3D) sensor lip images, in which the 3D sensor captures the variations in a speaker's lip shape during pronunciation. Two types of 3D lip features for pronunciation recognition are presented: a 3D-(x, y, z) coordinate lip feature and 3D geometry lip feature parameters. For the 3D-(x, y, z) coordinate lip feature, 18 location points around the outer and inner lips are defined, each with 3D coordinates. For the 3D geometry lip features, eight types of features that describe the geometrical space characteristics of the inner lip are developed. In addition, feature fusion that combines the 3D-(x, y, z) coordinate and 3D geometry lip features is considered. The performance and effectiveness of the presented 3D sensor lip image based features are evaluated using a classification approach based on principal component analysis (PCA). Experimental results on pronunciation recognition with two datasets, Mandarin syllables and Mandarin phrases, demonstrate the competitive performance of the presented 3D sensor lip image based pronunciation recognition system.
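
As a rough illustration of the pipeline outlined in the abstract, the following Python sketch flattens 18 lip points into a 54-dimensional coordinate feature, fuses it with an 8-dimensional geometry feature by concatenation, and classifies samples in a PCA subspace. Everything beyond the counts 18 and 8 is an assumption made for illustration: the abstract does not specify the eight geometry features, how fusion is performed, or the form of the PCA based classifier, so concatenation and a nearest-class-mean rule are stand-ins, and all function names are hypothetical.

```python
import numpy as np

N_POINTS = 18    # lip location points, as stated in the abstract
N_GEOMETRY = 8   # geometry feature count, as stated in the abstract


def coordinate_feature(points):
    """Flatten an (18, 3) array of (x, y, z) lip points into a 54-vector."""
    points = np.asarray(points, dtype=float)
    assert points.shape == (N_POINTS, 3)
    return points.reshape(-1)


def fuse(coord_vec, geometry_vec):
    """Assumed fusion rule: plain concatenation of the two feature vectors."""
    geometry_vec = np.asarray(geometry_vec, dtype=float)
    assert geometry_vec.shape == (N_GEOMETRY,)
    return np.concatenate([coord_vec, geometry_vec])


def fit_pca(X, n_components):
    """Return the training mean and the top principal directions (via SVD)."""
    mean = X.mean(axis=0)
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]


def project(X, mean, components):
    """Project centered samples onto the principal directions."""
    return (X - mean) @ components.T


def fit_class_means(Z, labels):
    """Mean of the projected training samples for each class label."""
    labels = np.asarray(labels)
    return {c: Z[labels == c].mean(axis=0) for c in np.unique(labels)}


def classify(z, class_means):
    """Assumed decision rule: nearest class mean in the PCA subspace."""
    return min(class_means, key=lambda c: np.linalg.norm(z - class_means[c]))
```

In this sketch each sample is a single fused vector; a real system would also need to handle the sequence of lip frames captured over one utterance, which the abstract does not detail.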

  Cite this article 

Ing-Jr Ding, Chong-Min Ruan, "A Study on Utilization of Three-Dimensional Sensor Lip Image for Developing a Pronunciation Recognition System," in Journal of Imaging Science and Technology, 2019, pp. 050402-1 - 050402-9, https://doi.org/10.2352/J.ImagingSci.Technol.2019.63.5.050402

  Copyright statement 
Copyright © Society for Imaging Science and Technology 2019
  Article timeline 
  • received November 2018
  • accepted April 2019
  • published September 2019
