Current Issue Cover
与语速相关的人脸语音动画合成及其评估

周维1, 汪增福1(中国科学技术大学自动化系,合肥 230027)

摘 要
为了有效地合成人脸语音动画,根据对唇区肌肉收缩力与语速关系的分析,以及在对皮肤肌肉组织的粘弹性力学进行研究的基础上,提出了一种新的基于不同语速的唇动模型,并将其应用在了汉语人脸语音动画系统中。该模型根据获得的肌肉收缩力与语速的关系,并通过对皮肤肌肉组织的粘弹性分析,首先得到了语速、唇动速度与唇动位移三者之间的关系,并建立了不同语速下的唇动模型;然后通过这个唇动模型合成了不同语速状态下的具有较高自然度和个性化的人脸语音动画;最后,通过设计感知学评估实验,对合成的语音动画的效果和可理解性进行了评估。实验结果表明,该模型可以合成较高可接受性和可理解性的不同语速状态下的人脸语音动画。
关键词
Speech Rate Related Facial Animation Synthesis and Evaluation

()

Abstract
A novel speech rate related lip movement model is proposed in this paper. The model is based on the research results on the viscoelasticity of skin-muscle tissue and the quantitative relationship between lip muscle force and speech rate. In order to show the validity of the model, we have applied it to our Chinese speech animation system. The experimental results show that our system can synthesize the individualized speech animation with high naturalness at different speech rates. Finally, the perceptual evaluation experiment is designed to evaluate the quality and intelligibility of the synthesized speech animation.
Keywords

订阅号|日报