Title:A Real - time Decomposition / Synthesis Method Based on Auditory Perception
Country:China
Patent No.:201611026399.6
Legal Status:Authorized
Inventor:Donmei Li,Youwei Yang,Rui Jia,Runsheng Liu
Assignee:Tsinghua University
Address:Tsinghua University,Haidian District Beijing 100084, China
Filing Date:2016-11-18
Issue Date:2020-06-05
Abstract:
The invention discloses a digital speech real-time decomposition/synthesis method based on auditory perception characteristics, and relates to the field of voice signal processing. The method comprises the following steps: forming an N-order Gammatone filter through N-stage-cascaded second-order band-pass filters, and then, constructing an arbitrary-order Gammatone digital filter model and parameters thereof; in the speech decomposition stage, decomposing an input speech into M paths of signals by adopting a floating-point algorithm or a fixed-point algorithm and through M paths of Gammatone filters; and in the speech synthesis stage, introducing time delay in a Gammatone filterbank to accord with characteristics of the human ear better, human ear basilar membrane time delay being inversely proportional to frequency, and finally, carrying out speech synthesis operation. Through reference to equiloudness curve characteristics of the human ear, the speech decomposition/synthesis method is improved, and thus the final speech synthesis effect is allowed to be close to the effect of an ideal band-pass filter. The method can be applied to speech equipment of a mobile phone, an artificial cochlea and a hearing aid and the like.
Patent Certificate:
PDF/Jpg