A Real - time Decomposition / Synthesis Method Based on Auditory Perception

【Publications】

Title：A Real - time Decomposition / Synthesis Method Based on Auditory Perception

Country：China

Patent No.：201611026399.6

Legal Status：Authorized

Inventor：Donmei Li,Youwei Yang,Rui Jia,Runsheng Liu

Assignee：Tsinghua University

Address：Tsinghua University,Haidian District Beijing 100084, China

Filing Date：2016-11-18

Issue Date：2020-06-05

Abstract：

The invention discloses a digital speech real-time decomposition/synthesis method based on auditory perception characteristics, and relates to the field of voice signal processing. The method comprises the following steps: forming an N-order Gammatone filter through N-stage-cascaded second-order band-pass filters, and then, constructing an arbitrary-order Gammatone digital filter model and parameters thereof; in the speech decomposition stage, decomposing an input speech into M paths of signals by adopting a floating-point algorithm or a fixed-point algorithm and through M paths of Gammatone filters; and in the speech synthesis stage, introducing time delay in a Gammatone filterbank to accord with characteristics of the human ear better, human ear basilar membrane time delay being inversely proportional to frequency, and finally, carrying out speech synthesis operation. Through reference to equiloudness curve characteristics of the human ear, the speech decomposition/synthesis method is improved, and thus the final speech synthesis effect is allowed to be close to the effect of an ideal band-pass filter. The method can be applied to speech equipment of a mobile phone, an artificial cochlea and a hearing aid and the like.

Patent Certificate： PDF/Jpg