基于线性预测编码与AMDF的高精度基音检测算法

时间:2022-08-30 04:51:19

基于线性预测编码与AMDF的高精度基音检测算法

摘要:根据语音信号产生原理,结合线性预测编码(LPC)与平均幅度差函数法(AMDF),提出了一种高精度的基音检测算法。该算法首先利用线性预测分析提取残差信号;然后采用累积平均归一化差分函数与差分信号修正,使基音周期的谷值点更加尖锐;最后利用二次函数拟合与基音周期的倍数检查筛选候选值,得到了准确的基音周期。实验结果表明,与传统方法相比, 该算法的基音检测效果有了明显改善,减少了基音检测中的半频错误,在高信噪比下具有良好的准确性和鲁棒性。

关键词:语音信号;基音周期;线性预测编码;平均幅度差函数;自相关函数

中图分类号: TP391.42 文献标志码:A

Super resolution pitch detection based on LPC and AMDF

WANG En.cheng1, SU Teng.fang1*, YUAN Kai.guo2, WU Chun.hua2

1. School of Information Engineering, North China University of Technology, Beijing 100144, China;

2. School of Computer, Beijing University of Posts and Telecommunications, Beijing 100876, China

Abstract:

According to the mechanism of speech signal, a super resolution pitch detection algorithm which combined Linear Predictive Coding (LPC) with Average Magnitude Difference Function (AMDF) was proposed. Firstly, residual of LPC was extracted by linear predictive analysis. Then, cumulative mean normalized difference function and difference signal revision were used to make pitch valley sharper. At last, parabolic interpolation and pitch multiple check were taken to select real pitch period. Experimental results indicate that the pitch detection effect of the algorithm is superior to that of the conventional algorithms. The proposed algorithm conquers half frequency errors, and has good accuracy and robustness under the condition of high SNR.

According to the mechanism of speech signal, a super resolution pitch detection algorithm, which combined Linear Predictive Coding (LPC) with Average Magnitude Difference Function (AMDF), was proposed. Firstly, residual of LPC was extracted by linear predictive analysis. Then, cumulative mean normalized difference function and difference signal revision were used to make pitch valley sharper. At last, parabolic interpolation and pitch multiple check were taken to select real pitch period. The experimental results indicate that the pitch detection effect of the algorithm is superior to that of the conventional algorithms. The proposed algorithm conquers half frequency errors, and has good accuracy and robustness under the condition of high Signal.to.Noise Ratio (SNR).

Key words:

speech signal; pitch period; Linear Predictive Coding (LPC); Average Magnitude Difference Function (AMDF); Auto Correlation Function (ACF)

0 引言

人在发浊音时,气流通过声门使声带产生张驰振荡式振动,形成一股准周期脉冲气流,这种声带振动的频率称为基音频率,相应的周期称为基音周期。基音周期是语音信号中的重要参数,它在语音分析、语音合成、语音识别和语音编码中有重要的作用[1]。常用的基音提取算法有时域法、频域法和时频域混合法,时域法最常用的是自相关函数法(AutoCorrelation Function,ACF)、平均幅度差函数法(Average Magnitude Difference Function,AMDF)。

传统AMDF有两项缺陷:1)对于周期性和平稳性不太好的浊音信号,其AMDF的第一周期谷值点并不是全局最低谷值点,而全局谷值点出现在基音周期的倍数位置处,此时就会产生严重的倍音错误,即半频错误。2)AMDF对语言信号幅度的快速变化比较敏感,并随着延迟时间的增加,峰值幅度逐渐下降,使得谷值点检测出现困难。在这些方面,许多学者提出了改进方法,其中两种主要的改进方法描述如下:1)采用基音周期候选值搜索算法,选取谷值点“清晰”的点作为最终的基音周期。2)改变AMDF算法的定义以改善性能,如CAMDF、WAMDF、MAMDF 、HR.AMDF和 LV.AMDF等[2-5]。

虽然不同文献都对基本AMDF算法进行了一些改进,减少了基音周期的错误估计,但准确度和精度并不让人满意。

针对这些问题,本文提出了一种基于线性预测编码与平均幅度差函数的高精度基音检测算法,简称LA.amdf(LPC ACCURATE AMDF)。该算法能准确检测出基音周期,有效地减少了半频错误,具有较好的鲁棒性。

1 LPC与AMDF算法描述

LPC与AMDF算法在语音信号的分析与参数的计算过程中都占有很重要的作用。在语音信号的处理中应用LPC技术,不仅利用其预测功能,更利用了其优良的声道模型。LPC利用语音信号之间的相关性,用过去的取样值来预测现在或未来的取样值。在某种测度准则下,通过使实际的取样值与预测值之间的误差最小,确定唯一的一组预测系数,这组预测系数反映了语音信号的基本特征。

1.1 语音信号的线性预测残差信号

线性预测分析[6-7]在估计语音参数方面是一种重要的分析技术,语音信号采用全极点模型,语音抽样信号s(n)可以用以下差分方程来表示:

s(n)=Ge(n)+∑pi=1ais(n-i)(1)

其中G是声道滤波器增益,p是预测阶数,ai是线性预测系数,e(n)是激励信号。在模型参数估计中,把如下系统称为线性预测器:

(n)=∑pi=1ais(n-i)(2)

对语音信号进行线性预测分析,得到线性预测系数ai,从而可以得到逆滤波器A(z),其传输函数为:

A(z)=1-∑pi=1aiz-i (3)

线性预测残差信号ε(n)为:

ε(n)=s(n)-(n)=s(n)-∑pi=1ais(n-i)=Ge(n)(4)

通过对语音信号的产生原理分析可知,可以将准周期脉冲(浊音期间)或白噪声(清音期间)激励一个线性时不变系统(声道)所产生的输出作为语音模型。如图1所示。

上一篇:基于Markov链的嵌入式系统硬件可靠性研究 下一篇:对比源反演算法对二维混合目标重建成像的应用