<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta charset="utf-8">
<meta name="generator" content="pdf2htmlEX">
<meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1">
<link rel="stylesheet" href="https://static.pudn.com/base/css/base.min.css">
<link rel="stylesheet" href="https://static.pudn.com/base/css/fancy.min.css">
<link rel="stylesheet" href="https://static.pudn.com/prod/directory_preview_static/626d8c3d7b37011214d45eba/raw.css">
<script src="https://static.pudn.com/base/js/compatibility.min.js"></script>
<script src="https://static.pudn.com/base/js/pdf2htmlEX.min.js"></script>
<script>
try{
pdf2htmlEX.defaultViewer = new pdf2htmlEX.Viewer({});
}catch(e){}
</script>
<title></title>
</head>
<body>
<div id="sidebar" style="display: none">
<div id="outline">
</div>
</div>
<div id="pf1" class="pf w0 h0" data-page-no="1"><div class="pc pc1 w0 h0"><img class="bi x0 y0 w1 h1" alt="" src="https://static.pudn.com/prod/directory_preview_static/626d8c3d7b37011214d45eba/bg1.jpg"><div class="c x0 y1 w2 h2"><div class="t m0 x0 h3 y2 ff1 fs0 fc0 sc0 ls0 ws0">摘要:<span class="sc1">介绍一种适合家电遥控器应用的语音识别算法,该算法使用双模块和两级端点检测方法,能有效地提高识</span></div><div class="t m0 x0 h3 y3 ff1 fs0 fc0 sc1 ls0 ws0">别和稳健性;介绍利用该技术实现的一种新型学习型遥控器,展现了语音识别技术在家电领域的广阔前景。</div><div class="t m0 x0 h3 y4 ff2 fs0 fc0 sc1 ls0 ws0"> <span class="ff1 sc0">关键词:<span class="sc1">语音识别 <span class="ff3">DTW FED FRED </span>学习型遥控器</span></span></div><div class="t m0 x1 h4 y5 ff1 fs1 fc0 sc1 ls0 ws0">家用电器发展的一个重要方面是让用户界面更加人性化,更加方便自然,做到老年人和残疾人可以无障碍地使</div><div class="t m0 x0 h4 y6 ff1 fs1 fc0 sc1 ls0 ws0">用。利用语音识别技术实现语音控制是提高家电产品用户界面质量的一条重要途径。本文以语音控制遥控器为例,说</div><div class="t m0 x0 h4 y7 ff1 fs1 fc0 sc1 ls0 ws0">明语音识别技术如何应用在家电器领域。</div><div class="t m0 x1 h4 y8 ff1 fs1 fc0 sc1 ls0 ws0">适合家用电器应用的语音识别嵌入式系统结构如图<span class="_ _0"> </span><span class="ff3">1<span class="_ _0"> </span></span>所示,它由四个部分组成。第一部分为模<span class="ff3">/</span>数转换部分,其输</div><div class="t m0 x0 h4 y9 ff1 fs1 fc0 sc1 ls0 ws0">入端接收输入的语音信号,并将其转化成数字芯片可处理的数字采集信号;在输出端将解码后的语音数字信号转换为</div><div class="t m0 x0 h4 ya ff1 fs1 fc0 sc1 ls0 ws0">音频模拟信号,通过扬声器放声。第二部分为语音识别部分,它的作用是对输入的数字语音词条信号进行分析,识别</div><div class="t m0 x0 h4 yb ff1 fs1 fc0 sc1 ls0 ws0">出词条信号所代表的命令,一般由<span class="_ _0"> </span><span class="ff3">DSP<span class="_ _0"> </span></span>完成。第三部分语音提示和语音回放部分,它一般也是在<span class="_ _0"> </span><span class="ff3">DSP<span class="_ _0"> </span></span>中完成的,其</div><div class="t m0 x0 h4 yc ff1 fs1 fc0 sc1 ls0 ws0">核心是对语音信号进行数字压缩编码和解码,目的是提示用户操作并对识别语音的响应,完成人机的语音交互。第四</div><div class="t m0 x0 h4 yd ff1 fs1 fc0 sc1 ls0 ws0">部分是系统控制部分,它将语音识别结果转换成相应的控制信号,并将其输出转换成物理层操作,完成具体功能。语</div><div class="t m0 x0 h4 ye ff1 fs1 fc0 sc1 ls0 ws0">音识别与系统控制的有机结合是完成声控交互的关键,下面将对语音识别算法及遥控系统控制部分作详细的讨论。</div><div class="t m0 x0 h4 yf ff4 fs1 fc0 sc1 ls0 ws0">1 <span class="ff1 sc0">语音识别算法</span></div><div class="t m0 x1 h4 y10 ff1 fs1 fc0 sc1 ls0 ws0">目前,常以单片机(<span class="ff3">MCU</span>)或<span class="_ _0"> </span><span class="ff3">DSP<span class="_ _0"> </span></span>作炎硬件平台的实现消费类电子产品中的语音识别。这类语音识别主要为孤立</div><div class="t m0 x0 h4 y11 ff1 fs1 fc0 sc1 ls0 ws0">词识别,它有两种实现方案:一种是基于隐含马尔科夫统计模型(<span class="ff3">HMM</span>)框架的非特定人识别;另一种是基于动态规</div><div class="t m0 x0 h4 y12 ff1 fs1 fc0 sc1 ls0 ws0">划(<span class="ff3">DP</span>)原理的特定人识别。它们在应用上各有优缺点。<span class="ff3">HMM<span class="_ _0"> </span></span>非特定人员的优点是用户无需经过训练,可以直接使</div><div class="t m0 x0 h4 y13 ff1 fs1 fc0 sc1 ls0 ws0">用;并且具良好的稳定性(即对使用者<span class="ff5">而言</span>,语音识别性能<span class="ff5">不会随着时间</span>的<span class="ff5">延长而降低</span>)。<span class="ff5">但</span>非特定人语音识别也有</div><div class="t m0 x0 h4 y14 ff1 fs1 fc0 sc1 ls0 ws0">其<span class="ff5">很难克服</span>的缺<span class="ff5">陷</span>。<span class="ff5">首先</span>,使用该方法需要<span class="ff5">预先</span>采集<span class="ff5">大</span>量的语<span class="ff5">料库</span>,以便训练出相应的识别模型,这<span class="ff5">就大大</span>提高了应</div><div class="t m0 x0 h4 y15 ff1 fs1 fc0 sc1 ls0 ws0">用<span class="ff5">此</span>技术的前<span class="ff5">期</span>成本;其<span class="ff5">次</span>,非特定人语音识别<span class="ff5">很难</span>解<span class="ff5">决汉</span>语中<span class="ff5">不同</span>方<span class="ff5">言</span>的<span class="ff5">问题</span>,<span class="ff5">限</span>制了它的使用<span class="ff5">区</span>域;另<span class="ff5">外还</span>有一</div><div class="t m0 x0 h4 y16 ff1 fs1 fc0 sc1 ls0 ws0">个<span class="ff5">因素</span>也应<span class="ff5">予</span>以<span class="ff5">考虑</span>,家电中用于控制的具体命令词语<span class="ff5">最</span>好<span class="ff5">不</span>要完<span class="ff5">全固</span>定,应<span class="ff5">当根据</span>的用户的习<span class="ff5">惯而改变</span>,这一点在</div><div class="t m0 x0 h4 y17 ff1 fs1 fc0 sc1 ls0 ws0">非特定人识别中<span class="ff5">几乎不</span>可能实现。<span class="ff5">因此大多</span>数家电遥控器<span class="ff5">不</span>适合采用<span class="ff5">此</span>方案。<span class="ff3">DP<span class="_ _0"> </span></span>特定人识别的优点是方法<span class="ff5">简</span>单,对硬</div><div class="t m0 x0 h4 y18 ff1 fs1 fc0 sc1 ls0 ws0">件<span class="ff5">资源</span>要<span class="ff5">求较低</span>;<span class="ff5">此外</span>,这一方法中的训练过<span class="ff5">程</span>也<span class="ff5">很简</span>单,<span class="ff5">不</span>需<span class="ff5">预先</span>采集过<span class="ff5">多</span>的<span class="ff5">样</span>本,<span class="ff5">不仅降低</span>了前<span class="ff5">期</span>成本,<span class="ff5">而</span>且可</div><div class="t m0 x0 h4 y19 ff1 fs1 fc0 sc1 ls0 ws0">以<span class="ff5">根据</span>用户习<span class="ff5">惯</span>,由用户<span class="ff5">任意</span>定<span class="ff5">义</span>控制<span class="ff5">项</span>目的具体命令语<span class="ff5">句</span>,<span class="ff5">因而</span>适合<span class="ff5">大多</span>数家电遥控器的应用。<span class="ff3">DP<span class="_ _0"> </span></span>特定识别的<span class="ff5">严</span>重</div><div class="t m0 x0 h4 y1a ff1 fs1 fc0 sc1 ls0 ws0">缺点是它的稳健性<span class="ff5">不</span>理<span class="ff5">想</span>,对有<span class="ff5">些</span>人的语音识别<span class="ff5">率</span>高,有的人识别<span class="ff5">率却不</span>高;<span class="ff5">刚</span>训练完<span class="ff5">时</span>识别<span class="ff5">率较</span>高,<span class="ff5">但随着时间</span>的</div></div></div><div class="pi" data-data='{"ctm":[1.611850,0.000000,0.000000,1.611850,0.000000,0.000000]}'></div></div>
</body>
</html>