【4】Speech signal improvement in real-time communication.pdf

编号:129367 PDF 29页 2.96MB 下载积分:VIP专享
下载报告请您先登录!

【4】Speech signal improvement in real-time communication.pdf

1、Speech Signal Improvement In Real-time CommunicaitonYannan WangTencent Ethereal Audio Lab,Tencent,Shenzhen,ChinaOutline1.Introduction2.Speech Signal Improvement3.Future work2 BackgroundReal-time communication(RTC)systems widely used:Teleconferencing systems Video callsReason for speech quality of cu

2、rrent RTC systems:Device robustness Acoustical capturing Noise/reverberation corruption Interfering speakers Network congestion3Introduction4Device robustnessOutline1.Introduction2.Speech Signal ImprovementI.EnhancementII.Restoration3.Future work56键盘雨声微信消息提示桌子放水杯咳嗽语音降噪7 房间墙壁、天花板、地面、各种物体的反射声波和直达波叠加,降

3、低语音质量和清晰度 传统方法缺陷:难以准确估计纯净语音和混响语音的非线性映射关系 算法需要先验信息较多,收敛较慢 去除混响的成分较少,效果不够明显去混响8说话人提取有感注册无感注册Yukai Ju et al.ASLPNPU&Tencent Ethereal Audio Lab,ChinaOur pervious winner model-TEA-PSE2The 1ststage network:estimate the target speakers magnitude with noisy phaseThe 2ndstage network:estimate the residual re

4、al and imaginary part Use simple concatenation method to combine speaker embeddingRelated WorksYukai Ju et al.ASLPNPU&Tencent Ethereal Audio Lab,ChinaContributionIncorporate a residual LSTM3after squeezed temporal convolution network(S-TCN)to enhance sequence modeling capabilitiesLocal-global repres

5、entation(LGR)4 structure is introduced to boost speaker information extractionMulti-STFT resolution loss5 is used to effectively capture the time-frequency characteristics of the speech signalsRetraining methods are employed based on the freeze training strategy to fine-tune the systemTEA-PSE 3.0 ra

6、nks 1st in both ICASSP 2023 DNS-Challenge track 1 and track 26TEA-PSE 3.0Yukai Ju et al.ASLPNPU&Tencent Ethereal Audio Lab,ChinaNetwork structureSame dual-stage framework as TEA-PSEResidual LSTM is added after every S-TCN module to further enhance the models sequence modeling capabilitiesLocal-globa

友情提示

1、下载报告失败解决办法
2、PDF文件下载后,可能会被浏览器默认打开,此种情况可以点击浏览器菜单,保存网页到桌面,就可以正常下载了。
3、本站不支持迅雷下载,请使用电脑自带的IE浏览器,或者360浏览器、谷歌浏览器下载即可。
4、本站报告下载后的文档和图纸-无水印,预览文档经过压缩,下载后原文更清晰。

本文(【4】Speech signal improvement in real-time communication.pdf)为本站 (2200) 主动上传,三个皮匠报告文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知三个皮匠报告文库(点击联系客服),我们立即给予删除!

温馨提示:如果因为网速或其他原因下载失败请重新下载,重复下载不扣分。
客服
商务合作
小程序
服务号
折叠