Titlebook: Advances in Speech and Language Technologies for Iberian Languages; IberSPEECH 2014 Conf Juan Luis Navarro Mesa,Alfonso Ortega,Doroteo T. T

Original poster: Causalgia
41#
Posted on 2025-3-28 17:04:50
https://doi.org/10.1007/978-3-319-48354-2
... across the AHC iterations is done using reference speaker ground-truth labels to select the purer clustering as input for the global framework. Experiments on the REPERE phase 1 test database show improvements of around 6% absolute DER compared to the baseline system output.
42#
Posted on 2025-3-28 19:47:46
43#
Posted on 2025-3-29 00:12:10
Xiaobin Qiu, Hongqian Chen, Nan Zhou
...d. All processing is done locally on the phone, which is able to react in real time to incoming keywords. In this paper we describe the application, review the matching algorithm we used, and show experimentally that it successfully reacts to voice commands in a variety of acoustic conditions.
44#
Posted on 2025-3-29 06:08:03
Global Speaker Clustering towards Optimal Stopping Criterion in Binary Key Speaker Diarization
... across the AHC iterations is done using reference speaker ground-truth labels to select the purer clustering as input for the global framework. Experiments on the REPERE phase 1 test database show improvements of around 6% absolute DER compared to the baseline system output.
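The "6% absolute DER" figure is a difference in percentage points, not a ratio. The following is a minimal sketch of the standard diarization error rate definition (missed speech, false alarm, and speaker confusion time over total reference speech time) and of absolute versus relative improvement. The function names and every number in the usage lines are hypothetical illustrations, not values from the paper.

```python
def diarization_error_rate(missed, false_alarm, confusion, total_speech):
    """DER as the fraction of reference speech time that is scored as an error.

    All arguments are durations in the same unit (e.g. seconds).
    """
    return (missed + false_alarm + confusion) / total_speech


def absolute_improvement(baseline_der, new_der):
    """Improvement in DER points (e.g. 0.30 -> 0.24 is 0.06, i.e. 6% absolute)."""
    return baseline_der - new_der


def relative_improvement(baseline_der, new_der):
    """Improvement as a fraction of the baseline DER."""
    return (baseline_der - new_der) / baseline_der


# Hypothetical durations and rates, chosen only to illustrate the arithmetic.
der = diarization_error_rate(missed=10.0, false_alarm=5.0, confusion=15.0,
                             total_speech=100.0)
abs_gain = absolute_improvement(0.30, 0.24)
rel_gain = relative_improvement(0.30, 0.24)
```

With these made-up rates, the absolute gain is 6 points while the relative gain is 20%, which is why the two should not be confused when reading reported results.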
45#
Posted on 2025-3-29 11:14:38
CVX-Optimized Beamforming and Vector Taylor Series Compensation with German ASR Employing Star-Shape...
...dium-vocabulary German database for microphone array made of embedded clean signals contaminated with real room impulsive responses and mixed in a ‘natural’ way with real noises. We show that the proposed enhancement framework performs better than other related systems on the presented database.
46#
Posted on 2025-3-29 13:35:42
Flexible Stand-Alone Keyword Recognition Application Using Dynamic Time Warping
...d. All processing is done locally on the phone, which is able to react in real time to incoming keywords. In this paper we describe the application, review the matching algorithm we used, and show experimentally that it successfully reacts to voice commands in a variety of acoustic conditions.
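For readers unfamiliar with dynamic time warping, here is a minimal sketch of textbook DTW used for template-based keyword matching. It is not the paper's matching algorithm, whose details are not given in this excerpt. The `recognize` helper, the length normalization, and the threshold are assumptions made for illustration.

```python
import math


def dtw_distance(a, b):
    """Textbook DTW cost between two sequences of equal-dimension feature vectors."""
    n, m = len(a), len(b)
    # cost[i][j] = cheapest alignment cost of a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # Euclidean frame distance
            cost[i][j] = d + min(cost[i - 1][j],      # a advances, b frame repeats
                                 cost[i][j - 1],      # b advances, a frame repeats
                                 cost[i - 1][j - 1])  # both advance
    return cost[n][m]


def recognize(query, templates, threshold):
    """Return the best-matching keyword, or None if no template is close enough.

    `templates` maps keyword -> feature sequence. Dividing by the summed
    lengths is one common normalization (an assumption here, not the paper's).
    """
    best_word, best_score = None, math.inf
    for word, template in templates.items():
        score = dtw_distance(query, template) / (len(query) + len(template))
        if score < best_score:
            best_word, best_score = word, score
    return best_word if best_score <= threshold else None
```

In a real application the sequences would hold per-frame acoustic features such as MFCC vectors; the rejection threshold is what lets the system ignore speech that matches no stored keyword.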
47#
Posted on 2025-3-29 19:37:59
Statistical Text-to-Speech Synthesis of Spanish Subtitles
... the best of our knowledge, this is the first time that a DNN-based TTS system has been implemented for the synthesis of Spanish. A comparative objective evaluation between both models has been carried out. Our results show that DNN-based systems can reconstruct speech waveforms more accurately.
48#
Posted on 2025-3-29 20:50:04
Unsupervised Training of PLDA with Variational Bayes
...re latent variables. We experimented on unlabeled NIST SRE data. The trained models were evaluated on NIST SRE10. Compared to cosine distance, unsupervised PLDA improved EER by 28% and minimum DCF by 36%.
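As background for the cosine-distance baseline and the EER metric mentioned above, the sketch below shows cosine scoring between two speaker vectors and a simple threshold sweep that approximates the equal error rate. It does not implement PLDA or the paper's evaluation protocol. The function names are hypothetical, and the sweep over observed scores is a simplification of the interpolated EER used in formal evaluations.

```python
import math


def cosine_score(x, y):
    """Cosine similarity between two speaker vectors (higher = more similar)."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)


def equal_error_rate(target_scores, nontarget_scores):
    """Approximate EER by sweeping thresholds over all observed scores.

    At each threshold t, a trial is accepted when score >= t. The EER is
    taken where the miss rate and false-alarm rate are closest to each other.
    """
    best_gap, eer = math.inf, None
    for t in sorted(set(target_scores) | set(nontarget_scores)):
        miss = sum(s < t for s in target_scores) / len(target_scores)
        false_alarm = sum(s >= t for s in nontarget_scores) / len(nontarget_scores)
        gap = abs(miss - false_alarm)
        if gap < best_gap:
            best_gap, eer = gap, (miss + false_alarm) / 2
    return eer
```

Swapping `cosine_score` for a different scoring function and recomputing the EER on the same trial list is the usual way two back-ends are compared.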
49#
Posted on 2025-3-30 02:32:30
50#
Posted on 2025-3-30 06:08:38