|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 149 occurrences of 116 keywords
|
|
|
Results
Found 1108 publication records. Showing 1108 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
15 | Rui Liu 0008, Berrak Sisman, Guanglai Gao, Haizhou Li 0001 |
Expressive TTS Training With Frame and Style Reconstruction Loss. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Therdpong Daengsi, Phisit Pornpongtechavanich, Pongpisit Wuttidittachotti |
Comparison of TTS System Efficiency: A Pilot Study of Word Intelligibility between Siri and Google Translate with Thai Language. |
ICAICST |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Chunyu Qiang, Jianhua Tao 0001, Ruibo Fu, Zhengqi Wen, Jiangyan Yi, Tao Wang 0074, Shiming Wang |
Text Enhancement for Paragraph Processing in End-to-End Code-switching TTS. |
ISCSLP |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Iván Vallés-Pérez, Julian Roth, Grzegorz Beringer, Roberto Barra-Chicote, Jasha Droppo |
Improving Multi-Speaker TTS Prosody Variance with a Residual Encoder and Normalizing Flows. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Xiaochun An, Frank K. Soong, Lei Xie 0001 |
Improving Performance of Seen and Unseen Speech Style Transfer in End-to-End Neural TTS. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Yan Deng, Rui Zhao 0017, Zhong Meng, Xie Chen 0001, Bing Liu, Jinyu Li 0001, Yifan Gong 0001, Lei He 0005 |
Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura 0001 |
Dynamically Adaptive Machine Speech Chain Inference for TTS in Noisy Environment: Listen and Speak Louder. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Dominik Schiller, Silvan Mertes, Pol van Rijn, Elisabeth André |
Analysis by Synthesis: Using an Expressive TTS Model as Feature Extractor for Paralinguistic Speech Classification. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Thananchai Kongthaworn, Burin Naowarat, Ekapol Chuangsuwanich |
Spectral and Latent Speech Representation Distortion for TTS Evaluation. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Brooke Stephenson, Thomas Hueber, Laurent Girin, Laurent Besacier |
Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Jason Taylor, Korin Richmond |
Confidence Intervals for ASR-Based TTS Evaluation. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Penny Karanasou, Sri Karlapati, Alexis Moinet, Arnaud Joly, Ammar Abbas, Simon Slangen, Jaime Lorenzo-Trueba, Thomas Drugman |
A Learned Conditional Prior for the VAE Acoustic Space of a TTS System. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Cheng Gong, Longbiao Wang, Ju Zhang 0001, Shaotong Guo, Yuguang Wang, Jianwu Dang 0001 |
TacoLPCNet: Fast and Stable TTS by Conditioning LPCNet on Mel Spectrogram Predictions. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Pol van Rijn, Silvan Mertes, Dominik Schiller, Peter M. C. Harrison, Pauline Larrouy-Maestri, Elisabeth André, Nori Jacoby |
Exploring Emotional Prototypes in a High Dimensional TTS Latent Space. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Myeonghun Jeong, Hyeongju Kim, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim |
Diff-TTS: A Denoising Diffusion Model for Text-to-Speech. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Daniel Tihelka, Markéta Rezácková, Martin Gruber, Zdenek Hanzlícek, Jakub Vít, Jindrich Matousek |
Save Your Voice: Voice Banking and TTS for Anyone. |
Interspeech |
2021 |
DBLP BibTeX RDF |
|
15 | Ye Jia, Heiga Zen, Jonathan Shen, Yu Zhang 0033, Yonghui Wu |
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Yao Shi, Hui Bu, Xin Xu, Shaoji Zhang, Ming Li 0026 |
AISHELL-3: A Multi-Speaker Mandarin TTS Corpus. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Evelina Bakhturina, Vitaly Lavrukhin, Boris Ginsburg, Yang Zhang 0089 |
Hi-Fi Multi-Speaker English TTS Dataset. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang 0033, Ye Jia, R. J. Skerry-Ryan, Yonghui Wu |
Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Hui Lu, Zhiyong Wu 0001, Xixin Wu, Xu Li, Shiyin Kang, Xunying Liu, Helen Meng |
VAENAR-TTS: Variational Auto-Encoder Based Non-AutoRegressive Text-to-Speech Synthesis. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Zhehuai Chen, Andrew Rosenberg, Yu Zhang 0033, Heiga Zen, Mohammadreza Ghodsi, Yinghui Huang, Jesse Emond, Gary Wang, Bhuvana Ramabhadran, Pedro J. Moreno 0001 |
Semi-Supervision in ASR: Sequential MixMatch and Factorized TTS-Based Augmentation. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Aleese Block, Michelle Cohn, Georgia Zellou |
Variation in Perceptual Sensitivity and Compensation for Coarticulation Across Adult and Child Naturally-Produced and TTS Voices. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Ryo Fukuda, Sashi Novitasari, Yui Oka, Yasumasa Kano, Yuki Yano, Yuka Ko, Hirotaka Tokuyama, Kosuke Doi, Tomoya Yanagita, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura 0001 |
Simultaneous Speech-to-Speech Translation System with Transformer-Based Incremental ASR, MT, and TTS. |
O-COCOSDA |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta, Norihide Kitaoka |
Multi-speaker TTS system for low-resource language using cross-lingual transfer learning and data augmentation. |
APSIPA ASC |
2021 |
DBLP BibTeX RDF |
|
15 | Amrith Setlur, Aman Madaan, Tanmay Parekh, Yiming Yang, Alan W. Black |
Towards Using Heterogeneous Relation Graphs for End-to-End TTS. |
ASRU |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara |
Data Augmentation for ASR Using TTS Via a Discrete Representation. |
ASRU |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Wen-Chin Huang, Tomoki Hayashi, Xinjian Li, Shinji Watanabe 0001, Tomoki Toda |
On Prosody Modeling for ASR+TTS Based Voice Conversion. |
ASRU |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Pascal Puchtler, Johannes Wirth 0001, René Peinl |
HUI-Audio-Corpus-German: A High Quality TTS Dataset. |
KI |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Song Li, Beibei Ouyang, Lin Li 0032, Qingyang Hong |
Light-TTS: Lightweight Multi-Speaker Multi-Lingual Text-to-Speech. |
ICASSP |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Murali Karthick Baskar, Lukás Burget, Shinji Watanabe 0001, Ramón Fernandez Astudillo, Jan Honza Cernocký |
Eat: Enhanced ASR-TTS for Self-Supervised Speech Recognition. |
ICASSP |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Changhe Song, Jingbei Li, Yixuan Zhou 0002, Zhiyong Wu 0001, Helen M. Meng |
Syntactic Representation Learning For Neural Network Based TTS with Syntactic Parse Tree Traversal. |
ICASSP |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Wei Wang, Zhikai Zhou, Yizhou Lu, Hongji Wang, Chenpeng Du, Yanmin Qian |
Towards Data Selection on TTS Data for Children's Speech Recognition. |
ICASSP |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Feng-Long Xie, Xinhui Li, Wen-Chao Su, Li Lu, Frank K. Soong |
A New High Quality Trajectory Tiling Based Hybrid TTS In Real Time. |
ICASSP |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Keisuke Matsubara, Takuma Okamoto, Ryoichi Takashima, Tetsuya Takiguchi, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai |
High-Intelligibility Speech Synthesis for Dysarthric Speakers with LPCNet-Based TTS and CycleVAE-Based VC. |
ICASSP |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang 0033, Ye Jia, Ron J. Weiss, Yonghui Wu |
Parallel Tacotron: Non-Autoregressive and Controllable TTS. |
ICASSP |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Liping Chen, Yan Deng, Xi Wang 0016, Frank K. Soong, Lei He 0005 |
Speech Bert Embedding for Improving Prosody in Neural TTS. |
ICASSP |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Detai Xin, Tatsuya Komatsu, Shinnosuke Takamichi, Hiroshi Saruwatari |
Disentangled Speaker and Language Representations Using Mutual Information Minimization and Domain Adaptation for Cross-Lingual TTS. |
ICASSP |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Shengkui Zhao, Hao Wang, Trung Hieu Nguyen 0001, Bin Ma 0001 |
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram. |
ICASSP |
2021 |
DBLP DOI BibTeX RDF |
|
15 | K. R. Prajwal, C. V. Jawahar |
Data-Efficient Training Strategies for Neural TTS Systems. |
COMAD/CODS |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Haohan Guo, Shaofei Zhang, Frank K. Soong, Lei He 0005, Lei Xie 0001 |
Conversational End-to-End TTS for Voice Agents. |
SLT |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Yiling Huang, Yutian Chen, Jason Pelecanos, Quan Wang |
Synth2Aug: Cross-Domain Speaker Recognition with TTS Synthesized Speech. |
SLT |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Adriana Stan, Beáta Lorincz, Maria Nutu, Mircea Giurgiu |
The MARA corpus: Expressivity in end-to-end TTS systems using synthesised speech data. |
SpeD |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Ji-Hoon Kim, Sang-Hoon Lee, Ji-Hyun Lee, Honggyu Jung, Seong-Whan Lee |
GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints. |
SMC |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Yunxin Zhao, Minguang Song, Yanghao Yue, Mili Kuruvilla-Dugdale |
Personalizing TTS Voices for Progressive Dysarthria. |
BHI |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Siddique Latif, Inyoung Kim, Ioan Calapodescu, Laurent Besacier |
Controlling Prosody in End-to-End TTS: A Case Study on Contrastive Focus Generation. |
CoNLL |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Prachi Govalkar, Ahmed Mustafa, Nicola Pia, Judith Bauer, Metehan Yurt, Yigitcan Özer, Christian Dittmar |
A Lightweight Neural TTS System for High-quality German Speech Synthesis. |
ITG Conference on Speech Communication |
2021 |
DBLP BibTeX RDF |
|
15 | Ali Raheem Mandeel, Mohammed Salah Al-Radhi, Tamás Gábor Csapó |
Speaker Adaptation with Continuous Vocoder-Based DNN-TTS. |
SPECOM |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Atli Sigurgeirsson, Thorsteinn Gunnarsson, Gunnar Örnólfsson, Eydís Magnúsdóttir, Ragnheiðhur Thórhallsdóttir, Stefán Jónsson, Jón Guðnason |
Talrómur: A large Icelandic TTS corpus. |
NoDaLiDa |
2021 |
DBLP BibTeX RDF |
|
15 | Kishore Kumar Botsa, Lithin Reddy Marla, Suryakanth V. Gangashetty |
A Generative Adversarial Network based Training Framework for Robust TTS in Noisy Environment. |
IC3 |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Zdenek Hanzlícek, Jakub Vít, Markéta Rezácková |
Speakers Talking Foreign Languages in a Multi-lingual TTS System. |
TDS |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Daniel Tihelka, Jindrich Matousek, Alice Tihelková |
How Much End-to-End is Tacotron 2 End-to-End TTS System. |
TDS |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail A. Kudinov |
Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech. |
ICML |
2021 |
DBLP BibTeX RDF |
|
15 | Dawei Liu, Longbiao Wang, Sheng Li 0010, Haoyu Li, Chenchen Ding, Ju Zhang, Jianwu Dang 0001 |
Exploring Effective Speech Representation via ASR for High-Quality End-to-End Multispeaker TTS. |
ICONIP (6) |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Kurniawati Azizah, Mirna Adriani, Wisnu Jatmiko |
Hierarchical Transfer Learning for Multilingual, Multi-Speaker, and Style Transfer DNN-Based TTS on Low-Resource Languages. |
IEEE Access |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Rui Liu 0008, Berrak Sisman, Feilong Bao, Guanglai Gao, Haizhou Li 0001 |
Modeling Prosodic Phrasing With Multi-Task Learning in Tacotron-Based TTS. |
IEEE Signal Process. Lett. |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Yang Zhang, Liqun Deng, Yasheng Wang |
Unified Mandarin TTS Front-end Based on Distilled BERT Model. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Jonathan Shen, Ye Jia, Mike Chrzanowski, Yu Zhang 0033, Isaac Elias, Heiga Zen, Yonghui Wu |
Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Henry B. Moss, Vatsal Aggarwal, Nishant Prateek, Javier González 0002, Roberto Barra-Chicote |
BOFFIN TTS: Few-Shot Speaker Adaptation by Bayesian Optimization. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang 0033, Ye Jia, Ron J. Weiss, Yonghui Wu |
Parallel Tacotron: Non-Autoregressive and Controllable TTS. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Rui Liu 0008, Berrak Sisman, Feilong Bao, Guanglai Gao, Haizhou Li 0001 |
Modeling Prosodic Phrasing with Multi-Task Learning in Tacotron-based TTS. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Rui Liu 0008, Berrak Sisman, Feilong Bao, Guanglai Gao, Haizhou Li 0001 |
WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Rui Liu 0008, Berrak Sisman, Guanglai Gao, Haizhou Li 0001 |
Expressive TTS Training with Frame and Style Reconstruction Loss. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Naihan Li, Shujie Liu 0001, Yanqing Liu, Sheng Zhao, Ming Liu, Ming Zhou 0001 |
MoBoAligner: a Neural Alignment Model for Non-autoregressive TTS with Monotonic Boundary Search. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Qiao Tian, Zewang Zhang, Chao Liu 0030, Heng Lu, Linghui Chen, Bin Wei, Pujiang He, Shan Liu 0001 |
FeatherTTS: Robust and Efficient attention based Neural TTS. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Yiling Huang, Yutian Chen, Jason Pelecanos, Quan Wang |
Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Yao Shi, Hui Bu, Xin Xu, Shaoji Zhang, Ming Li 0026 |
AISHELL-3: A Multi-speaker Mandarin TTS Corpus and the Baselines. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Wen-Chin Huang, Tomoki Hayashi, Shinji Watanabe 0001, Tomoki Toda |
The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Dongyang Dai, Li Chen, Yuping Wang, Mu Wang, Rui Xia, Xuchen Song, Zhiyong Wu 0001, Yuxuan Wang 0002 |
Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Haohan Guo, Shaofei Zhang, Frank K. Soong, Lei He 0005, Lei Xie 0001 |
Conversational End-to-End TTS for Voice Agent. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura 0001 |
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Jaehyeon Kim, Sungwon Kim 0001, Jungil Kong, Sungroh Yoon |
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Arun Baby, Saranya Vinnaitherthan, Nagaraj Adiga, Pranav Jawale, Sumukh Badam, Sharath Adavanne, Srikanth Konjeti |
An ASR Guided Speech Intelligibility Measure for TTS Model Selection. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Katsuhito Sudoh, Takatomo Kano, Sashi Novitasari, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura 0001 |
Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Alistair Conkie, Andrew M. Finch |
Scalable Multilingual Frontend for TTS. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Takashi Morita, Hiroki Koda |
Exploring TTS without T Using Biologically/Psychologically Motivated Neural Network Modules (ZeroSpeech 2020). |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Qinghua Sun, Kenji Nagamatsu |
Building Multi lingual TTS using Cross Lingual Voice Conversion. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Changhe Song, Jingbei Li, Yixuan Zhou 0002, Zhiyong Wu 0001, Helen M. Meng |
Syntactic representation learning for neural network based TTS with syntactic parse tree traversal. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Yahuan Cong, Ran Zhang, Jian Luan 0001 |
PPSpeech: Phrase based Parallel End-to-End TTS System. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber |
What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Yash Sharma 0004, Basil Abraham, Karan Taneja, Preethi Jyothi |
Improving Low Resource Code-switched ASR using Augmented Code-switched TTS. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Fady K. Fahmy, Mahmoud I. Khalil, Hazem M. Abbas |
A Transfer Learning End-to-End ArabicText-To-Speech (TTS) Deep Architecture. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
15 | Vadim Popov, Stanislav Kamenev, Mikhail A. Kudinov, Sergey Repyevsky, Tasnima Sadekova, Vitalii Bushaev, Vladimir Kryzhanovskiy, Denis Parkhomenko |
Fast and Lightweight On-Device TTS with Tacotron2 and LPCNet. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Adam Polyak, Lior Wolf, Yaniv Taigman |
TTS Skins: Speaker Conversion via ASR. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Ruolan Liu, Xue Wen, Chunhui Lu, Xiao Chen |
Tone Learning in Low-Resource Bilingual TTS. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Alex Peiró Lilja, Mireia Farrús |
Naturalness Enhancement with Linguistic Information in End-to-End TTS Using Unsupervised Parallel Encoding. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Matt Whitehill, Shuang Ma, Daniel McDuff, Yale Song |
Multi-Reference Neural TTS Stylization with Adversarial Cycle Consistency. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Manish Sharma, Tom Kenter, Rob Clark |
StrawNet: Self-Training WaveNet for TTS in Low-Data Regimes. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber |
What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura 0001 |
Augmenting Images for ASR and TTS Through Single-Loop and Dual-Loop Multimodal Chain Framework. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Yash Sharma 0004, Basil Abraham, Karan Taneja, Preethi Jyothi |
Improving Low Resource Code-Switched ASR Using Augmented Code-Switched TTS. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Michelle Cohn, Georgia Zellou |
Perception of Concatenative vs. Neural Text-To-Speech (TTS): Differences in Intelligibility in Noise and Language Attitudes. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Hyeong Rae Ihm, Joun Yeop Lee, Byoung Jin Choi, Sung Jun Cheon, Nam Soo Kim |
Reformer-TTS: Neural Speech Synthesis with Reformer Network. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Naihan Li, Shujie Liu 0001, Yanqing Liu, Sheng Zhao, Ming Liu, Ming Zhou 0001 |
MoBoAligner: A Neural Alignment Model for Non-Autoregressive TTS with Monotonic Boundary Search. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Xiangyu Liang, Zhiyong Wu 0001, Runnan Li, Yanqing Liu, Sheng Zhao, Helen Meng |
Enhancing Monotonicity for Robust Autoregressive Transformer TTS. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Erica Cooper, Cheng-I Lai, Yusuke Yasuda, Junichi Yamagishi |
Can Speaker Augmentation Improve Multi-Speaker End-to-End TTS? |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Tao Wang 0074, Xuefei Liu, Jianhua Tao 0001, Jiangyan Yi, Ruibo Fu, Zhengqi Wen |
Non-Autoregressive End-to-End TTS with Coarse-to-Fine Decoding. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Alexander Sorin, Slava Shechtman, Ron Hoory |
Principal Style Components: Expressive Style Control and Cross-Speaker Transfer in Neural TTS. |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Takashi Morita, Hiroki Koda |
Exploring TTS Without T Using Biologically/Psychologically Motivated Neural Network Modules (ZeroSpeech 2020). |
INTERSPEECH |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Mayuko Okamato, Sakriani Sakti, Satoshi Nakamura 0001 |
Towards Speech Entrainment: Considering ASR Information in Speaking Rate Variation of TTS Waveform Generation. |
O-COCOSDA |
2020 |
DBLP DOI BibTeX RDF |
|
Displaying result #401 - #500 of 1108 (100 per page; Change: ) Pages: [ <<][ 1][ 2][ 3][ 4][ 5][ 6][ 7][ 8][ 9][ 10][ 11][ 12][ >>] |
|