Results
Found 1108 publication records. Showing 1108 according to the selection in the facets
| Hits | Authors | Title | Venue | Year | Link | Author keywords |
| --- | --- | --- | --- | --- | --- | --- |
| 15 | Rui Liu 0008, Berrak Sisman, Guanglai Gao, Haizhou Li 0001 | Expressive TTS Training With Frame and Style Reconstruction Loss. | IEEE ACM Trans. Audio Speech Lang. Process. (In: IEEE ACM Trans. Audio Speech Lang. Process. 29, pp. 1806-1818, 2021.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Therdpong Daengsi, Phisit Pornpongtechavanich, Pongpisit Wuttidittachotti | Comparison of TTS System Efficiency: A Pilot Study of Word Intelligibility between Siri and Google Translate with Thai Language. | ICAICST (In: International Conference on Artificial Intelligence and Computer Science Technology, ICAICST 2021, Yogyakarta, Indonesia, June 29-30, 2021, pp. 196-199, 2021, IEEE, 978-1-6654-2404-2.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Chunyu Qiang, Jianhua Tao 0001, Ruibo Fu, Zhengqi Wen, Jiangyan Yi, Tao Wang 0074, Shiming Wang | Text Enhancement for Paragraph Processing in End-to-End Code-switching TTS. | ISCSLP (In: 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021, Hong Kong, January 24-27, 2021, pp. 1-5, 2021, IEEE, 978-1-7281-6994-1.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Iván Vallés-Pérez, Julian Roth, Grzegorz Beringer, Roberto Barra-Chicote, Jasha Droppo | Improving Multi-Speaker TTS Prosody Variance with a Residual Encoder and Normalizing Flows. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 3131-3135, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Xiaochun An, Frank K. Soong, Lei Xie 0001 | Improving Performance of Seen and Unseen Speech Style Transfer in End-to-End Neural TTS. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 4688-4692, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Yan Deng, Rui Zhao 0017, Zhong Meng, Xie Chen 0001, Bing Liu, Jinyu Li 0001, Yifan Gong 0001, Lei He 0005 | Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 751-755, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura 0001 | Dynamically Adaptive Machine Speech Chain Inference for TTS in Noisy Environment: Listen and Speak Louder. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 4124-4128, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Dominik Schiller, Silvan Mertes, Pol van Rijn, Elisabeth André | Analysis by Synthesis: Using an Expressive TTS Model as Feature Extractor for Paralinguistic Speech Classification. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 486-490, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Thananchai Kongthaworn, Burin Naowarat, Ekapol Chuangsuwanich | Spectral and Latent Speech Representation Distortion for TTS Evaluation. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 2741-2745, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Brooke Stephenson, Thomas Hueber, Laurent Girin, Laurent Besacier | Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 3865-3869, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Jason Taylor, Korin Richmond | Confidence Intervals for ASR-Based TTS Evaluation. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 2791-2795, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Penny Karanasou, Sri Karlapati, Alexis Moinet, Arnaud Joly, Ammar Abbas, Simon Slangen, Jaime Lorenzo-Trueba, Thomas Drugman | A Learned Conditional Prior for the VAE Acoustic Space of a TTS System. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 3620-3624, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Cheng Gong, Longbiao Wang, Ju Zhang 0001, Shaotong Guo, Yuguang Wang, Jianwu Dang 0001 | TacoLPCNet: Fast and Stable TTS by Conditioning LPCNet on Mel Spectrogram Predictions. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 111-115, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Pol van Rijn, Silvan Mertes, Dominik Schiller, Peter M. C. Harrison, Pauline Larrouy-Maestri, Elisabeth André, Nori Jacoby | Exploring Emotional Prototypes in a High Dimensional TTS Latent Space. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 3870-3874, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Myeonghun Jeong, Hyeongju Kim, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim | Diff-TTS: A Denoising Diffusion Model for Text-to-Speech. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 3605-3609, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Daniel Tihelka, Markéta Rezácková, Martin Gruber, Zdenek Hanzlícek, Jakub Vít, Jindrich Matousek | Save Your Voice: Voice Banking and TTS for Anyone. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 4855-4856, 2021, ISCA.) | 2021 | DBLP BibTeX RDF | |
| 15 | Ye Jia, Heiga Zen, Jonathan Shen, Yu Zhang 0033, Yonghui Wu | PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 151-155, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Yao Shi, Hui Bu, Xin Xu, Shaoji Zhang, Ming Li 0026 | AISHELL-3: A Multi-Speaker Mandarin TTS Corpus. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 2756-2760, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Evelina Bakhturina, Vitaly Lavrukhin, Boris Ginsburg, Yang Zhang 0089 | Hi-Fi Multi-Speaker English TTS Dataset. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 2776-2780, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang 0033, Ye Jia, R. J. Skerry-Ryan, Yonghui Wu | Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 141-145, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Hui Lu, Zhiyong Wu 0001, Xixin Wu, Xu Li, Shiyin Kang, Xunying Liu, Helen Meng | VAENAR-TTS: Variational Auto-Encoder Based Non-AutoRegressive Text-to-Speech Synthesis. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 3775-3779, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Zhehuai Chen, Andrew Rosenberg, Yu Zhang 0033, Heiga Zen, Mohammadreza Ghodsi, Yinghui Huang, Jesse Emond, Gary Wang, Bhuvana Ramabhadran, Pedro J. Moreno 0001 | Semi-Supervision in ASR: Sequential MixMatch and Factorized TTS-Based Augmentation. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 736-740, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Aleese Block, Michelle Cohn, Georgia Zellou | Variation in Perceptual Sensitivity and Compensation for Coarticulation Across Adult and Child Naturally-Produced and TTS Voices. | Interspeech (In: Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pp. 521-525, 2021, ISCA.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Ryo Fukuda, Sashi Novitasari, Yui Oka, Yasumasa Kano, Yuki Yano, Yuka Ko, Hirotaka Tokuyama, Kosuke Doi, Tomoya Yanagita, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura 0001 | Simultaneous Speech-to-Speech Translation System with Transformer-Based Incremental ASR, MT, and TTS. | O-COCOSDA (In: 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2021, Singapore, November 18-20, 2021, pp. 186-192, 2021, IEEE, 978-1-6654-0870-7.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta, Norihide Kitaoka | Multi-speaker TTS system for low-resource language using cross-lingual transfer learning and data augmentation. | APSIPA ASC (In: Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021, Tokyo, Japan, December 14-17, 2021, pp. 849-853, 2021, IEEE, 978-988-14768-9-0.) | 2021 | DBLP BibTeX RDF | |
| 15 | Amrith Setlur, Aman Madaan, Tanmay Parekh, Yiming Yang, Alan W. Black | Towards Using Heterogeneous Relation Graphs for End-to-End TTS. | ASRU (In: IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021, Cartagena, Colombia, December 13-17, 2021, pp. 1162-1169, 2021, IEEE, 978-1-6654-3739-4.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara | Data Augmentation for ASR Using TTS Via a Discrete Representation. | ASRU (In: IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021, Cartagena, Colombia, December 13-17, 2021, pp. 68-75, 2021, IEEE, 978-1-6654-3739-4.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Wen-Chin Huang, Tomoki Hayashi, Xinjian Li, Shinji Watanabe 0001, Tomoki Toda | On Prosody Modeling for ASR+TTS Based Voice Conversion. | ASRU (In: IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021, Cartagena, Colombia, December 13-17, 2021, pp. 642-649, 2021, IEEE, 978-1-6654-3739-4.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Pascal Puchtler, Johannes Wirth 0001, René Peinl | HUI-Audio-Corpus-German: A High Quality TTS Dataset. | KI (In: KI 2021: Advances in Artificial Intelligence - 44th German Conference on AI, Virtual Event, September 27 - October 1, 2021, Proceedings, pp. 204-216, 2021, Springer, 978-3-030-87625-8.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Song Li, Beibei Ouyang, Lin Li 0032, Qingyang Hong | Light-TTS: Lightweight Multi-Speaker Multi-Lingual Text-to-Speech. | ICASSP (In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021, pp. 8383-8387, 2021, IEEE, 978-1-7281-7606-2.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Murali Karthick Baskar, Lukás Burget, Shinji Watanabe 0001, Ramón Fernandez Astudillo, Jan Honza Cernocký | Eat: Enhanced ASR-TTS for Self-Supervised Speech Recognition. | ICASSP (In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021, pp. 6753-6757, 2021, IEEE, 978-1-7281-7606-2.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Changhe Song, Jingbei Li, Yixuan Zhou 0002, Zhiyong Wu 0001, Helen M. Meng | Syntactic Representation Learning For Neural Network Based TTS with Syntactic Parse Tree Traversal. | ICASSP (In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021, pp. 6064-6068, 2021, IEEE, 978-1-7281-7606-2.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Wei Wang, Zhikai Zhou, Yizhou Lu, Hongji Wang, Chenpeng Du, Yanmin Qian | Towards Data Selection on TTS Data for Children's Speech Recognition. | ICASSP (In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021, pp. 6888-6892, 2021, IEEE, 978-1-7281-7606-2.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Feng-Long Xie, Xinhui Li, Wen-Chao Su, Li Lu, Frank K. Soong | A New High Quality Trajectory Tiling Based Hybrid TTS In Real Time. | ICASSP (In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021, pp. 5704-5708, 2021, IEEE, 978-1-7281-7606-2.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Keisuke Matsubara, Takuma Okamoto, Ryoichi Takashima, Tetsuya Takiguchi, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai | High-Intelligibility Speech Synthesis for Dysarthric Speakers with LPCNet-Based TTS and CycleVAE-Based VC. | ICASSP (In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021, pp. 7058-7062, 2021, IEEE, 978-1-7281-7606-2.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang 0033, Ye Jia, Ron J. Weiss, Yonghui Wu | Parallel Tacotron: Non-Autoregressive and Controllable TTS. | ICASSP (In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021, pp. 5709-5713, 2021, IEEE, 978-1-7281-7606-2.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Liping Chen, Yan Deng, Xi Wang 0016, Frank K. Soong, Lei He 0005 | Speech Bert Embedding for Improving Prosody in Neural TTS. | ICASSP (In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021, pp. 6563-6567, 2021, IEEE, 978-1-7281-7606-2.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Detai Xin, Tatsuya Komatsu, Shinnosuke Takamichi, Hiroshi Saruwatari | Disentangled Speaker and Language Representations Using Mutual Information Minimization and Domain Adaptation for Cross-Lingual TTS. | ICASSP (In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021, pp. 6608-6612, 2021, IEEE, 978-1-7281-7606-2.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Shengkui Zhao, Hao Wang, Trung Hieu Nguyen 0001, Bin Ma 0001 | Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram. | ICASSP (In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021, pp. 5969-5973, 2021, IEEE, 978-1-7281-7606-2.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | K. R. Prajwal, C. V. Jawahar | Data-Efficient Training Strategies for Neural TTS Systems. | COMAD/CODS (In: CODS-COMAD 2021: 8th ACM IKDD CODS and 26th COMAD, Virtual Event, Bangalore, India, January 2-4, 2021, pp. 223-227, 2021, ACM, 978-1-4503-8817-7.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Haohan Guo, Shaofei Zhang, Frank K. Soong, Lei He 0005, Lei Xie 0001 | Conversational End-to-End TTS for Voice Agents. | SLT (In: IEEE Spoken Language Technology Workshop, SLT 2021, Shenzhen, China, January 19-22, 2021, pp. 403-409, 2021, IEEE, 978-1-7281-7066-4.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Yiling Huang, Yutian Chen, Jason Pelecanos, Quan Wang | Synth2Aug: Cross-Domain Speaker Recognition with TTS Synthesized Speech. | SLT (In: IEEE Spoken Language Technology Workshop, SLT 2021, Shenzhen, China, January 19-22, 2021, pp. 316-322, 2021, IEEE, 978-1-7281-7066-4.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Adriana Stan, Beáta Lorincz, Maria Nutu, Mircea Giurgiu | The MARA corpus: Expressivity in end-to-end TTS systems using synthesised speech data. | SpeD (In: International Conference on Speech Technology and Human-Computer Dialogue, SpeD 2021, Bucharest, Romania, October 13-15, 2021, pp. 85-90, 2021, IEEE, 978-1-6654-2786-9.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Ji-Hoon Kim, Sang-Hoon Lee, Ji-Hyun Lee, Honggyu Jung, Seong-Whan Lee | GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints. | SMC (In: 2021 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2021, Melbourne, Australia, October 17-20, 2021, pp. 1172-1177, 2021, IEEE, 978-1-6654-4207-7.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Yunxin Zhao, Minguang Song, Yanghao Yue, Mili Kuruvilla-Dugdale | Personalizing TTS Voices for Progressive Dysarthria. | BHI (In: IEEE EMBS International Conference on Biomedical and Health Informatics, BHI 2021, Athens, Greece, July 27-30, 2021, pp. 1-4, 2021, IEEE, 978-1-6654-0358-0.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Siddique Latif, Inyoung Kim, Ioan Calapodescu, Laurent Besacier | Controlling Prosody in End-to-End TTS: A Case Study on Contrastive Focus Generation. | CoNLL (In: Proceedings of the 25th Conference on Computational Natural Language Learning, CoNLL 2021, Online, November 10-11, 2021, pp. 544-551, 2021, Association for Computational Linguistics.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Prachi Govalkar, Ahmed Mustafa, Nicola Pia, Judith Bauer, Metehan Yurt, Yigitcan Özer, Christian Dittmar | A Lightweight Neural TTS System for High-quality German Speech Synthesis. | ITG Conference on Speech Communication (In: 14th ITG Conference on Speech Communication, online, September 29 - October 1, 2021, pp. 1-5, 2021, VDE, 978-3-8007-5627-8.) | 2021 | DBLP BibTeX RDF | |
| 15 | Ali Raheem Mandeel, Mohammed Salah Al-Radhi, Tamás Gábor Csapó | Speaker Adaptation with Continuous Vocoder-Based DNN-TTS. | SPECOM (In: Speech and Computer - 23rd International Conference, SPECOM 2021, St. Petersburg, Russia, September 27-30, 2021, Proceedings, pp. 407-416, 2021, Springer, 978-3-030-87801-6.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Atli Sigurgeirsson, Thorsteinn Gunnarsson, Gunnar Örnólfsson, Eydís Magnúsdóttir, Ragnheiðhur Thórhallsdóttir, Stefán Jónsson, Jón Guðnason | Talrómur: A large Icelandic TTS corpus. | NoDaLiDa (In: Proceedings of the 23rd Nordic Conference on Computational Linguistics, NoDaLiDa 2021, Reykjavik, Iceland (Online), May 31 - June 2, 2021, pp. 440-444, 2021, Linköping University Electronic Press, Sweden, 978-91-7929-614-8.) | 2021 | DBLP BibTeX RDF | |
| 15 | Kishore Kumar Botsa, Lithin Reddy Marla, Suryakanth V. Gangashetty | A Generative Adversarial Network based Training Framework for Robust TTS in Noisy Environment. | IC3 (In: IC3 2021: Thirteenth International Conference on Contemporary Computing, Noida, India, August 5 - 7, 2021, pp. 273-277, 2021, ACM, 978-1-4503-8920-4.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Zdenek Hanzlícek, Jakub Vít, Markéta Rezácková | Speakers Talking Foreign Languages in a Multi-lingual TTS System. | TSD (In: Text, Speech, and Dialogue - 24th International Conference, TSD 2021, Olomouc, Czech Republic, September 6-9, 2021, Proceedings, pp. 489-498, 2021, Springer, 978-3-030-83526-2.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Daniel Tihelka, Jindrich Matousek, Alice Tihelková | How Much End-to-End is Tacotron 2 End-to-End TTS System. | TSD (In: Text, Speech, and Dialogue - 24th International Conference, TSD 2021, Olomouc, Czech Republic, September 6-9, 2021, Proceedings, pp. 511-522, 2021, Springer, 978-3-030-83526-2.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail A. Kudinov | Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech. | ICML (In: Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event, pp. 8599-8608, 2021, PMLR.) | 2021 | DBLP BibTeX RDF | |
| 15 | Dawei Liu, Longbiao Wang, Sheng Li 0010, Haoyu Li, Chenchen Ding, Ju Zhang, Jianwu Dang 0001 | Exploring Effective Speech Representation via ASR for High-Quality End-to-End Multispeaker TTS. | ICONIP (6) (In: Neural Information Processing - 28th International Conference, ICONIP 2021, Sanur, Bali, Indonesia, December 8-12, 2021, Proceedings, Part VI, pp. 110-118, 2021, Springer, 978-3-030-92309-9.) | 2021 | DBLP DOI BibTeX RDF | |
| 15 | Kurniawati Azizah, Mirna Adriani, Wisnu Jatmiko | Hierarchical Transfer Learning for Multilingual, Multi-Speaker, and Style Transfer DNN-Based TTS on Low-Resource Languages. | IEEE Access (In: IEEE Access 8, pp. 179798-179812, 2020.) | 2020 | DBLP DOI BibTeX RDF | |
| 15 | Rui Liu 0008, Berrak Sisman, Feilong Bao, Guanglai Gao, Haizhou Li 0001 | Modeling Prosodic Phrasing With Multi-Task Learning in Tacotron-Based TTS. | IEEE Signal Process. Lett. (In: IEEE Signal Process. Lett. 27, pp. 1470-1474, 2020.) | 2020 | DBLP DOI BibTeX RDF | |
| 15 | Yang Zhang, Liqun Deng, Yasheng Wang | Unified Mandarin TTS Front-end Based on Distilled BERT Model. | CoRR (In: CoRR abs/2012.15404, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Jonathan Shen, Ye Jia, Mike Chrzanowski, Yu Zhang 0033, Isaac Elias, Heiga Zen, Yonghui Wu | Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling. | CoRR (In: CoRR abs/2010.04301, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Henry B. Moss, Vatsal Aggarwal, Nishant Prateek, Javier González 0002, Roberto Barra-Chicote | BOFFIN TTS: Few-Shot Speaker Adaptation by Bayesian Optimization. | CoRR (In: CoRR abs/2002.01953, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang 0033, Ye Jia, Ron J. Weiss, Yonghui Wu | Parallel Tacotron: Non-Autoregressive and Controllable TTS. | CoRR (In: CoRR abs/2010.11439, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Rui Liu 0008, Berrak Sisman, Feilong Bao, Guanglai Gao, Haizhou Li 0001 | Modeling Prosodic Phrasing with Multi-Task Learning in Tacotron-based TTS. | CoRR (In: CoRR abs/2008.05284, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Rui Liu 0008, Berrak Sisman, Feilong Bao, Guanglai Gao, Haizhou Li 0001 | WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss. | CoRR (In: CoRR abs/2002.00417, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Rui Liu 0008, Berrak Sisman, Guanglai Gao, Haizhou Li 0001 | Expressive TTS Training with Frame and Style Reconstruction Loss. | CoRR (In: CoRR abs/2008.01490, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Naihan Li, Shujie Liu 0001, Yanqing Liu, Sheng Zhao, Ming Liu, Ming Zhou 0001 | MoBoAligner: a Neural Alignment Model for Non-autoregressive TTS with Monotonic Boundary Search. | CoRR (In: CoRR abs/2005.08528, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Qiao Tian, Zewang Zhang, Chao Liu 0030, Heng Lu, Linghui Chen, Bin Wei, Pujiang He, Shan Liu 0001 | FeatherTTS: Robust and Efficient attention based Neural TTS. | CoRR (In: CoRR abs/2011.00935, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Yiling Huang, Yutian Chen, Jason Pelecanos, Quan Wang | Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech. | CoRR (In: CoRR abs/2011.11818, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Yao Shi, Hui Bu, Xin Xu, Shaoji Zhang, Ming Li 0026 | AISHELL-3: A Multi-speaker Mandarin TTS Corpus and the Baselines. | CoRR (In: CoRR abs/2010.11567, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Wen-Chin Huang, Tomoki Hayashi, Shinji Watanabe 0001, Tomoki Toda | The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS. | CoRR (In: CoRR abs/2010.02434, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Dongyang Dai, Li Chen, Yuping Wang, Mu Wang, Rui Xia, Xuchen Song, Zhiyong Wu 0001, Yuxuan Wang 0002 | Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement. | CoRR (In: CoRR abs/2005.12531, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Haohan Guo, Shaofei Zhang, Frank K. Soong, Lei He 0005, Lei Xie 0001 | Conversational End-to-End TTS for Voice Agent. | CoRR (In: CoRR abs/2005.10438, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura 0001 | Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework. | CoRR (In: CoRR abs/2011.02099, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Jaehyeon Kim, Sungwon Kim 0001, Jungil Kong, Sungroh Yoon | Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search. | CoRR (In: CoRR abs/2005.11129, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Arun Baby, Saranya Vinnaitherthan, Nagaraj Adiga, Pranav Jawale, Sumukh Badam, Sharath Adavanne, Srikanth Konjeti | An ASR Guided Speech Intelligibility Measure for TTS Model Selection. | CoRR (In: CoRR abs/2006.01463, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Katsuhito Sudoh, Takatomo Kano, Sashi Novitasari, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura 0001 | Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS. | CoRR (In: CoRR abs/2011.04845, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Alistair Conkie, Andrew M. Finch | Scalable Multilingual Frontend for TTS. | CoRR (In: CoRR abs/2004.04934, 2020.) | 2020 | DBLP BibTeX RDF | |
| 15 | Takashi Morita, Hiroki Koda | Exploring TTS without T Using Biologically/Psychologically Motivated Neural Network Modules (ZeroSpeech 2020). | CoRR (In: CoRR abs/2005.05487, 2020.) | 2020 | DBLP BibTeX RDF | |
15 | Qinghua Sun, Kenji Nagamatsu |
Building Multi lingual TTS using Cross Lingual Voice Conversion. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2012.14039, 2020. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP BibTeX RDF |
|
15 | Changhe Song, Jingbei Li, Yixuan Zhou 0002, Zhiyong Wu 0001, Helen M. Meng |
Syntactic representation learning for neural network based TTS with syntactic parse tree traversal. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2012.06971, 2020. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP BibTeX RDF |
|
15 | Yahuan Cong, Ran Zhang, Jian Luan 0001 |
PPSpeech: Phrase based Parallel End-to-End TTS System. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2008.02490, 2020. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP BibTeX RDF |
|
15 | Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber |
What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2009.02035, 2020. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP BibTeX RDF |
|
15 | Yash Sharma 0004, Basil Abraham, Karan Taneja, Preethi Jyothi |
Improving Low Resource Code-switched ASR using Augmented Code-switched TTS. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2010.05549, 2020. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP BibTeX RDF |
|
15 | Fady K. Fahmy, Mahmoud I. Khalil, Hazem M. Abbas |
A Transfer Learning End-to-End Arabic Text-To-Speech (TTS) Deep Architecture. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2007.11541, 2020. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP BibTeX RDF |
|
15 | Vadim Popov, Stanislav Kamenev, Mikhail A. Kudinov, Sergey Repyevsky, Tasnima Sadekova, Vitalii Bushaev, Vladimir Kryzhanovskiy, Denis Parkhomenko |
Fast and Lightweight On-Device TTS with Tacotron2 and LPCNet. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 220-224, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Adam Polyak, Lior Wolf, Yaniv Taigman |
TTS Skins: Speaker Conversion via ASR. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 786-790, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Ruolan Liu, Xue Wen, Chunhui Lu, Xiao Chen |
Tone Learning in Low-Resource Bilingual TTS. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 2952-2956, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Alex Peiró Lilja, Mireia Farrús |
Naturalness Enhancement with Linguistic Information in End-to-End TTS Using Unsupervised Parallel Encoding. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 3994-3998, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Matt Whitehill, Shuang Ma, Daniel McDuff, Yale Song |
Multi-Reference Neural TTS Stylization with Adversarial Cycle Consistency. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 4442-4446, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Manish Sharma, Tom Kenter, Rob Clark |
StrawNet: Self-Training WaveNet for TTS in Low-Data Regimes. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 3550-3554, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber |
What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 215-219, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura 0001 |
Augmenting Images for ASR and TTS Through Single-Loop and Dual-Loop Multimodal Chain Framework. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 4901-4905, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Yash Sharma 0004, Basil Abraham, Karan Taneja, Preethi Jyothi |
Improving Low Resource Code-Switched ASR Using Augmented Code-Switched TTS. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 4771-4775, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Michelle Cohn, Georgia Zellou |
Perception of Concatenative vs. Neural Text-To-Speech (TTS): Differences in Intelligibility in Noise and Language Attitudes. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 1733-1737, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Hyeong Rae Ihm, Joun Yeop Lee, Byoung Jin Choi, Sung Jun Cheon, Nam Soo Kim |
Reformer-TTS: Neural Speech Synthesis with Reformer Network. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 2012-2016, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Naihan Li, Shujie Liu 0001, Yanqing Liu, Sheng Zhao, Ming Liu, Ming Zhou 0001 |
MoBoAligner: A Neural Alignment Model for Non-Autoregressive TTS with Monotonic Boundary Search. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 3999-4003, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Xiangyu Liang, Zhiyong Wu 0001, Runnan Li, Yanqing Liu, Sheng Zhao, Helen Meng |
Enhancing Monotonicity for Robust Autoregressive Transformer TTS. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 3181-3185, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Erica Cooper, Cheng-I Lai, Yusuke Yasuda, Junichi Yamagishi |
Can Speaker Augmentation Improve Multi-Speaker End-to-End TTS? ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 3979-3983, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Tao Wang 0074, Xuefei Liu, Jianhua Tao 0001, Jiangyan Yi, Ruibo Fu, Zhengqi Wen |
Non-Autoregressive End-to-End TTS with Coarse-to-Fine Decoding. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 3984-3988, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Alexander Sorin, Slava Shechtman, Ron Hoory |
Principal Style Components: Expressive Style Control and Cross-Speaker Transfer in Neural TTS. ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 3411-3415, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Takashi Morita, Hiroki Koda |
Exploring TTS Without T Using Biologically/Psychologically Motivated Neural Network Modules (ZeroSpeech 2020). ![Search on Bibsonomy](Pics/bibsonomy.png) |
INTERSPEECH ![In: Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020., pp. 4856-4860, 2020, ISCA. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
15 | Mayuko Okamato, Sakriani Sakti, Satoshi Nakamura 0001 |
Towards Speech Entrainment: Considering ASR Information in Speaking Rate Variation of TTS Waveform Generation. ![Search on Bibsonomy](Pics/bibsonomy.png) |
O-COCOSDA ![In: 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2020, Yangon, Myanmar, November 5-7, 2020, pp. 139-144, 2020, IEEE, 978-1-7281-9896-5. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|