Search results for "tts"

Hits ?▲	Authors	Title	Venue	Year	Link
15	Nathaniel Romney Robinson, Perez Ogayo, Swetha R. Gangu, David R. Mortensen, Shinji Watanabe 0001	When Is TTS Augmentation Through a Pivot Language Useful?	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Iona Gessinger, Michelle Cohn, Georgia Zellou, Bernd Möbius	Cross-Cultural Comparison of Gradient Emotion Perception: Human vs. Alexa TTS Voices.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Haohan Guo, Hui Lu, Xixin Wu, Helen Meng	A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Ariadna Sánchez, Alessio Falai, Ziyao Zhang, Orazio Angelini, Kayoko Yanagisawa	Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS).	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Perry Lam, Huayun Zhang, Nancy F. Chen, Berrak Sisman	EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Yookyung Shin, Younggun Lee, Suhee Jo, Yeongtae Hwang, Taesu Kim	Text-driven Emotional Style Control and Cross-speaker Style Transfer in Neural TTS.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Ivan Vovk, Tasnima Sadekova, Vladimir Gogoryan, Vadim Popov, Mikhail A. Kudinov, Jiansheng Wei	Fast Grad-TTS: Towards Efficient Diffusion-Based Speech Generation on CPU.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Cassia Valentini-Botinhao, Manuel Sam Ribeiro, Oliver Watts, Korin Richmond, Gustav Eje Henter	Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber	BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Syed Ammar Abbas, Thomas Merritt, Alexis Moinet, Sri Karlapati, Ewa Muszynska, Simon Slangen, Elia Gatti, Thomas Drugman	Expressive, Variable, and Controllable Duration Modelling in TTS.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Tuomo Raitio, Petko Petkov, Jiangchuan Li, P. V. Muhammed Shifas, Andrea Davis, Yannis Stylianou	Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Kenta Udagawa, Yuki Saito, Hiroshi Saruwatari	Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Naoki Makishima, Satoshi Suzuki, Atsushi Ando, Ryo Masumura	Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Yooncheol Ju, Ilhwan Kim, Hongsun Yang, Ji-Hoon Kim, Byeongyeol Kim, Soumi Maiti, Shinji Watanabe 0001	TriniTTS: Pitch-controllable End-to-end TTS without External Aligner.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Alon Levkovitch, Eliya Nachmani, Lior Wolf	Zero-Shot Voice Conditioning for Denoising Diffusion TTS Models.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Wei-Ping Huang, Po-Chun Chen, Sung-Feng Huang, Hung-yi Lee	Few Shot Cross-Lingual TTS Using Transferable Phoneme Embedding.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Sri Karlapati, Penny Karanasou, Mateusz Lajszczak, Syed Ammar Abbas, Alexis Moinet, Peter Makarov, Ray Li, Arent van Korlaar, Simon Slangen, Thomas Drugman	CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Peter Makarov, Syed Ammar Abbas, Mateusz Lajszczak, Arnaud Joly, Sri Karlapati, Alexis Moinet, Thomas Drugman, Penny Karanasou	Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Jaeuk Lee, Joon-Hyuk Chang	Advanced Speaker Embedding with Predictive Variance of Gaussian Distribution for Speaker Adaptation in TTS.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Ajinkya Kulkarni, Vincent Colotte, Denis Jouvet	Analysis of expressivity transfer in non-autoregressive end-to-end multispeaker TTS systems.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Haohan Guo, Feng-Long Xie, Frank K. Soong, Xixin Wu, Helen Meng	A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Jaeuk Lee, Joon-Hyuk Chang	One-Shot Speaker Adaptation Based on Initialization by Generative Adversarial Networks for TTS.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Martin Lenglet, Olivier Perrotin, Gérard Bailly	Speaking Rate Control of end-to-end TTS Models by Direct Manipulation of the Encoder's Output Embeddings.	INTERSPEECH	2022	DBLP DOI BibTeX RDF
15	Dengfeng Ke, Liangjie Huang, Wenhan Yao, Ruixin Hu, Xueyin Zu, Yanlu Xie, Jinsong Zhang 0001	Voicifier-LN: An Novel Approach to Elevate the Speaker Similarity for General Zero-shot Multi-Speaker TTS.	AIPR	2022	DBLP DOI BibTeX RDF
15	Yujia Xiao, Xi Wang 0016, Lei He 0005, Frank K. Soong	Improving Fastspeech TTS with Efficient Self-Attention and Compact Feed-Forward Network.	ICASSP	2022	DBLP DOI BibTeX RDF
15	Johanes Effendi, Yogesh Virkar, Roberto Barra-Chicote, Marcello Federico	Duration Modeling of Neural TTS for Automatic Dubbing.	ICASSP	2022	DBLP DOI BibTeX RDF
15	Mohammad Soleymanpour, Michael T. Johnson, Rahim Soleymanpour, Jeffrey Berry 0001	Synthesizing Dysarthric Speech Using Multi-Speaker Tts For Dysarthric Speech Recognition.	ICASSP	2022	DBLP DOI BibTeX RDF
15	Rohan Badlani, Adrian Lancucki, Kevin J. Shih, Rafael Valle, Wei Ping, Bryan Catanzaro	One TTS Alignment to Rule Them All.	ICASSP	2022	DBLP DOI BibTeX RDF
15	Rui Li, Dong Pu, Minnie Huang, Bill Huang	UNET-TTS: Improving Unseen Speaker and Style Transfer in One-Shot Voice Cloning.	ICASSP	2022	DBLP DOI BibTeX RDF
15	Shivam Mehta, Éva Székely, Jonas Beskow, Gustav Eje Henter	Neural HMMS Are All You Need (For High-Quality Attention-Free TTS).	ICASSP	2022	DBLP DOI BibTeX RDF
15	Chae-Bin Im, Sang-Hoon Lee, Seung-Bin Kim, Seong-Whan Lee	EMOQ-TTS: Emotion Intensity Quantization for Fine-Grained Controllable Emotional Text-to-Speech.	ICASSP	2022	DBLP DOI BibTeX RDF
15	Tuomo Raitio, Jiangchuan Li, Shreyas Seshadri	Hierarchical Prosody Modeling and Control in Non-Autoregressive Parallel Neural TTS.	ICASSP	2022	DBLP DOI BibTeX RDF
15	Junchen Lu, Berrak Sisman, Rui Liu 0008, Mingyang Zhang 0003, Haizhou Li 0001	Visualtts: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over.	ICASSP	2022	DBLP DOI BibTeX RDF
15	Rem Hida, Masaki Hamada, Chie Kamada, Emiru Tsunoo, Toshiyuki Sekiya, Toshiyuki Kumakura	Polyphone Disambiguation and Accent Prediction Using Pre-Trained Language Models in Japanese TTS Front-End.	ICASSP	2022	DBLP DOI BibTeX RDF
15	Ji-Hyun Lee, Sang-Hoon Lee, Ji-Hoon Kim, Seong-Whan Lee	PVAE-TTS: Adaptive Text-to-Speech via Progressive Style Adaptation.	ICASSP	2022	DBLP DOI BibTeX RDF
15	Oktai Tatanov, Stanislav Beliaev, Boris Ginsburg	Mixer-TTS: Non-Autoregressive, Fast and Compact Text-to-Speech Model Conditioned on Language Model Embeddings.	ICASSP	2022	DBLP DOI BibTeX RDF
15	Zidong Chen, Xiongkuo Min	Perceptual Quality Assessment of TTS-Synthesized Speech.	IFTC	2022	DBLP DOI BibTeX RDF
15	Xulong Zhang 0001, Jianzong Wang, Ning Cheng 0001, Jing Xiao 0006	Semi-Supervised Learning Based on Reference Model for Low-resource TTS.	MSN	2022	DBLP DOI BibTeX RDF
15	Leonardo B. de M. M. Marques, Lucas H. Ueda, Flávio Olmos Simões, Mario Uliani Neto, Fernando O. Runstein, Edson Jose Nagle, Bianca Dal Bó, Paula D. P. Costa	Diffusion-Based Approach to Style Modeling in Expressive TTS.	BRACIS (1)	2022	DBLP DOI BibTeX RDF
15	Xiaomin Li, Vangelis Metsis, Huangyingrui Wang, Anne Hee Hiong Ngu	TTS-GAN: A Transformer-Based Time-Series Generative Adversarial Network.	AIME	2022	DBLP DOI BibTeX RDF
15	Sewade Ogun, Vincent Colotte, Emmanuel Vincent 0001	Can We Use Common Voice to Train a Multi-Speaker TTS System?	SLT	2022	DBLP DOI BibTeX RDF
15	Yinghao Aaron Li, Cong Han, Nima Mesgarani	Styletts-VC: One-Shot Voice Conversion by Knowledge Transfer From Style-Based TTS Models.	SLT	2022	DBLP DOI BibTeX RDF
15	Rendi Chevi, Radityo Eko Prasojo, Alham Fikri Aji, Andros Tjandra, Sakriani Sakti	NIX-TTS: Lightweight and End-to-End Text-to-Speech Via Module-Wise Distillation.	SLT	2022	DBLP DOI BibTeX RDF
15	Stefan Taubert, Jasmin Sternkopf, Stefan Kahl, Maximilian Eibl	A Comparison of Text Selection Algorithms for Sequence-to-Sequence Neural TTS.	ICSPCC	2022	DBLP DOI BibTeX RDF
15	Hyoung-Kyu Song, Sang Hoon Woo, Junhyeok Lee, Seungmin Yang, Hyunjae Cho, Youseong Lee, Dongho Choi, Kang-wook Kim	Talking Face Generation with Multilingual TTS.	CVPR	2022	DBLP DOI BibTeX RDF
15	Haoxu Wang, Yan Jia, Zeqing Zhao, Xuyang Wang, Junjie Wang, Ming Li 0026	Generating TTS Based Adversarial Samples for Training Wake-Up Word Detection Systems Against Confusing Words.	Odyssey	2022	DBLP DOI BibTeX RDF
15	Ziyue Jiang 0001, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren 0006, Jinglin Liu	Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech.	NeurIPS	2022	DBLP BibTeX RDF
15	Xun Zhou, Zhiyang Zhou, Xiaodong Shi	FCH-TTS: Fast, Controllable and High-quality Non-Autoregressive Text-to-Speech Synthesis.	IJCNN	2022	DBLP DOI BibTeX RDF
15	Xulong Zhang 0001, Jianzong Wang, Ning Cheng 0001, Jing Xiao 0006	TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS.	IJCNN	2022	DBLP DOI BibTeX RDF
15	Maria-Loulou Hajj, Martin Lenglet, Olivier Perrotin, Gérard Bailly	Comparing NLP Solutions for the Disambiguation of French Heterophonic Homographs for End-to-End TTS Systems.	SPECOM	2022	DBLP DOI BibTeX RDF
15	Charbel Arnaud Cedrique Y. Boco, Théophile K. Dagba	An End to End Bilingual TTS System for Fongbe and Yoruba.	ICCCI (CCIS Volume)	2022	DBLP DOI BibTeX RDF
15	Yuan-Fu Liao, Wen-Han Hsu, Chen-Ming Pan, Wern-Jun Wang, Matús Pleva, Daniel Hládek	Personalized Taiwanese Speech Synthesis using Cascaded ASR and TTS Framework.	RADIOELEKTRONIKA	2022	DBLP DOI BibTeX RDF
15	Heeseung Kim, Sungwon Kim 0001, Sungroh Yoon	Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance.	ICML	2022	DBLP BibTeX RDF
15	Edresson Casanova, Julian Weber, Christopher Dane Shulby, Arnaldo Cândido Júnior, Eren Gölge, Moacir A. Ponti	YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for Everyone.	ICML	2022	DBLP BibTeX RDF
15	Siyang Wang, Joakim Gustafson, Éva Székely	Evaluating Sampling-based Filler Insertion with Spontaneous TTS.	LREC	2022	DBLP BibTeX RDF
15	Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol	KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics.	LREC	2022	DBLP BibTeX RDF
15	Elham Akhlaghi, Ingibjörg Iðha Auðhunardóttir, Anna Baczkowska, Branislav Bédi, Hakeem Beedar, Harald Berthelsen, Cathy Chua, Catia Cucchiarini, Hanieh Habibi, Ivana Horváthová, Junta Ikeda, Christèle Maizonniaux, Neasa Ní Chiaráin, Chadi Raheb, Manny Rayner, John Sloan, Nikos Tsourakis, Chunlin Yao	Using the LARA Little Prince to compare human and TTS audio quality.	LREC	2022	DBLP BibTeX RDF
15	Sung-Woong Hwang, Joon-Hyuk Chang	Document-Level Neural TTS Using Curriculum Learning and Attention Masking.	IEEE Access	2021	DBLP DOI BibTeX RDF
15	Noé Tits, Kevin El Haddad, Thierry Dutoit	Analysis and Assessment of Controllability of an Expressive Deep Learning-Based TTS System.	Informatics	2021	DBLP DOI BibTeX RDF
15	Guangyu Liu, Bao Zhou, Yi Huang, Longfei Wang, Wei Wang, Enming Zhao	Video image scaling technology based on adaptive interpolation algorithm and TTS FPGA implementation.	Comput. Stand. Interfaces	2021	DBLP DOI BibTeX RDF
15	Ye Jia, Heiga Zen, Jonathan Shen, Yu Zhang 0033, Yonghui Wu	PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS.	CoRR	2021	DBLP BibTeX RDF
15	Hui Lu, Zhiyong Wu 0001, Xixin Wu, Xu Li, Shiyin Kang, Xunying Liu, Helen Meng	VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.	CoRR	2021	DBLP BibTeX RDF
15	Murali Karthick Baskar, Lukás Burget, Shinji Watanabe 0001, Ramón Fernandez Astudillo, Jan Honza Cernocký	EAT: Enhanced ASR-TTS for Self-supervised Speech Recognition.	CoRR	2021	DBLP BibTeX RDF
15	Heeseung Kim, Sungwon Kim 0001, Sungroh Yoon	Guided-TTS: Text-to-Speech with Untranscribed Speech.	CoRR	2021	DBLP BibTeX RDF
15	Myeonghun Jeong, Hyeongju Kim, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim	Diff-TTS: A Denoising Diffusion Model for Text-to-Speech.	CoRR	2021	DBLP BibTeX RDF
15	Peng Liu, Yuewen Cao, Songxiang Liu, Na Hu, Guangzhi Li, Chao Weng, Dan Su 0002	VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention.	CoRR	2021	DBLP BibTeX RDF
15	Xiaochun An, Frank K. Soong, Lei Xie 0001	Improving Performance of Seen and Unseen Speech Style Transfer in End-to-end Neural TTS.	CoRR	2021	DBLP BibTeX RDF
15	Pol van Rijn, Silvan Mertes, Dominik Schiller, Peter M. C. Harrison, Pauline Larrouy-Maestri, Elisabeth André, Nori Jacoby	Exploring emotional prototypes in a high dimensional TTS latent space.	CoRR	2021	DBLP BibTeX RDF
15	Hieu-Thi Luong, Junichi Yamagishi	Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance.	CoRR	2021	DBLP BibTeX RDF
15	Ali Raheem Mandeel, Mohammed Salah Al-Radhi, Tamás Gábor Csapó	Speaker Adaptation with Continuous Vocoder-based DNN-TTS.	CoRR	2021	DBLP BibTeX RDF
15	Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail A. Kudinov	Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech.	CoRR	2021	DBLP BibTeX RDF
15	Penny Karanasou, Sri Karlapati, Alexis Moinet, Arnaud Joly, Ammar Abbas, Simon Slangen, Jaime Lorenzo-Trueba, Thomas Drugman	A learned conditional prior for the VAE acoustic space of a TTS system.	CoRR	2021	DBLP BibTeX RDF
15	Edresson Casanova, Julian Weber, Christopher Shulby, Arnaldo Cândido Júnior, Eren Gölge, Moacir Antonelli Ponti	YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone.	CoRR	2021	DBLP BibTeX RDF
15	Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang 0033, Jia Ye, R. J. Skerry-Ryan, Yonghui Wu	Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling.	CoRR	2021	DBLP BibTeX RDF
15	Paarth Neekhara, Jason Li, Boris Ginsburg	Adapting TTS models For New Speakers using Transfer Learning.	CoRR	2021	DBLP BibTeX RDF
15	Shivam Mehta, Éva Székely, Jonas Beskow, Gustav Eje Henter	Neural HMMs are all you need (for high-quality attention-free TTS).	CoRR	2021	DBLP BibTeX RDF
15	Brooke Stephenson, Thomas Hueber, Laurent Girin, Laurent Besacier	Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input.	CoRR	2021	DBLP BibTeX RDF
15	Shoule Wu, Ziqiang Shi	It$\hat{\text{o}}$TTS and It$\hat{\text{o}}$Wave: Linear Stochastic Differential Equation Is All You Need For Audio Generation.	CoRR	2021	DBLP BibTeX RDF
15	Iván Vallés-Pérez, Julian Roth, Grzegorz Beringer, Roberto Barra-Chicote, Jasha Droppo	Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows.	CoRR	2021	DBLP BibTeX RDF
15	Zongmin Liu	Double Fuzzy Probabilistic Interval Linguistic Term Set and a Dynamic Fuzzy Decision Making Model based on Markov Process with tts Application in Multiple Criteria Group Decision Making.	CoRR	2021	DBLP BibTeX RDF
15	Liping Chen, Yan Deng, Xi Wang 0016, Frank K. Soong, Lei He 0005	Speech BERT Embedding For Improving Prosody in Neural TTS.	CoRR	2021	DBLP BibTeX RDF
15	Pascal Puchtler, Johannes Wirth 0001, René Peinl	HUI-Audio-Corpus-German: A high quality TTS dataset.	CoRR	2021	DBLP BibTeX RDF
15	Ji-Hoon Kim, Sang-Hoon Lee, Ji-Hyun Lee, Honggyu Jung, Seong-Whan Lee	GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints.	CoRR	2021	DBLP BibTeX RDF
15	Sung-Feng Huang, Chyi-Jiunn Lin, Hung-yi Lee	Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech.	CoRR	2021	DBLP BibTeX RDF
15	Raahil Shah, Kamil Pokora, Abdelhamid Ezzerg, Viacheslav Klimkov, Goeric Huybrechts, Bartosz Putrycz, Daniel Korzekwa, Thomas Merritt	Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech.	CoRR	2021	DBLP BibTeX RDF
15	Shilun Lin, Wen-Chao Su, Li Meng, Fenglong Xie, Xinhui Li, Li Lu	Nana-HDR: A Non-attentive Non-autoregressive Hybrid Model for TTS.	CoRR	2021	DBLP BibTeX RDF
15	Rohan Badlani, Adrian Lancucki, Kevin J. Shih, Rafael Valle, Wei Ping, Bryan Catanzaro	One TTS Alignment To Rule Them All.	CoRR	2021	DBLP BibTeX RDF
15	Wen-Chin Huang, Tomoki Hayashi, Xinjian Li, Shinji Watanabe 0001, Tomoki Toda	On Prosody Modeling for ASR+TTS based Voice Conversion.	CoRR	2021	DBLP BibTeX RDF
15	Tuomo Raitio, Jiangchuan Li, Shreyas Seshadri	Hierarchical prosody modeling and control in non-autoregressive parallel neural TTS.	CoRR	2021	DBLP BibTeX RDF
15	Shreyas Seshadri, Tuomo Raitio, Dan Castellani, Jiangchuan Li	Emphasis control for parallel neural TTS.	CoRR	2021	DBLP BibTeX RDF
15	Mutian He 0001, Jingzhou Yang, Lei He 0005, Frank K. Soong	Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge.	CoRR	2021	DBLP BibTeX RDF
15	Noé Tits, Kevin El Haddad, Thierry Dutoit	Analysis and Assessment of Controllability of an Expressive Deep Learning-based TTS system.	CoRR	2021	DBLP BibTeX RDF
15	Shengkui Zhao, Hao Wang, Trung Hieu Nguyen 0001, Bin Ma 0001	Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram.	CoRR	2021	DBLP BibTeX RDF
15	Rui Li, Dong Pu, Minnie Huang, Bill Huang	Unet-TTS: Improving Unseen Speaker and Style Transfer in One-shot Voice Cloning.	CoRR	2021	DBLP BibTeX RDF
15	Junchen Lu, Berrak Sisman, Rui Liu 0008, Mingyang Zhang 0003, Haizhou Li 0001	VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over.	CoRR	2021	DBLP BibTeX RDF
15	Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura 0001	Code-Switching ASR and TTS Using Semisupervised Learning with Machine Speech Chain.	IEICE Trans. Inf. Syst.	2021	DBLP DOI BibTeX RDF
15	Kiyoshi Kurihara, Nobumasa Seiyama, Tadashi Kumano	Prosodic Features Control by Symbols as Input of Sequence-to-Sequence Acoustic Modeling for Neural TTS.	IEICE Trans. Inf. Syst.	2021	DBLP BibTeX RDF
15	Noé Tits, Kevin El Haddad, Thierry Dutoit	ICE-Talk 2: Interface for Controllable Expressive TTS with perceptual assessment tool.	Softw. Impacts	2021	DBLP DOI BibTeX RDF
15	Liumeng Xue, Shifeng Pan, Lei He 0005, Lei Xie 0001, Frank K. Soong	Cycle consistent network for end-to-end style transfer TTS training.	Neural Networks	2021	DBLP DOI BibTeX RDF
15	Xiaochun An, Frank K. Soong, Shan Yang, Lei Xie 0001	Effective and direct control of neural TTS prosody by removing interactions between different attributes.	Neural Networks	2021	DBLP DOI BibTeX RDF