|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 149 occurrences of 116 keywords
|
|
|
Results
Found 1108 publication records. Showing 1108 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
15 | Nathaniel Romney Robinson, Perez Ogayo, Swetha R. Gangu, David R. Mortensen, Shinji Watanabe 0001 |
When Is TTS Augmentation Through a Pivot Language Useful? |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Iona Gessinger, Michelle Cohn, Georgia Zellou, Bernd Möbius |
Cross-Cultural Comparison of Gradient Emotion Perception: Human vs. Alexa TTS Voices. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Haohan Guo, Hui Lu, Xixin Wu, Helen Meng |
A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Ariadna Sánchez, Alessio Falai, Ziyao Zhang, Orazio Angelini, Kayoko Yanagisawa |
Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS). |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Perry Lam, Huayun Zhang, Nancy F. Chen, Berrak Sisman |
EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Yookyung Shin, Younggun Lee, Suhee Jo, Yeongtae Hwang, Taesu Kim |
Text-driven Emotional Style Control and Cross-speaker Style Transfer in Neural TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Ivan Vovk, Tasnima Sadekova, Vladimir Gogoryan, Vadim Popov, Mikhail A. Kudinov, Jiansheng Wei |
Fast Grad-TTS: Towards Efficient Diffusion-Based Speech Generation on CPU. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Cassia Valentini-Botinhao, Manuel Sam Ribeiro, Oliver Watts, Korin Richmond, Gustav Eje Henter |
Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber |
BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Syed Ammar Abbas, Thomas Merritt, Alexis Moinet, Sri Karlapati, Ewa Muszynska, Simon Slangen, Elia Gatti, Thomas Drugman |
Expressive, Variable, and Controllable Duration Modelling in TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Tuomo Raitio, Petko Petkov, Jiangchuan Li, P. V. Muhammed Shifas, Andrea Davis, Yannis Stylianou |
Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Kenta Udagawa, Yuki Saito, Hiroshi Saruwatari |
Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Naoki Makishima, Satoshi Suzuki, Atsushi Ando, Ryo Masumura |
Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Yooncheol Ju, Ilhwan Kim, Hongsun Yang, Ji-Hoon Kim, Byeongyeol Kim, Soumi Maiti, Shinji Watanabe 0001 |
TriniTTS: Pitch-controllable End-to-end TTS without External Aligner. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Alon Levkovitch, Eliya Nachmani, Lior Wolf |
Zero-Shot Voice Conditioning for Denoising Diffusion TTS Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Wei-Ping Huang, Po-Chun Chen, Sung-Feng Huang, Hung-yi Lee |
Few Shot Cross-Lingual TTS Using Transferable Phoneme Embedding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Sri Karlapati, Penny Karanasou, Mateusz Lajszczak, Syed Ammar Abbas, Alexis Moinet, Peter Makarov, Ray Li, Arent van Korlaar, Simon Slangen, Thomas Drugman |
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Peter Makarov, Syed Ammar Abbas, Mateusz Lajszczak, Arnaud Joly, Sri Karlapati, Alexis Moinet, Thomas Drugman, Penny Karanasou |
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Jaeuk Lee, Joon-Hyuk Chang |
Advanced Speaker Embedding with Predictive Variance of Gaussian Distribution for Speaker Adaptation in TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Ajinkya Kulkarni, Vincent Colotte, Denis Jouvet |
Analysis of expressivity transfer in non-autoregressive end-to-end multispeaker TTS systems. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Haohan Guo, Feng-Long Xie, Frank K. Soong, Xixin Wu, Helen Meng |
A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Jaeuk Lee, Joon-Hyuk Chang |
One-Shot Speaker Adaptation Based on Initialization by Generative Adversarial Networks for TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Martin Lenglet, Olivier Perrotin, Gérard Bailly |
Speaking Rate Control of end-to-end TTS Models by Direct Manipulation of the Encoder's Output Embeddings. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Dengfeng Ke, Liangjie Huang, Wenhan Yao, Ruixin Hu, Xueyin Zu, Yanlu Xie, Jinsong Zhang 0001 |
Voicifier-LN: An Novel Approach to Elevate the Speaker Similarity for General Zero-shot Multi-Speaker TTS. |
AIPR |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Yujia Xiao, Xi Wang 0016, Lei He 0005, Frank K. Soong |
Improving Fastspeech TTS with Efficient Self-Attention and Compact Feed-Forward Network. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Johanes Effendi, Yogesh Virkar, Roberto Barra-Chicote, Marcello Federico |
Duration Modeling of Neural TTS for Automatic Dubbing. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Mohammad Soleymanpour, Michael T. Johnson, Rahim Soleymanpour, Jeffrey Berry 0001 |
Synthesizing Dysarthric Speech Using Multi-Speaker Tts For Dysarthric Speech Recognition. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Rohan Badlani, Adrian Lancucki, Kevin J. Shih, Rafael Valle, Wei Ping, Bryan Catanzaro |
One TTS Alignment to Rule Them All. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Rui Li, Dong Pu, Minnie Huang, Bill Huang |
UNET-TTS: Improving Unseen Speaker and Style Transfer in One-Shot Voice Cloning. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Shivam Mehta, Éva Székely, Jonas Beskow, Gustav Eje Henter |
Neural HMMS Are All You Need (For High-Quality Attention-Free TTS). |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Chae-Bin Im, Sang-Hoon Lee, Seung-Bin Kim, Seong-Whan Lee |
EMOQ-TTS: Emotion Intensity Quantization for Fine-Grained Controllable Emotional Text-to-Speech. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Tuomo Raitio, Jiangchuan Li, Shreyas Seshadri |
Hierarchical Prosody Modeling and Control in Non-Autoregressive Parallel Neural TTS. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Junchen Lu, Berrak Sisman, Rui Liu 0008, Mingyang Zhang 0003, Haizhou Li 0001 |
Visualtts: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Rem Hida, Masaki Hamada, Chie Kamada, Emiru Tsunoo, Toshiyuki Sekiya, Toshiyuki Kumakura |
Polyphone Disambiguation and Accent Prediction Using Pre-Trained Language Models in Japanese TTS Front-End. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Ji-Hyun Lee, Sang-Hoon Lee, Ji-Hoon Kim, Seong-Whan Lee |
PVAE-TTS: Adaptive Text-to-Speech via Progressive Style Adaptation. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Oktai Tatanov, Stanislav Beliaev, Boris Ginsburg |
Mixer-TTS: Non-Autoregressive, Fast and Compact Text-to-Speech Model Conditioned on Language Model Embeddings. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Zidong Chen, Xiongkuo Min |
Perceptual Quality Assessment of TTS-Synthesized Speech. |
IFTC |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Xulong Zhang 0001, Jianzong Wang, Ning Cheng 0001, Jing Xiao 0006 |
Semi-Supervised Learning Based on Reference Model for Low-resource TTS. |
MSN |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Leonardo B. de M. M. Marques, Lucas H. Ueda, Flávio Olmos Simões, Mario Uliani Neto, Fernando O. Runstein, Edson Jose Nagle, Bianca Dal Bó, Paula D. P. Costa |
Diffusion-Based Approach to Style Modeling in Expressive TTS. |
BRACIS (1) |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Xiaomin Li, Vangelis Metsis, Huangyingrui Wang, Anne Hee Hiong Ngu |
TTS-GAN: A Transformer-Based Time-Series Generative Adversarial Network. |
AIME |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Sewade Ogun, Vincent Colotte, Emmanuel Vincent 0001 |
Can We Use Common Voice to Train a Multi-Speaker TTS System? |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Yinghao Aaron Li, Cong Han, Nima Mesgarani |
Styletts-VC: One-Shot Voice Conversion by Knowledge Transfer From Style-Based TTS Models. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Rendi Chevi, Radityo Eko Prasojo, Alham Fikri Aji, Andros Tjandra, Sakriani Sakti |
NIX-TTS: Lightweight and End-to-End Text-to-Speech Via Module-Wise Distillation. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Stefan Taubert, Jasmin Sternkopf, Stefan Kahl, Maximilian Eibl |
A Comparison of Text Selection Algorithms for Sequence-to-Sequence Neural TTS. |
ICSPCC |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Hyoung-Kyu Song, Sang Hoon Woo, Junhyeok Lee, Seungmin Yang, Hyunjae Cho, Youseong Lee, Dongho Choi, Kang-wook Kim |
Talking Face Generation with Multilingual TTS. |
CVPR |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Haoxu Wang, Yan Jia, Zeqing Zhao, Xuyang Wang, Junjie Wang, Ming Li 0026 |
Generating TTS Based Adversarial Samples for Training Wake-Up Word Detection Systems Against Confusing Words. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Ziyue Jiang 0001, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren 0006, Jinglin Liu |
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech. |
NeurIPS |
2022 |
DBLP BibTeX RDF |
|
15 | Xun Zhou, Zhiyang Zhou, Xiaodong Shi |
FCH-TTS: Fast, Controllable and High-quality Non-Autoregressive Text-to-Speech Synthesis. |
IJCNN |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Xulong Zhang 0001, Jianzong Wang, Ning Cheng 0001, Jing Xiao 0006 |
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS. |
IJCNN |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Maria-Loulou Hajj, Martin Lenglet, Olivier Perrotin, Gérard Bailly |
Comparing NLP Solutions for the Disambiguation of French Heterophonic Homographs for End-to-End TTS Systems. |
SPECOM |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Charbel Arnaud Cedrique Y. Boco, Théophile K. Dagba |
An End to End Bilingual TTS System for Fongbe and Yoruba. |
ICCCI (CCIS Volume) |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Yuan-Fu Liao, Wen-Han Hsu, Chen-Ming Pan, Wern-Jun Wang, Matús Pleva, Daniel Hládek |
Personalized Taiwanese Speech Synthesis using Cascaded ASR and TTS Framework. |
RADIOELEKTRONIKA |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Heeseung Kim, Sungwon Kim 0001, Sungroh Yoon |
Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance. |
ICML |
2022 |
DBLP BibTeX RDF |
|
15 | Edresson Casanova, Julian Weber, Christopher Dane Shulby, Arnaldo Cândido Júnior, Eren Gölge, Moacir A. Ponti |
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for Everyone. |
ICML |
2022 |
DBLP BibTeX RDF |
|
15 | Siyang Wang, Joakim Gustafson, Éva Székely |
Evaluating Sampling-based Filler Insertion with Spontaneous TTS. |
LREC |
2022 |
DBLP BibTeX RDF |
|
15 | Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol |
KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics. |
LREC |
2022 |
DBLP BibTeX RDF |
|
15 | Elham Akhlaghi, Ingibjörg Iðha Auðhunardóttir, Anna Baczkowska, Branislav Bédi, Hakeem Beedar, Harald Berthelsen, Cathy Chua, Catia Cucchiarini, Hanieh Habibi, Ivana Horváthová, Junta Ikeda, Christèle Maizonniaux, Neasa Ní Chiaráin, Chadi Raheb, Manny Rayner, John Sloan, Nikos Tsourakis, Chunlin Yao |
Using the LARA Little Prince to compare human and TTS audio quality. |
LREC |
2022 |
DBLP BibTeX RDF |
|
15 | Sung-Woong Hwang, Joon-Hyuk Chang |
Document-Level Neural TTS Using Curriculum Learning and Attention Masking. |
IEEE Access |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Noé Tits, Kevin El Haddad, Thierry Dutoit |
Analysis and Assessment of Controllability of an Expressive Deep Learning-Based TTS System. |
Informatics |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Guangyu Liu, Bao Zhou, Yi Huang, Longfei Wang, Wei Wang, Enming Zhao |
Video image scaling technology based on adaptive interpolation algorithm and TTS FPGA implementation. |
Comput. Stand. Interfaces |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Ye Jia, Heiga Zen, Jonathan Shen, Yu Zhang 0033, Yonghui Wu |
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Hui Lu, Zhiyong Wu 0001, Xixin Wu, Xu Li, Shiyin Kang, Xunying Liu, Helen Meng |
VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Murali Karthick Baskar, Lukás Burget, Shinji Watanabe 0001, Ramón Fernandez Astudillo, Jan Honza Cernocký |
EAT: Enhanced ASR-TTS for Self-supervised Speech Recognition. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Heeseung Kim, Sungwon Kim 0001, Sungroh Yoon |
Guided-TTS: Text-to-Speech with Untranscribed Speech. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Myeonghun Jeong, Hyeongju Kim, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim |
Diff-TTS: A Denoising Diffusion Model for Text-to-Speech. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Peng Liu, Yuewen Cao, Songxiang Liu, Na Hu, Guangzhi Li, Chao Weng, Dan Su 0002 |
VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Xiaochun An, Frank K. Soong, Lei Xie 0001 |
Improving Performance of Seen and Unseen Speech Style Transfer in End-to-end Neural TTS. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Pol van Rijn, Silvan Mertes, Dominik Schiller, Peter M. C. Harrison, Pauline Larrouy-Maestri, Elisabeth André, Nori Jacoby |
Exploring emotional prototypes in a high dimensional TTS latent space. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Hieu-Thi Luong, Junichi Yamagishi |
Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Ali Raheem Mandeel, Mohammed Salah Al-Radhi, Tamás Gábor Csapó |
Speaker Adaptation with Continuous Vocoder-based DNN-TTS. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail A. Kudinov |
Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Penny Karanasou, Sri Karlapati, Alexis Moinet, Arnaud Joly, Ammar Abbas, Simon Slangen, Jaime Lorenzo-Trueba, Thomas Drugman |
A learned conditional prior for the VAE acoustic space of a TTS system. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Edresson Casanova, Julian Weber, Christopher Shulby, Arnaldo Cândido Júnior, Eren Gölge, Moacir Antonelli Ponti |
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang 0033, Jia Ye, R. J. Skerry-Ryan, Yonghui Wu |
Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Paarth Neekhara, Jason Li, Boris Ginsburg |
Adapting TTS models For New Speakers using Transfer Learning. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Shivam Mehta, Éva Székely, Jonas Beskow, Gustav Eje Henter |
Neural HMMs are all you need (for high-quality attention-free TTS). |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Brooke Stephenson, Thomas Hueber, Laurent Girin, Laurent Besacier |
Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Shoule Wu, Ziqiang Shi |
It$\hat{\text{o}}$TTS and It$\hat{\text{o}}$Wave: Linear Stochastic Differential Equation Is All You Need For Audio Generation. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Iván Vallés-Pérez, Julian Roth, Grzegorz Beringer, Roberto Barra-Chicote, Jasha Droppo |
Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Zongmin Liu |
Double Fuzzy Probabilistic Interval Linguistic Term Set and a Dynamic Fuzzy Decision Making Model based on Markov Process with tts Application in Multiple Criteria Group Decision Making. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Liping Chen, Yan Deng, Xi Wang 0016, Frank K. Soong, Lei He 0005 |
Speech BERT Embedding For Improving Prosody in Neural TTS. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Pascal Puchtler, Johannes Wirth 0001, René Peinl |
HUI-Audio-Corpus-German: A high quality TTS dataset. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Ji-Hoon Kim, Sang-Hoon Lee, Ji-Hyun Lee, Honggyu Jung, Seong-Whan Lee |
GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Sung-Feng Huang, Chyi-Jiunn Lin, Hung-yi Lee |
Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Raahil Shah, Kamil Pokora, Abdelhamid Ezzerg, Viacheslav Klimkov, Goeric Huybrechts, Bartosz Putrycz, Daniel Korzekwa, Thomas Merritt |
Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Shilun Lin, Wen-Chao Su, Li Meng, Fenglong Xie, Xinhui Li, Li Lu |
Nana-HDR: A Non-attentive Non-autoregressive Hybrid Model for TTS. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Rohan Badlani, Adrian Lancucki, Kevin J. Shih, Rafael Valle, Wei Ping, Bryan Catanzaro |
One TTS Alignment To Rule Them All. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Wen-Chin Huang, Tomoki Hayashi, Xinjian Li, Shinji Watanabe 0001, Tomoki Toda |
On Prosody Modeling for ASR+TTS based Voice Conversion. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Tuomo Raitio, Jiangchuan Li, Shreyas Seshadri |
Hierarchical prosody modeling and control in non-autoregressive parallel neural TTS. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Shreyas Seshadri, Tuomo Raitio, Dan Castellani, Jiangchuan Li |
Emphasis control for parallel neural TTS. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Mutian He 0001, Jingzhou Yang, Lei He 0005, Frank K. Soong |
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Noé Tits, Kevin El Haddad, Thierry Dutoit |
Analysis and Assessment of Controllability of an Expressive Deep Learning-based TTS system. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Shengkui Zhao, Hao Wang, Trung Hieu Nguyen 0001, Bin Ma 0001 |
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Rui Li, Dong Pu, Minnie Huang, Bill Huang |
Unet-TTS: Improving Unseen Speaker and Style Transfer in One-shot Voice Cloning. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Junchen Lu, Berrak Sisman, Rui Liu 0008, Mingyang Zhang 0003, Haizhou Li 0001 |
VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura 0001 |
Code-Switching ASR and TTS Using Semisupervised Learning with Machine Speech Chain. |
IEICE Trans. Inf. Syst. |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Kiyoshi Kurihara, Nobumasa Seiyama, Tadashi Kumano |
Prosodic Features Control by Symbols as Input of Sequence-to-Sequence Acoustic Modeling for Neural TTS. |
IEICE Trans. Inf. Syst. |
2021 |
DBLP BibTeX RDF |
|
15 | Noé Tits, Kevin El Haddad, Thierry Dutoit |
ICE-Talk 2: Interface for Controllable Expressive TTS with perceptual assessment tool. |
Softw. Impacts |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Liumeng Xue, Shifeng Pan, Lei He 0005, Lei Xie 0001, Frank K. Soong |
Cycle consistent network for end-to-end style transfer TTS training. |
Neural Networks |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Xiaochun An, Frank K. Soong, Shan Yang, Lei Xie 0001 |
Effective and direct control of neural TTS prosody by removing interactions between different attributes. |
Neural Networks |
2021 |
DBLP DOI BibTeX RDF |
|
Displaying result #301 - #400 of 1108 (100 per page; Change: ) Pages: [ <<][ 1][ 2][ 3][ 4][ 5][ 6][ 7][ 8][ 9][ 10][ 11][ 12][ >>] |
|