The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Searching for tts with no syntactic query expansion in all metadata.

Publication years (Num. hits)
1988-1996 (20) 1997-1998 (25) 1999-2000 (33) 2001 (16) 2002 (20) 2003 (33) 2004 (52) 2005 (36) 2006 (47) 2007 (38) 2008 (41) 2009 (38) 2010 (40) 2011 (18) 2012 (28) 2013 (21) 2014 (31) 2015 (22) 2016 (33) 2017 (26) 2018 (22) 2019 (50) 2020 (64) 2021 (100) 2022 (135) 2023 (98) 2024 (21)
Publication types (Num. hits)
article(321) incollection(3) inproceedings(783) phdthesis(1)
Venues (Conferences, Journals, ...)
INTERSPEECH(227) CoRR(206) ICASSP(83) TSD(41) SSW(37) ISCSLP(25) EUROSPEECH(18) LREC(18) IEEE Trans. Speech Audio Proce...(17) ICSLP(14) IEEE ACM Trans. Audio Speech L...(12) SPECOM(12) ICASSP (1)(11) PROPOR(10) SLT(10) Speech Commun.(9) More (+10 of total 260)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 149 occurrences of 116 keywords

Results
Found 1108 publication records. Showing 1108 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
26Hélder Ferreira, Diamantino Freitas Audio Rendering of Mathematical Formulae Using MathML and AudioMath. Search on Bibsonomy User Interfaces for All The full citation details ... 2004 DBLP  DOI  BibTeX  RDF Audio Rendering of Mathematical Expressions, Conversion of mathematical formulae into text, Accessibility, Text-to-Speech, MathML
26Gerasimos Xydas, Georgios Karberis, Georgios Kouroupetroglou Text Normalization for the Pronunciation of Non-standard Words in an Inflected Language. Search on Bibsonomy SETN The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
26Jin-Seok Lee, Byeongchang Kim 0001, Gary Geunbae Lee Automatic corpus-based tone and break-index prediction using K-ToBI representation. Search on Bibsonomy ACM Trans. Asian Lang. Inf. Process. The full citation details ... 2002 DBLP  DOI  BibTeX  RDF K-ToBI, phrase break, prosodic phrase, text-to-speech system, prosody, pitch, intonation
26Hans Kruschke Simulation of Speaking Styles with Adapted Prosody. Search on Bibsonomy TSD The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
26György Balogh, Ervin Dobler, Tamás Gröbler, Béla Smodics, Csaba Szepesvári FlexVoice: A Parametric Approach to High-Quality Speech Synthesis. Search on Bibsonomy TSD The full citation details ... 2000 DBLP  DOI  BibTeX  RDF
23Anna Inn-Tung Chen, Ming-Shing Chen, Tien-Ren Chen, Chen-Mou Cheng, Jintai Ding, Eric Li-Hsiang Kuo, Frost Yu-Shuang Lee, Bo-Yin Yang SSE Implementation of Multivariate PKCs on Modern x86 CPUs. Search on Bibsonomy CHES The full citation details ... 2009 DBLP  DOI  BibTeX  RDF multivariate public key cryptosystem (MPKC), ?IC, vector instructions, SSSE3, Wiedemann, TTS, rainbow, SSE2
23Sri Hastuti Kurniawan, Adam J. Sporka Vocal interaction. Search on Bibsonomy CHI Extended Abstracts The full citation details ... 2008 DBLP  DOI  BibTeX  RDF human voice, vocal interaction, asr, interaction style, speech interaction, tts
23Kutz Arrieta, Igor Leturia, Urtza Iturraspe, Arantza Díaz de Ilarraza Sánchez, Kepa Sarasola, Inma Hernáez, Eva Navas AnHitz, Development and Integration of Language, Speech and Visual Technologies for Basque. Search on Bibsonomy ISUC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF QA, CLIR, MT, TTS, ASR
23Xujie Wang, Ruwei Yun Design and Implement of Game Speech Interaction Based on Speech Synthesis Technique. Search on Bibsonomy Edutainment The full citation details ... 2008 DBLP  DOI  BibTeX  RDF Speech Conversion, Speech Synthesis, Speech Interaction, TTS
22Masatomo Kobayashi, Kentarou Fukuda, Hironobu Takagi, Chieko Asakawa Providing synthesized audio description for online videos. Search on Bibsonomy ASSETS The full citation details ... 2009 DBLP  DOI  BibTeX  RDF external metadata, text-to-speech (tts), web accessibility, speech synthesis, online videos, audio description
22Géza Németh, Gábor Olaszy, Mátyás Bartalis, Géza Kiss, Csaba Zainkó, Péter Mihajlik, Csaba Haraszti Automated Drug Information System for Aged and Visually Impaired Persons. Search on Bibsonomy ICCHP The full citation details ... 2008 DBLP  DOI  BibTeX  RDF Speech based automatic drug information, Medicine Line, speech recognition for drug names, TTS for pharmaceutical texts
22Hsi-Wen Wang, Chien-Chih Lai, Chun-Chieh Hsiao, Guan-Yu Hsieh, Ren-Guey Lee Design and Implementation of Secure Active RFID System with Cyptography and Authentication Mechanisms. Search on Bibsonomy IIH-MSP The full citation details ... 2008 DBLP  DOI  BibTeX  RDF TTS cryptography, authentication, Active RFID
22Chen Yang, Nan Chen, Pengfei Zhang, Zhen Jiao Flexible Multi-modal Interaction Technologies and User Interface Specially Designed for Chinese Car Infotainment System. Search on Bibsonomy HCI (3) The full citation details ... 2007 DBLP  DOI  BibTeX  RDF Car Infotainment, Chinese ASR, Chinese TTS, Chinese NLU, Chinese Finger Stroke Recognition, Melody Recognition, User-centered design
22Shoupeng Han, Kedi Huang Equivalent Semantic Translation from Parallel DEVS Models to Time Automata. Search on Bibsonomy International Conference on Computational Science (1) The full citation details ... 2007 DBLP  DOI  BibTeX  RDF Discrete Event System Specification (DEVS), Timed Transition System (TTS), Timed Automata (TA), Semantic Equivalence
22Rupal Patel, Michael Everett, Eldar Sadikov Loudmouth: : modifying text-to-speech synthesis in noise. Search on Bibsonomy ASSETS The full citation details ... 2006 DBLP  DOI  BibTeX  RDF text-to-speech synthesis (TTS), speech synthesis, augmentative and alternative communication (AAC)
15Jiewen Deng, Jinliang Deng, Du Yin, Renhe Jiang, Xuan Song 0001 TTS-Norm: Forecasting Tensor Time Series via Multi-Way Normalization. Search on Bibsonomy ACM Trans. Knowl. Discov. Data The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Rabbia Mahum, Aun Irtaza, Ali Javed, Haitham A. Mahmoud, Haseeb Hassan Correction: DeepDet: YAMNet with BottleNeck Attention Module (BAM) for TTS synthesis detection. Search on Bibsonomy EURASIP J. Audio Speech Music. Process. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Rabbia Mahum, Aun Irtaza, Ali Javed, Haitham A. Mahmoud, Haseeb Hassan DeepDet: YAMNet with BottleNeck Attention Module (BAM) TTS synthesis detection. Search on Bibsonomy EURASIP J. Audio Speech Music. Process. The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Rendi Chevi, Alham Fikri Aji Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Haobin Tang, Xulong Zhang 0001, Ning Cheng 0001, Jing Xiao 0006, Jianzong Wang ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Yejin Jeon, Yunsu Kim 0001, Gary Geunbae Lee Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker Representations. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Ziqi Liang, Haoxiang Shi, Jiawei Wang, Keda Lu EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Chunhui Wang, Chang Zeng, Bowen Zhang, Ziyang Ma, Yefan Zhu, Zifeng Cai, Jian Zhao, Zhonglin Jiang, Yong Chen HAM-TTS: Hierarchical Acoustic Modeling for Token-Based Zero-Shot Text-to-Speech with Model and Data Scaling. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Sunghee Jung, Won Jang, Jaesam Yoon, Bongwan Kim Intelli-Z: Toward Intelligible Zero-Shot TTS. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Wonjune Kang, Yun Wang, Shun Zhang, Arthur Hinsvark, Qing He Multi-Task Learning for Front-End Text Processing in TTS. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Mateusz Lajszczak, Guillermo Cámbara, Yang Li, Fatih Beyhan, Arent van Korlaar, Fan Yang, Arnaud Joly, Álvaro Martín-Cortinas, Ammar Abbas, Adam Michalski, Alexis Moinet, Sri Karlapati, Ewa Muszynska, Haohan Guo, Bartosz Putrycz, Soledad López Gambino, Kayeon Yoo, Elena Sokolova, Thomas Drugman BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Xiang Li, Fan Bu, Ambuj Mehrish, Yingting Li, Jiale Han, Bo Cheng 0001, Soujanya Poria CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Wei-Ping Huang, Sung-Feng Huang, Hung-yi Lee Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Xi Chen, Jiakun Pei, Liumeng Xue, Mingyang Zhang Transfer the linguistic representations from TTS to accent conversion with non-parallel data. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Claudio S. Pinhanez, Raul Fernandez, Marcelo Grave, Julio Nogima, Ron Hoory Creating an African American-Sounding TTS: Guidelines, Technical Challenges, and Surprising Evaluations. Search on Bibsonomy CoRR The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Claudio Santos Pinhanez, Raul Fernandez, Marcelo Carpinette Grave, Julio Nogima, Ron Hoory Creating an African American-Sounding TTS: Guidelines, Technical Challenges, and Surprising Evaluations. Search on Bibsonomy IUI The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Wenhao Guan, Yishuang Li, Tao Li, Hukai Huang, Feng Wang, Jiayan Lin, Lingyan Huang, Lin Li, Qingyang Hong MM-TTS: Multi-Modal Prompt Based Style Transfer for Expressive Text-to-Speech Synthesis. Search on Bibsonomy AAAI The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Yejin Jeon, Yunsu Kim 0001, Gary Geunbae Lee Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker Representations. Search on Bibsonomy AAAI The full citation details ... 2024 DBLP  DOI  BibTeX  RDF
15Pavan Tankala, Preethi Jyothi, Preeti Rao, Pushpak Bhattacharyya STORiCo: Storytelling TTS for Hindi with Character Voice Modulation. Search on Bibsonomy EACL (2) The full citation details ... 2024 DBLP  BibTeX  RDF
15Edresson Casanova, Sandra M. Aluísio, Moacir Antonelli Ponti TTS applied to the generation of datasets for automatic speech recognition. Search on Bibsonomy PROPOR The full citation details ... 2024 DBLP  BibTeX  RDF
15Kirthika Natarajan, Jeyalakshmi Chelliah, Jemin Vijayaselvan Mariyarose, Senthilkumar Andi, Bharathi Venkatachalam, Manjunathan Alagarsamy TTS System for Deafened and Vocally impaired persons in Native Language. Search on Bibsonomy J. Intell. Fuzzy Syst. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Rabbia Mahum, Aun Irtaza, Ali Javed EDL-Det: A Robust TTS Synthesis Detector Using VGG19-Based YAMNet and Ensemble Learning Block. Search on Bibsonomy IEEE Access The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Davide Salvi, Brian C. Hosler, Paolo Bestagini, Matthew C. Stamm, Stefano Tubaro TIMIT-TTS: A Text-to-Speech Dataset for Multimodal Synthetic Media Detection. Search on Bibsonomy IEEE Access The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Yan Deng, Ning Wu, Chengjun Qiu, Yangyang Luo, Yan Chen MixGAN-TTS: Efficient and Stable Speech Synthesis Based on Diffusion Model. Search on Bibsonomy IEEE Access The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Giridhar Pamisetty, K. Sri Rama Murty Prosody-TTS: An End-to-End Speech Synthesis System with Prosody Control. Search on Bibsonomy Circuits Syst. Signal Process. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Yi Zhou 0020, Zhizheng Wu 0001, Mingyang Zhang 0003, Xiaohai Tian, Haizhou Li 0001 TTS-Guided Training for Accent Conversion Without Parallel Data. Search on Bibsonomy IEEE Signal Process. Lett. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Jianzong Wang, Pengcheng Li, Xulong Zhang 0001, Ning Cheng 0001, Jing Xiao 0006 DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Haolin Chen, Philip N. Garner An investigation into the adaptability of a diffusion-based TTS model. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Ziyue Jiang 0001, Jinglin Liu, Yi Ren 0006, Jinzheng He, Chen Zhang, Zhenhui Ye, Pengfei Wei, Chunfeng Wang, Xiang Yin 0006, Zejun Ma, Zhou Zhao Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Sewade Ogun, Vincent Colotte, Emmanuel Vincent 0001 Stochastic Pitch Prediction Improves the Diversity and Naturalness of Speech in Glow-TTS. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Ammar Abbas, Sri Karlapati, Bastian Schnell, Penny Karanasou, Marcel Granero Moya, Amith Nagaraj, Ayman Boustati, Nicole Peinelt, Alexis Moinet, Thomas Drugman eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Adriana Stan, Johannah O'Mahony An analysis on the effects of speaker embedding choice in non auto-regressive TTS. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Sen Liu, Yiwei Guo, Chenpeng Du, Xie Chen 0001, Kai Yu 0004 DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Siyang Wang, Gustav Eje Henter, Joakim Gustafson, Éva Székely A Comparative Study of Self-Supervised Speech Representations in Read and Spontaneous TTS. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Jia-Jyu Su, Pang-Chen Liao, Yen-Ting Lin, Wu-Hao Li, Guan-Ting Liou, Cheng-Che Kao, Wei-Cheng Chen, Jen-Chieh Chiang, Wen-Yang Chang, Pin-Han Lin, Chen-Yu Chiang VoiceBank-2023: A Multi-Speaker Mandarin Speech Corpus for Constructing Personalized TTS Systems for the Speech Impaired. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Xin Jing, Yi Chang, Zijiang Yang 0007, Jiangjian Xie, Andreas Triantafyllopoulos, Björn W. Schuller U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Yuang Li, Yinglu Li, Min Zhang 0042, Chang Su 0001, Mengyao Piao, Xiaosong Qiao, Jiawei Yu, Miaomiao Ma, Yanqing Zhao, Hao Yang 0006 CB-Whisper: Contextual Biasing Whisper using TTS-based Keyword Spotting. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Yuan Gao, Nobuyuki Morioka, Yu Zhang, Nanxin Chen E3 TTS: Easy End-to-End Diffusion-based Text to Speech. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Frederico S. Oliveira, Edresson Casanova, Arnaldo Cândido Júnior, Anderson da Silva Soares, Arlindo R. Galvão Filho CML-TTS A Multilingual Dataset for Speech Synthesis in Low-Resource Languages. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Georgi Tinchev, Marta Czarnowska, Kamil Deja, Kayoko Yanagisawa, Marius Cotescu Modelling low-resource accents without accent-specific TTS frontend. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Qi Chen, Ziyang Ma, Tao Liu, Xu Tan, Qu Lu, Xie Chen 0001, Kai Yu 0004 Improving Few-Shot Learning for Talking Face System with TTS Data Augmentation. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Po-Chun Hsu, Ali Elkahky, Wei-Ning Hsu, Yossi Adi, Tu Anh Nguyen, Jade Copet, Emmanuel Dupoux, Hung-yi Lee, Abdelrahman Mohamed Low-Resource Self-Supervised Learning with SSL-Enhanced TTS. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Hanglei Zhang, Yiwei Guo, Sen Liu, Xie Chen 0001, Kai Yu 0004 Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Cheng Gong, Xin Wang 0037, Erica Cooper, Dan Wells, Longbiao Wang, Jianwu Dang 0001, Korin Richmond, Junichi Yamagishi ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Linhan Ma, Yongmao Zhang, Xinfa Zhu, Yi Lei, Ziqian Ning, Pengcheng Zhu 0004, Lei Xie Accent-VITS: accent transfer for end-to-end TTS. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Dongchao Yang, Songxiang Liu, Rongjie Huang, Guangzhi Lei, Chao Weng, Helen Meng, Dong Yu 0001 InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro Multilingual Multiaccented Multispeaker TTS with RADTTS. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Haobin Tang, Xulong Zhang 0001, Jianzong Wang, Ning Cheng 0001, Jing Xiao 0006 QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Yifan Yang, Feiyu Shen, Chenpeng Du, Ziyang Ma, Kai Yu 0004, Daniel Povey, Xie Chen 0001 Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Jiatong Shi, Yun Tang 0002, Ann Lee 0001, Hirofumi Inaguma, Changhan Wang, Juan Pino 0001, Shinji Watanabe 0001 Enhancing Speech-to-Speech Translation with Multiple TTS Targets. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Ambuj Mehrish, Abhinav Ramesh Kashyap, Yingting Li, Navonil Majumder, Soujanya Poria ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Yuanhao Chen Improving TTS for Shanghainese: Addressing Tone Sandhi via Word Segmentation. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Rohan Badlani, Akshit Arora, Subhankar Ghosh, Rafael Valle, Kevin J. Shih, João Felipe Santos, Boris Ginsburg, Bryan Catanzaro VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Ziyue Jiang 0001, Yi Ren 0006, Zhenhui Ye, Jinglin Liu, Chen Zhang, Qian Yang, Shengpeng Ji, Rongjie Huang, Chunfeng Wang, Xiang Yin 0006, Zejun Ma, Zhou Zhao Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Haohan Guo, Fenglong Xie, Jiawen Kang, Yujia Xiao, Xixin Wu, Helen Meng QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Jocelyn Huang, Evelina Bakhturina, Oktai Tatanov Automatic Heteronym Resolution Pipeline Using RAD-TTS Aligners. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Myeongjin Ko, Yong-Hoon Choi Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Massa Baali, Tomoki Hayashi, Hamdy Mubarak, Soumi Maiti, Shinji Watanabe 0001, Wassim El-Hajj, Ahmed Ali 0002 Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Wenhao Guan, Qi Su, Haodong Zhou, Shiyu Miao, Xingjia Xie, Lin Li, Qingyang Hong ReFlow-TTS: A Rectified Flow Model for High-fidelity Text-to-Speech. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Hanzhao Li, Xinfa Zhu, Liumeng Xue, Yang Song, Yunlin Chen, Lei Xie SponTTS: modeling and transferring spontaneous style for TTS. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Seongho Joo, Hyukhun Koh, Kyomin Jung DPP-TTS: Diversifying prosodic features of speech via determinantal point processes. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Tao Li, Chenxu Hu, Jian Cong, Xinfa Zhu, Jingbei Li, Qiao Tian, Yuping Wang, Lei Xie 0001 DiCLET-TTS: Diffusion Model based Cross-lingual Emotion Transfer for Text-to-Speech - A Study between English and Mandarin. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Wenhao Guan, Yishuang Li, Tao Li, Hukai Huang, Feng Wang, Jiayan Lin, Lingyan Huang, Lin Li, Qingyang Hong MM-TTS: Multi-modal Prompt based Style Transfer for Expressive Text-to-Speech Synthesis. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Ziyang Ma, Wen Wu, Zhisheng Zheng, Yiwei Guo, Qian Chen, Shiliang Zhang, Xie Chen 0001 Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Junhyeok Lee, Wonbin Jung, Hyunjae Cho, Jaeyeon Kim PITS: Variational Pitch Inference without Fundamental Frequency for End-to-End Pitch-controllable TTS. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Atli Þór Sigurgeirsson, Simon King 0001 Using a Large Language Model to Control Speaking Style for Expressive TTS. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Anandaswarup Vadapalli An investigation of speaker independent phrase break models in End-to-End TTS systems. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Nicole Dodd, Michelle Cohn, Georgia Zellou Comparing alignment toward American, British, and Indian English text-to-speech (TTS) voices: influence of social attitudes and talker guise. Search on Bibsonomy Frontiers Comput. Sci. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Junxiao Yu, Zhengyuan Xu, Xu He, Jian Wang, Bin Liu 0052, Rui Feng, Songsheng Zhu, Wei Wang 0217, Jianqing Li 0002 DIA-TTS: Deep-Inherited Attention-Based Text-to-Speech Synthesizer. Search on Bibsonomy Entropy The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Zeqing Zhao, Sifan Ma, Yan Jia, Jingyu Hou 0005, Lin Yang, Junjie Wang Disentangling Content Information by Combining ASR and TTS Bottleneck Features for Voice Conversion. Search on Bibsonomy Int. J. Asian Lang. Process. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Tao Li, Chenxu Hu, Jian Cong, Xinfa Zhu, Jingbei Li, Qiao Tian, Yuping Wang, Lei Xie 0001 DiCLET-TTS: Diffusion Model Based Cross-Lingual Emotion Transfer for Text-to-Speech - A Study Between English and Mandarin. Search on Bibsonomy IEEE ACM Trans. Audio Speech Lang. Process. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Siqi Sun, Korin Richmond, Hao Tang 0002 Improving Seq2Seq TTS Frontends With Transcribed Speech Audio. Search on Bibsonomy IEEE ACM Trans. Audio Speech Lang. Process. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Jia-Jyu Su, Pang-Chen Liao, Yen-Ting Lin, Wu-Hao Li, Guan-Ting Liou, Cheng-Che Kao, Wei-Cheng Chen, Jen-Chieh Chiang, Wen-Yang Chang, Pin-Han Lin, Chen-Yu Chiang VoiceBank-2023: A Multi-Speaker Mandarin Speech Corpus for Constructing Personalized TTS Systems for the Speech Impaired. Search on Bibsonomy O-COCOSDA The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Wenjiang Chi, Xiaoqin Feng, Liumeng Xue, Yunlin Chen, Lei Xie, Zhifei Li Multi-granularity Semantic and Acoustic Stress Prediction for Expressive TTS. Search on Bibsonomy APSIPA ASC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Mingyang Zhang 0003, Yi Zhou 0020, Zhizheng Wu 0001, Haizhou Li 0001 Zero-shot multi-speaker accent TTS with limited accent data. Search on Bibsonomy APSIPA ASC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Adam K. Coyne, Conor McGinn The Effect of Human Prosody on Comprehension of TTS Robot Speech. Search on Bibsonomy RO-MAN The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Jianzong Wang, Pengcheng Li, Xulong Zhang 0001, Ning Cheng 0001, Jing Xiao 0006 DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation. Search on Bibsonomy ISPA/BDCloud/SocialCom/SustainCom The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Anusha Prakash 0001, Srinivasan Umesh, Hema A. Murthy Towards Developing State-of-The-Art TTS Synthesisers for 13 Indian Languages with Signal Processing Aided Alignments. Search on Bibsonomy ASRU The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Yuan Gao, Nobuyuki Morioka, Yu Zhang 0033, Nanxin Chen E3 TTS: Easy End-to-End Diffusion-Based Text To Speech. Search on Bibsonomy ASRU The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Wei-Ping Huang, Sung-Feng Huang, Hung-Yi Lee Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization. Search on Bibsonomy ASRU The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Yingjie Li 0008, Chenye Zhao, Cornelia Caragea TTS: A Target-based Teacher-Student Framework for Zero-Shot Stance Detection. Search on Bibsonomy WWW The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Haitong Zhang, Xinyuan Yu, Yue Lin NSV-TTS: Non-Speech Vocalization Modeling And Transfer In Emotional Text-To-Speech. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Georgi Tinchev, Marta Czarnowska, Kamil Deja, Kayoko Yanagisawa, Marius Cotescu Modelling Low-Resource Accents Without Accent-Specific TTS Frontend. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
15Jiatong Shi, Yun Tang 0002, Ann Lee 0001, Hirofumi Inaguma, Changhan Wang, Juan Pino 0001, Shinji Watanabe 0001 Enhancing Speech-To-Speech Translation with Multiple TTS Targets. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
Displaying result #101 - #200 of 1108 (100 per page; Change: )
Pages: [<<][1][2][3][4][5][6][7][8][9][10][11][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license