|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 149 occurrences of 116 keywords
|
|
|
Results
Found 1108 publication records. Showing 1108 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
26 | Hélder Ferreira, Diamantino Freitas |
Audio Rendering of Mathematical Formulae Using MathML and AudioMath. |
User Interfaces for All |
2004 |
DBLP DOI BibTeX RDF |
Audio Rendering of Mathematical Expressions, Conversion of mathematical formulae into text, Accessibility, Text-to-Speech, MathML |
26 | Gerasimos Xydas, Georgios Karberis, Georgios Kouroupetroglou |
Text Normalization for the Pronunciation of Non-standard Words in an Inflected Language. |
SETN |
2004 |
DBLP DOI BibTeX RDF |
|
26 | Jin-Seok Lee, Byeongchang Kim 0001, Gary Geunbae Lee |
Automatic corpus-based tone and break-index prediction using K-ToBI representation. |
ACM Trans. Asian Lang. Inf. Process. |
2002 |
DBLP DOI BibTeX RDF |
K-ToBI, phrase break, prosodic phrase, text-to-speech system, prosody, pitch, intonation |
26 | Hans Kruschke |
Simulation of Speaking Styles with Adapted Prosody. |
TSD |
2001 |
DBLP DOI BibTeX RDF |
|
26 | György Balogh, Ervin Dobler, Tamás Gröbler, Béla Smodics, Csaba Szepesvári |
FlexVoice: A Parametric Approach to High-Quality Speech Synthesis. |
TSD |
2000 |
DBLP DOI BibTeX RDF |
|
23 | Anna Inn-Tung Chen, Ming-Shing Chen, Tien-Ren Chen, Chen-Mou Cheng, Jintai Ding, Eric Li-Hsiang Kuo, Frost Yu-Shuang Lee, Bo-Yin Yang |
SSE Implementation of Multivariate PKCs on Modern x86 CPUs. |
CHES |
2009 |
DBLP DOI BibTeX RDF |
multivariate public key cryptosystem (MPKC), ?IC, vector instructions, SSSE3, Wiedemann, TTS, rainbow, SSE2 |
23 | Sri Hastuti Kurniawan, Adam J. Sporka |
Vocal interaction. |
CHI Extended Abstracts |
2008 |
DBLP DOI BibTeX RDF |
human voice, vocal interaction, asr, interaction style, speech interaction, tts |
23 | Kutz Arrieta, Igor Leturia, Urtza Iturraspe, Arantza Díaz de Ilarraza Sánchez, Kepa Sarasola, Inma Hernáez, Eva Navas |
AnHitz, Development and Integration of Language, Speech and Visual Technologies for Basque. |
ISUC |
2008 |
DBLP DOI BibTeX RDF |
QA, CLIR, MT, TTS, ASR |
23 | Xujie Wang, Ruwei Yun |
Design and Implement of Game Speech Interaction Based on Speech Synthesis Technique. |
Edutainment |
2008 |
DBLP DOI BibTeX RDF |
Speech Conversion, Speech Synthesis, Speech Interaction, TTS |
22 | Masatomo Kobayashi, Kentarou Fukuda, Hironobu Takagi, Chieko Asakawa |
Providing synthesized audio description for online videos. |
ASSETS |
2009 |
DBLP DOI BibTeX RDF |
external metadata, text-to-speech (tts), web accessibility, speech synthesis, online videos, audio description |
22 | Géza Németh, Gábor Olaszy, Mátyás Bartalis, Géza Kiss, Csaba Zainkó, Péter Mihajlik, Csaba Haraszti |
Automated Drug Information System for Aged and Visually Impaired Persons. |
ICCHP |
2008 |
DBLP DOI BibTeX RDF |
Speech based automatic drug information, Medicine Line, speech recognition for drug names, TTS for pharmaceutical texts |
22 | Hsi-Wen Wang, Chien-Chih Lai, Chun-Chieh Hsiao, Guan-Yu Hsieh, Ren-Guey Lee |
Design and Implementation of Secure Active RFID System with Cyptography and Authentication Mechanisms. |
IIH-MSP |
2008 |
DBLP DOI BibTeX RDF |
TTS cryptography, authentication, Active RFID |
22 | Chen Yang, Nan Chen, Pengfei Zhang, Zhen Jiao |
Flexible Multi-modal Interaction Technologies and User Interface Specially Designed for Chinese Car Infotainment System. |
HCI (3) |
2007 |
DBLP DOI BibTeX RDF |
Car Infotainment, Chinese ASR, Chinese TTS, Chinese NLU, Chinese Finger Stroke Recognition, Melody Recognition, User-centered design |
22 | Shoupeng Han, Kedi Huang |
Equivalent Semantic Translation from Parallel DEVS Models to Time Automata. |
International Conference on Computational Science (1) |
2007 |
DBLP DOI BibTeX RDF |
Discrete Event System Specification (DEVS), Timed Transition System (TTS), Timed Automata (TA), Semantic Equivalence |
22 | Rupal Patel, Michael Everett, Eldar Sadikov |
Loudmouth: : modifying text-to-speech synthesis in noise. |
ASSETS |
2006 |
DBLP DOI BibTeX RDF |
text-to-speech synthesis (TTS), speech synthesis, augmentative and alternative communication (AAC) |
15 | Jiewen Deng, Jinliang Deng, Du Yin, Renhe Jiang, Xuan Song 0001 |
TTS-Norm: Forecasting Tensor Time Series via Multi-Way Normalization. |
ACM Trans. Knowl. Discov. Data |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Rabbia Mahum, Aun Irtaza, Ali Javed, Haitham A. Mahmoud, Haseeb Hassan |
Correction: DeepDet: YAMNet with BottleNeck Attention Module (BAM) for TTS synthesis detection. |
EURASIP J. Audio Speech Music. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Rabbia Mahum, Aun Irtaza, Ali Javed, Haitham A. Mahmoud, Haseeb Hassan |
DeepDet: YAMNet with BottleNeck Attention Module (BAM) TTS synthesis detection. |
EURASIP J. Audio Speech Music. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Rendi Chevi, Alham Fikri Aji |
Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Haobin Tang, Xulong Zhang 0001, Ning Cheng 0001, Jing Xiao 0006, Jianzong Wang |
ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Yejin Jeon, Yunsu Kim 0001, Gary Geunbae Lee |
Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker Representations. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Ziqi Liang, Haoxiang Shi, Jiawei Wang, Keda Lu |
EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Chunhui Wang, Chang Zeng, Bowen Zhang, Ziyang Ma, Yefan Zhu, Zifeng Cai, Jian Zhao, Zhonglin Jiang, Yong Chen |
HAM-TTS: Hierarchical Acoustic Modeling for Token-Based Zero-Shot Text-to-Speech with Model and Data Scaling. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Sunghee Jung, Won Jang, Jaesam Yoon, Bongwan Kim |
Intelli-Z: Toward Intelligible Zero-Shot TTS. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Wonjune Kang, Yun Wang, Shun Zhang, Arthur Hinsvark, Qing He |
Multi-Task Learning for Front-End Text Processing in TTS. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Mateusz Lajszczak, Guillermo Cámbara, Yang Li, Fatih Beyhan, Arent van Korlaar, Fan Yang, Arnaud Joly, Álvaro Martín-Cortinas, Ammar Abbas, Adam Michalski, Alexis Moinet, Sri Karlapati, Ewa Muszynska, Haohan Guo, Bartosz Putrycz, Soledad López Gambino, Kayeon Yoo, Elena Sokolova, Thomas Drugman |
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Xiang Li, Fan Bu, Ambuj Mehrish, Yingting Li, Jiale Han, Bo Cheng 0001, Soujanya Poria |
CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Wei-Ping Huang, Sung-Feng Huang, Hung-yi Lee |
Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Xi Chen, Jiakun Pei, Liumeng Xue, Mingyang Zhang |
Transfer the linguistic representations from TTS to accent conversion with non-parallel data. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Claudio S. Pinhanez, Raul Fernandez, Marcelo Grave, Julio Nogima, Ron Hoory |
Creating an African American-Sounding TTS: Guidelines, Technical Challenges, and Surprising Evaluations. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Claudio Santos Pinhanez, Raul Fernandez, Marcelo Carpinette Grave, Julio Nogima, Ron Hoory |
Creating an African American-Sounding TTS: Guidelines, Technical Challenges, and Surprising Evaluations. |
IUI |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Wenhao Guan, Yishuang Li, Tao Li, Hukai Huang, Feng Wang, Jiayan Lin, Lingyan Huang, Lin Li, Qingyang Hong |
MM-TTS: Multi-Modal Prompt Based Style Transfer for Expressive Text-to-Speech Synthesis. |
AAAI |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Yejin Jeon, Yunsu Kim 0001, Gary Geunbae Lee |
Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker Representations. |
AAAI |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Pavan Tankala, Preethi Jyothi, Preeti Rao, Pushpak Bhattacharyya |
STORiCo: Storytelling TTS for Hindi with Character Voice Modulation. |
EACL (2) |
2024 |
DBLP BibTeX RDF |
|
15 | Edresson Casanova, Sandra M. Aluísio, Moacir Antonelli Ponti |
TTS applied to the generation of datasets for automatic speech recognition. |
PROPOR |
2024 |
DBLP BibTeX RDF |
|
15 | Kirthika Natarajan, Jeyalakshmi Chelliah, Jemin Vijayaselvan Mariyarose, Senthilkumar Andi, Bharathi Venkatachalam, Manjunathan Alagarsamy |
TTS System for Deafened and Vocally impaired persons in Native Language. |
J. Intell. Fuzzy Syst. |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Rabbia Mahum, Aun Irtaza, Ali Javed |
EDL-Det: A Robust TTS Synthesis Detector Using VGG19-Based YAMNet and Ensemble Learning Block. |
IEEE Access |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Davide Salvi, Brian C. Hosler, Paolo Bestagini, Matthew C. Stamm, Stefano Tubaro |
TIMIT-TTS: A Text-to-Speech Dataset for Multimodal Synthetic Media Detection. |
IEEE Access |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Yan Deng, Ning Wu, Chengjun Qiu, Yangyang Luo, Yan Chen |
MixGAN-TTS: Efficient and Stable Speech Synthesis Based on Diffusion Model. |
IEEE Access |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Giridhar Pamisetty, K. Sri Rama Murty |
Prosody-TTS: An End-to-End Speech Synthesis System with Prosody Control. |
Circuits Syst. Signal Process. |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Yi Zhou 0020, Zhizheng Wu 0001, Mingyang Zhang 0003, Xiaohai Tian, Haizhou Li 0001 |
TTS-Guided Training for Accent Conversion Without Parallel Data. |
IEEE Signal Process. Lett. |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Jianzong Wang, Pengcheng Li, Xulong Zhang 0001, Ning Cheng 0001, Jing Xiao 0006 |
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Haolin Chen, Philip N. Garner |
An investigation into the adaptability of a diffusion-based TTS model. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao |
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Ziyue Jiang 0001, Jinglin Liu, Yi Ren 0006, Jinzheng He, Chen Zhang, Zhenhui Ye, Pengfei Wei, Chunfeng Wang, Xiang Yin 0006, Zejun Ma, Zhou Zhao |
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Sewade Ogun, Vincent Colotte, Emmanuel Vincent 0001 |
Stochastic Pitch Prediction Improves the Diversity and Naturalness of Speech in Glow-TTS. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Ammar Abbas, Sri Karlapati, Bastian Schnell, Penny Karanasou, Marcel Granero Moya, Amith Nagaraj, Ayman Boustati, Nicole Peinelt, Alexis Moinet, Thomas Drugman |
eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Adriana Stan, Johannah O'Mahony |
An analysis on the effects of speaker embedding choice in non auto-regressive TTS. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Sen Liu, Yiwei Guo, Chenpeng Du, Xie Chen 0001, Kai Yu 0004 |
DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Siyang Wang, Gustav Eje Henter, Joakim Gustafson, Éva Székely |
A Comparative Study of Self-Supervised Speech Representations in Read and Spontaneous TTS. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Jia-Jyu Su, Pang-Chen Liao, Yen-Ting Lin, Wu-Hao Li, Guan-Ting Liou, Cheng-Che Kao, Wei-Cheng Chen, Jen-Chieh Chiang, Wen-Yang Chang, Pin-Han Lin, Chen-Yu Chiang |
VoiceBank-2023: A Multi-Speaker Mandarin Speech Corpus for Constructing Personalized TTS Systems for the Speech Impaired. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Xin Jing, Yi Chang, Zijiang Yang 0007, Jiangjian Xie, Andreas Triantafyllopoulos, Björn W. Schuller |
U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Yuang Li, Yinglu Li, Min Zhang 0042, Chang Su 0001, Mengyao Piao, Xiaosong Qiao, Jiawei Yu, Miaomiao Ma, Yanqing Zhao, Hao Yang 0006 |
CB-Whisper: Contextual Biasing Whisper using TTS-based Keyword Spotting. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Yuan Gao, Nobuyuki Morioka, Yu Zhang, Nanxin Chen |
E3 TTS: Easy End-to-End Diffusion-based Text to Speech. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Frederico S. Oliveira, Edresson Casanova, Arnaldo Cândido Júnior, Anderson da Silva Soares, Arlindo R. Galvão Filho |
CML-TTS A Multilingual Dataset for Speech Synthesis in Low-Resource Languages. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Georgi Tinchev, Marta Czarnowska, Kamil Deja, Kayoko Yanagisawa, Marius Cotescu |
Modelling low-resource accents without accent-specific TTS frontend. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Qi Chen, Ziyang Ma, Tao Liu, Xu Tan, Qu Lu, Xie Chen 0001, Kai Yu 0004 |
Improving Few-Shot Learning for Talking Face System with TTS Data Augmentation. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Po-Chun Hsu, Ali Elkahky, Wei-Ning Hsu, Yossi Adi, Tu Anh Nguyen, Jade Copet, Emmanuel Dupoux, Hung-yi Lee, Abdelrahman Mohamed |
Low-Resource Self-Supervised Learning with SSL-Enhanced TTS. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Hanglei Zhang, Yiwei Guo, Sen Liu, Xie Chen 0001, Kai Yu 0004 |
Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Cheng Gong, Xin Wang 0037, Erica Cooper, Dan Wells, Longbiao Wang, Jianwu Dang 0001, Korin Richmond, Junichi Yamagishi |
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Linhan Ma, Yongmao Zhang, Xinfa Zhu, Yi Lei, Ziqian Ning, Pengcheng Zhu 0004, Lei Xie |
Accent-VITS: accent transfer for end-to-end TTS. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Dongchao Yang, Songxiang Liu, Rongjie Huang, Guangzhi Lei, Chao Weng, Helen Meng, Dong Yu 0001 |
InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro |
Multilingual Multiaccented Multispeaker TTS with RADTTS. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Haobin Tang, Xulong Zhang 0001, Jianzong Wang, Ning Cheng 0001, Jing Xiao 0006 |
QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Yifan Yang, Feiyu Shen, Chenpeng Du, Ziyang Ma, Kai Yu 0004, Daniel Povey, Xie Chen 0001 |
Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Jiatong Shi, Yun Tang 0002, Ann Lee 0001, Hirofumi Inaguma, Changhan Wang, Juan Pino 0001, Shinji Watanabe 0001 |
Enhancing Speech-to-Speech Translation with Multiple TTS Targets. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Ambuj Mehrish, Abhinav Ramesh Kashyap, Yingting Li, Navonil Majumder, Soujanya Poria |
ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Yuanhao Chen |
Improving TTS for Shanghainese: Addressing Tone Sandhi via Word Segmentation. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Rohan Badlani, Akshit Arora, Subhankar Ghosh, Rafael Valle, Kevin J. Shih, João Felipe Santos, Boris Ginsburg, Bryan Catanzaro |
VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Ziyue Jiang 0001, Yi Ren 0006, Zhenhui Ye, Jinglin Liu, Chen Zhang, Qian Yang, Shengpeng Ji, Rongjie Huang, Chunfeng Wang, Xiang Yin 0006, Zejun Ma, Zhou Zhao |
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Haohan Guo, Fenglong Xie, Jiawen Kang, Yujia Xiao, Xixin Wu, Helen Meng |
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Jocelyn Huang, Evelina Bakhturina, Oktai Tatanov |
Automatic Heteronym Resolution Pipeline Using RAD-TTS Aligners. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Myeongjin Ko, Yong-Hoon Choi |
Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Massa Baali, Tomoki Hayashi, Hamdy Mubarak, Soumi Maiti, Shinji Watanabe 0001, Wassim El-Hajj, Ahmed Ali 0002 |
Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Wenhao Guan, Qi Su, Haodong Zhou, Shiyu Miao, Xingjia Xie, Lin Li, Qingyang Hong |
ReFlow-TTS: A Rectified Flow Model for High-fidelity Text-to-Speech. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Hanzhao Li, Xinfa Zhu, Liumeng Xue, Yang Song, Yunlin Chen, Lei Xie |
SponTTS: modeling and transferring spontaneous style for TTS. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Seongho Joo, Hyukhun Koh, Kyomin Jung |
DPP-TTS: Diversifying prosodic features of speech via determinantal point processes. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Tao Li, Chenxu Hu, Jian Cong, Xinfa Zhu, Jingbei Li, Qiao Tian, Yuping Wang, Lei Xie 0001 |
DiCLET-TTS: Diffusion Model based Cross-lingual Emotion Transfer for Text-to-Speech - A Study between English and Mandarin. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Wenhao Guan, Yishuang Li, Tao Li, Hukai Huang, Feng Wang, Jiayan Lin, Lingyan Huang, Lin Li, Qingyang Hong |
MM-TTS: Multi-modal Prompt based Style Transfer for Expressive Text-to-Speech Synthesis. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Ziyang Ma, Wen Wu, Zhisheng Zheng, Yiwei Guo, Qian Chen, Shiliang Zhang, Xie Chen 0001 |
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Junhyeok Lee, Wonbin Jung, Hyunjae Cho, Jaeyeon Kim |
PITS: Variational Pitch Inference without Fundamental Frequency for End-to-End Pitch-controllable TTS. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Atli Þór Sigurgeirsson, Simon King 0001 |
Using a Large Language Model to Control Speaking Style for Expressive TTS. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Anandaswarup Vadapalli |
An investigation of speaker independent phrase break models in End-to-End TTS systems. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Nicole Dodd, Michelle Cohn, Georgia Zellou |
Comparing alignment toward American, British, and Indian English text-to-speech (TTS) voices: influence of social attitudes and talker guise. |
Frontiers Comput. Sci. |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Junxiao Yu, Zhengyuan Xu, Xu He, Jian Wang, Bin Liu 0052, Rui Feng, Songsheng Zhu, Wei Wang 0217, Jianqing Li 0002 |
DIA-TTS: Deep-Inherited Attention-Based Text-to-Speech Synthesizer. |
Entropy |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Zeqing Zhao, Sifan Ma, Yan Jia, Jingyu Hou 0005, Lin Yang, Junjie Wang |
Disentangling Content Information by Combining ASR and TTS Bottleneck Features for Voice Conversion. |
Int. J. Asian Lang. Process. |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Tao Li, Chenxu Hu, Jian Cong, Xinfa Zhu, Jingbei Li, Qiao Tian, Yuping Wang, Lei Xie 0001 |
DiCLET-TTS: Diffusion Model Based Cross-Lingual Emotion Transfer for Text-to-Speech - A Study Between English and Mandarin. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Siqi Sun, Korin Richmond, Hao Tang 0002 |
Improving Seq2Seq TTS Frontends With Transcribed Speech Audio. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Jia-Jyu Su, Pang-Chen Liao, Yen-Ting Lin, Wu-Hao Li, Guan-Ting Liou, Cheng-Che Kao, Wei-Cheng Chen, Jen-Chieh Chiang, Wen-Yang Chang, Pin-Han Lin, Chen-Yu Chiang |
VoiceBank-2023: A Multi-Speaker Mandarin Speech Corpus for Constructing Personalized TTS Systems for the Speech Impaired. |
O-COCOSDA |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Wenjiang Chi, Xiaoqin Feng, Liumeng Xue, Yunlin Chen, Lei Xie, Zhifei Li |
Multi-granularity Semantic and Acoustic Stress Prediction for Expressive TTS. |
APSIPA ASC |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Mingyang Zhang 0003, Yi Zhou 0020, Zhizheng Wu 0001, Haizhou Li 0001 |
Zero-shot multi-speaker accent TTS with limited accent data. |
APSIPA ASC |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Adam K. Coyne, Conor McGinn |
The Effect of Human Prosody on Comprehension of TTS Robot Speech. |
RO-MAN |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Jianzong Wang, Pengcheng Li, Xulong Zhang 0001, Ning Cheng 0001, Jing Xiao 0006 |
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation. |
ISPA/BDCloud/SocialCom/SustainCom |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Anusha Prakash 0001, Srinivasan Umesh, Hema A. Murthy |
Towards Developing State-of-The-Art TTS Synthesisers for 13 Indian Languages with Signal Processing Aided Alignments. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Yuan Gao, Nobuyuki Morioka, Yu Zhang 0033, Nanxin Chen |
E3 TTS: Easy End-to-End Diffusion-Based Text To Speech. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Wei-Ping Huang, Sung-Feng Huang, Hung-Yi Lee |
Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Yingjie Li 0008, Chenye Zhao, Cornelia Caragea |
TTS: A Target-based Teacher-Student Framework for Zero-Shot Stance Detection. |
WWW |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Haitong Zhang, Xinyuan Yu, Yue Lin |
NSV-TTS: Non-Speech Vocalization Modeling And Transfer In Emotional Text-To-Speech. |
ICASSP |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Georgi Tinchev, Marta Czarnowska, Kamil Deja, Kayoko Yanagisawa, Marius Cotescu |
Modelling Low-Resource Accents Without Accent-Specific TTS Frontend. |
ICASSP |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Jiatong Shi, Yun Tang 0002, Ann Lee 0001, Hirofumi Inaguma, Changhan Wang, Juan Pino 0001, Shinji Watanabe 0001 |
Enhancing Speech-To-Speech Translation with Multiple TTS Targets. |
ICASSP |
2023 |
DBLP DOI BibTeX RDF |
|
Displaying result #101 - #200 of 1108 (100 per page; Change: ) Pages: [ <<][ 1][ 2][ 3][ 4][ 5][ 6][ 7][ 8][ 9][ 10][ 11][ >>] |
|