Results
Found 1467 publication records. Showing 1467 according to the selection in the facets
Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
14 | Chunyu Qiang, Peng Yang, Hao Che, Ying Zhang, Xiaorui Wang, Zhongyuan Wang 0006 |
Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis. |
ICASSP ![In: IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2023, Rhodes Island, Greece, June 4-10, 2023, pp. 1-5, 2023, IEEE, 978-1-7281-6327-7. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Li-Wei Chen, Shinji Watanabe 0001, Alexander Rudnicky |
A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Units. |
ICASSP ![In: IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2023, Rhodes Island, Greece, June 4-10, 2023, pp. 1-5, 2023, IEEE, 978-1-7281-6327-7. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Detai Xin, Sharath Adavanne, Federico Ang, Ashish Kulkarni, Shinnosuke Takamichi, Hiroshi Saruwatari |
Improving Speech Prosody of Audiobook Text-To-Speech Synthesis with Acoustic and Textual Contexts. |
ICASSP ![In: IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2023, Rhodes Island, Greece, June 4-10, 2023, pp. 1-5, 2023, IEEE, 978-1-7281-6327-7. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Tian Huey Teh, Vivian Hu, Devang S. Ram Mohan, Zack Hodari, Christopher G. R. Wallis, Tomás Gómez Ibarrondo, Alexandra Torresquintero, James Leoni, Mark J. F. Gales, Simon King 0001 |
Ensemble Prosody Prediction For Expressive Speech Synthesis. |
ICASSP ![In: IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2023, Rhodes Island, Greece, June 4-10, 2023, pp. 1-5, 2023, IEEE, 978-1-7281-6327-7. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Giridhar Pamisetty, Sahukari Chaitanya Varun, K. Sri Rama Murty |
Lightweight Prosody-TTS for Multi-Lingual Multi-Speaker Scenario. |
ICASSP ![In: IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2023, Rhodes Island, Greece, June 4-10, 2023, pp. 1-2, 2023, IEEE, 978-1-7281-6327-7. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Abdul Rehman, Jian Jun Zhang, Xiaosong Yang |
Intonation Template Matching for Syllable-Level Prosody Encoding. |
COGAI@IJCLR ![In: Proceedings of the International Workshop on Cognitive AI 2023 co-located with the 3rd International Conference on Learning & Reasoning (IJCLR 2023), Bari, Italy, 13-15 November 2023., 2023, CEUR-WS.org. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP BibTeX RDF |
|
14 | Tijana V. Nosek, Sinisa Suzic, Vlado Delic, Milan Secujski |
Cross-lingual Text-to-Speech with Prosody Embedding. |
IWSSIP ![In: 30th International Conference on Systems, Signals and Image Processing, IWSSIP 2023, Ohrid, North Macedonia, June 27-29, 2023, pp. 1-5, 2023, IEEE, 979-8-3503-3729-7. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Yimin Deng, Huaizhen Tang, Xulong Zhang 0001, Jianzong Wang, Ning Cheng 0001, Jing Xiao 0006 |
PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion. |
ACM Multimedia ![In: Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023- 3 November 2023, pp. 184-192, 2023, ACM. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Hui Lu, Xixin Wu, Zhiyong Wu 0001, Helen Meng |
SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody. |
ACM Multimedia ![In: Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023- 3 November 2023, pp. 2829-2837, 2023, ACM. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Ming Li, Jiarui Li, Dan Zhang, Yukie Nagai |
Prosody-Based Vocal Emotional Alignment in Infant-Caregiver Interaction. |
ICDL ![In: IEEE International Conference on Development and Learning, ICDL 2023, Macau, China, November 9-11, 2023, pp. 361-366, 2023, IEEE, 978-1-6654-7075-9. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Caluã de Lacerda Pataca, Matthew Watkins, Roshan L. Peiris, Sooyeon Lee, Matt Huenerfauth |
Visualization of Speech Prosody and Emotion in Captions: Accessibility for Deaf and Hard-of-Hearing Users. |
CHI ![In: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, CHI 2023, Hamburg, Germany, April 23-28, 2023, pp. 831:1-831:15, 2023, ACM, 978-1-4503-9421-5. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Gaoxiang Cong, Liang Li 0003, Yuankai Qi, Zheng-Jun Zha, Qi Wu 0001, Wenyu Wang, Bin Jiang 0011, Ming-Hsuan Yang 0001, Qingming Huang |
Learning to Dub Movies via Hierarchical Prosody Models. |
CVPR ![In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023, pp. 14687-14697, 2023, IEEE, 979-8-3503-0129-8. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Giridhar Pamisetty, Gundluru Ramesh, K. Sri Rama Murty |
A non-linear source-filter based vocoder with prosody control. |
NCC ![In: 28th National Conference on Communications, NCC 2023, Guwahati, India, February 23-26, 2023, pp. 1-6, 2023, IEEE, 978-1-6654-5625-8. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | M. Rama Rajeswari, D. Govind 0001, Suryakanth V. Gangashetty, Akhilesh Kumar Dubey |
Improved Epoch Based Prosody Modification by Zero Frequency Filtering of Gabor Filtered Telephonic Speech. |
NCC ![In: 28th National Conference on Communications, NCC 2023, Guwahati, India, February 23-26, 2023, pp. 1-5, 2023, IEEE, 978-1-6654-5625-8. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Lukas Wolf, Tiago Pimentel, Evelina Fedorenko, Ryan Cotterell, Alex Warstadt, Ethan Wilcox, Tamar Regev |
Quantifying the redundancy between prosody and text. |
EMNLP ![In: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, Singapore, December 6-10, 2023, pp. 9765-9784, 2023, Association for Computational Linguistics, 979-8-89176-060-8. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Ji-Sang Hwang, Sang-Hoon Lee, Seong-Whan Lee |
PauseSpeech: Natural Speech Synthesis via Pre-trained Language Model and Pause-Based Prosody Modeling. |
ACPR (1) ![In: Pattern Recognition - 7th Asian Conference, ACPR 2023, Kitakyushu, Japan, November 5-8, 2023, Proceedings, Part I, pp. 415-427, 2023, Springer, 978-3-031-47633-4. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Junlin Li, Chu-Ren Huang |
Investigating Acoustic Cues of Emotional Valence in Mandarin Speech Prosody - A Corpus Approach. |
CLSW (2) ![In: Chinese Lexical Semantics - 24th Workshop, CLSW 2023, Singapore, Singapore, May 19-21, 2023, Revised Selected Papers, Part II, pp. 316-330, 2023, Springer, 978-981-97-0585-6. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Zhenhui Ye, Rongjie Huang, Yi Ren 0006, Ziyue Jiang 0001, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao |
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training. |
ACL (1) ![In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023., pp. 9317-9331, 2023, Association for Computational Linguistics, 978-1-959429-72-2. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
14 | Rose Sloan |
Using Linguistic Features to Improve Prosody for Text-to-Speech |
|
2023 |
DOI RDF |
|
14 | Yao Wang, Yongtao Xie |
Carlos Gussenhoven and Aoju Chen (eds.): The Oxford Handbook of Language Prosody. |
Phonetica ![In: Phonetica 79(5), pp. 513-521, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Mahmood Yenkimaleki, Vincent J. van Heuven |
Comparing the nativeness vs. intelligibility approach in prosody instruction for developing speaking skills by interpreter trainees: An experimental study. |
Speech Commun. ![In: Speech Commun. 137, pp. 92-102, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Katharina H. Menn, Christine Michel, Lars Meyer 0001, Stefanie Hoehl, Claudia Männel |
Natural infant-directed speech facilitates neural tracking of prosody. |
NeuroImage ![In: NeuroImage 251, pp. 118991, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Jason A. Shaw |
Micro-prosody. |
Lang. Linguistics Compass ![In: Lang. Linguistics Compass 16(2), 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Pavan Raju Kammili, B. H. V. S. Ramakrishnam Raju, A. Sri Krishna |
Handling emotional speech: a prosody based data augmentation technique for improving neutral speech trained ASR systems. |
Int. J. Speech Technol. ![In: Int. J. Speech Technol. 25(1), pp. 197-204, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Christian DiCanio, Wei-rong Chen, Joshua Benn, Jonathan D. Amith, Rey Castillo García |
Extreme stop allophony in Mixtec spontaneous speech: Data, word prosody, and modelling. |
J. Phonetics ![In: J. Phonetics 92, pp. 101147, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Fu-Sheng Tsai, Wei-Wen Chang, Chi-Chun Lee |
A Social Condition-Enhanced Network for Recognizing Power Distance Using Expressive Prosody and Intrinsic Brain Connectivity. |
IEEE Trans. Multim. ![In: IEEE Trans. Multim. 24, pp. 2046-2057, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Sri Karlapati, Penny Karanasou, Mateusz Lajszczak, Ammar Abbas, Alexis Moinet, Peter Makarov, Ray Li, Arent van Korlaar, Simon Slangen, Thomas Drugman |
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer. |
CoRR ![In: CoRR abs/2206.13443, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Hartmut Meister, Isa Samira Winter, Moritz Waeachtler, Pascale Sandmann, Khaled Abdellatif |
A virtual reality-based method for examining audiovisual prosody perception. |
CoRR ![In: CoRR abs/2209.05745, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Yiwei Guo, Chenpeng Du, Kai Yu 0004 |
Unsupervised word-level prosody tagging for controllable speech synthesis. |
CoRR ![In: CoRR abs/2202.07200, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP BibTeX RDF |
|
14 | Harm Lameris, Shivam Mehta, Gustav Eje Henter, Joakim Gustafson, Éva Székely |
Prosody-controllable spontaneous TTS with neural HMMs. |
CoRR ![In: CoRR abs/2211.13533, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Yutian Wang, Yuankun Xie, Kun Zhao, Hui Wang 0070, Qin Zhang |
Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis. |
CoRR ![In: CoRR abs/2204.03238, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Peter Makarov, Ammar Abbas, Mateusz Lajszczak, Arnaud Joly, Sri Karlapati, Alexis Moinet, Thomas Drugman, Penny Karanasou |
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody. |
CoRR ![In: CoRR abs/2206.14643, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Gaoxiang Cong, Liang Li 0003, Yuankai Qi, Zhengjun Zha, Qi Wu 0001, Wenyu Wang, Bin Jiang 0011, Ming-Hsuan Yang 0001, Qingming Huang |
Learning to Dub Movies via Hierarchical Prosody Models. |
CoRR ![In: CoRR abs/2212.04054, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Wendong Gan, Bolong Wen, Ying Yan, Haitao Chen, Zhichao Wang, Hongqiang Du, Lei Xie 0001, Kaixuan Guo, Hai Li |
IQDUBBING: Prosody modeling based on discrete self-supervised speech representation for expressive voice conversion. |
CoRR ![In: CoRR abs/2201.00269, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP BibTeX RDF |
|
14 | Xintao Zhao, Feng Liu, Changhe Song, Zhiyong Wu 0001, Shiyin Kang, Deyi Tuo, Helen Meng |
Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion. |
CoRR ![In: CoRR abs/2203.12813, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Luigi Attorresi, Davide Salvi, Clara Borrelli, Paolo Bestagini, Stefano Tubaro |
Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection. |
CoRR ![In: CoRR abs/2210.17222, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Guan-Ting Lin, Chi-Luen Feng, Wei-Ping Huang, Yuan Tseng, Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Nigel G. Ward |
On the Utility of Self-supervised Models for Prosody-related Tasks. |
CoRR ![In: CoRR abs/2210.07185, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Xin Yuan, Robin Feng, Mingming Ye |
Low-Resource Mongolian Speech Synthesis Based on Automatic Prosody Annotation. |
CoRR ![In: CoRR abs/2211.09365, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Konstantinos Klapsas, Karolos Nikitaras, Nikolaos Ellinas, June Sig Sung, Inchul Hwang, Spyros Raptis, Aimilios Chalamandaris, Pirros Tsiakoulis |
Predicting phoneme-level prosody latents using AR and flow-based Prior Networks for expressive speech synthesis. |
CoRR ![In: CoRR abs/2211.01327, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Jihwan Lee, Joun Yeop Lee, Heejin Choi, Seongkyu Mun, Sangjun Park, Chanwoo Kim 0001 |
Into-TTS : Intonation Template based Prosody Control System. |
CoRR ![In: CoRR abs/2204.01271, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Leyuan Qu, Taihao Li, Cornelius Weber, Theresa Pekarek-Rosin, Fuji Ren, Stefan Wermter |
Disentangling Prosody Representations with Unsupervised Speech Reconstruction. |
CoRR ![In: CoRR abs/2212.06972, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Tao Li, Xinsheng Wang, Qicong Xie, Zhichao Wang, Mingqi Jiang, Lei Xie 0001 |
Cross-speaker Emotion Transfer Based On Prosody Compensation for End-to-End Speech Synthesis. |
CoRR ![In: CoRR abs/2207.01198, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Guangyan Zhang, Ying Qin, Wenjie Zhang, Jialun Wu, Mei Li, Yutao Gai, Feijun Jiang, Tan Lee |
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre. |
CoRR ![In: CoRR abs/2206.14866, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Li-Wei Chen, Shinji Watanabe 0001, Alexander Rudnicky |
A unified one-shot prosody and speaker conversion system with self-supervised discrete speech units. |
CoRR ![In: CoRR abs/2211.06535, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Florian Lux, Julia Koch, Ngoc Thang Vu |
Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech. |
CoRR ![In: CoRR abs/2206.12229, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Kai Wei, Dillon Knox, Martin Radfar, Thanh Tran, Markus Müller, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris, Maurizio Omologo |
A neural prosody encoder for end-ro-end dialogue act classification. |
CoRR ![In: CoRR abs/2205.05590, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Kei Furukawa, Takeshi Kishiyama, Satoshi Nakamura 0001 |
Applying Syntax-Prosody Mapping Hypothesis and Prosodic Well-Formedness Constraints to Neural Sequence-to-Sequence Speech Synthesis. |
CoRR ![In: CoRR abs/2203.15276, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Detai Xin, Sharath Adavanne, Federico Ang, Ashish Kulkarni, Shinnosuke Takamichi, Hiroshi Saruwatari |
Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts. |
CoRR ![In: CoRR abs/2211.02336, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Yi Ren 0006, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen 0003, Zhijie Yan, Zhou Zhao |
ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech. |
CoRR ![In: CoRR abs/2202.07816, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP BibTeX RDF |
|
14 | Ziqian Dai, Jianwei Yu, Yan Wang, Nuo Chen, Yanyao Bian, Guangzhi Li, Deng Cai 0002, Dong Yu 0001 |
Automatic Prosody Annotation with Pre-Trained Text-Speech Model. |
CoRR ![In: CoRR abs/2206.07956, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Susmitha Vekkot, Deepa Gupta |
Fusion of spectral and prosody modelling for multilingual speech emotion conversion. |
Knowl. Based Syst. ![In: Knowl. Based Syst. 242, pp. 108360, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Irshad Ahmad Thukroo, Rumaan Bashir, Kaiser J. Giri |
Spoken Language Identification Using Prosody, Phonotactics, and Acoustics: A Review. |
J. Inf. Knowl. Manag. ![In: J. Inf. Knowl. Manag. 21(4), pp. 2250057:1-2250057:45, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Chenpeng Du, Kai Yu 0004 |
Phone-Level Prosody Modelling With GMM-Based MDN for Diverse and Controllable Speech Synthesis. |
IEEE ACM Trans. Audio Speech Lang. Process. ![In: IEEE ACM Trans. Audio Speech Lang. Process. 30, pp. 190-201, 2022. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Yuhan Yan, Shanpeng Li, Ying Chen 0015 |
In-group Advantage for Chinese and English Emotional Prosody in Quiet and Noise Conditions. |
ISCSLP ![In: 13th International Symposium on Chinese Spoken Language Processing, ISCSLP 2022, Singapore, December 11-14, 2022, pp. 305-309, 2022, IEEE, 979-8-3503-9796-3. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Min-Kyung Kim 0002, Joon-Hyuk Chang |
Adversarial and Sequential Training for Cross-lingual Prosody Transfer TTS. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 4556-4560, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Tao Li, Xinsheng Wang, Qicong Xie, Zhichao Wang, Mingqi Jiang, Lei Xie 0001 |
Cross-speaker Emotion Transfer Based On Prosody Compensation for End-to-End Speech Synthesis. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 5498-5502, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Kei Furukawa, Takeshi Kishiyama, Satoshi Nakamura 0001 |
Applying Syntax-Prosody Mapping Hypothesis and Prosodic Well-Formedness Constraints to Neural Sequence-to-Sequence Speech Synthesis. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 5258-5262, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Keiko Ochi, Nobutaka Ono, Keiho Owada, Miho Kuroda, Shigeki Sagayama, Hidenori Yamasue |
Use of Nods Less Synchronized with Turn-Taking and Prosody During Conversations in Adults with Autism. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 1136-1140, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Helen Gent, Chase Adams, Yan Tang, Chilin Shih |
Deep Learning for Prosody-Based Irony Classification in Spontaneous Speech. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 3993-3997, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Julian Zaïdi, Hugo Seuté, Benjamin van Niekerk, Marc-André Carbonneau |
Daft-Exprt: Cross-Speaker Prosody Transfer on Any Text for Expressive Speech Synthesis. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 4591-4595, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Raymond Chung, Brian Mak |
Synthesizing Near Native-accented Speech for a Non-native Speaker by Imitating the Pronunciation and Prosody of a Native Speaker. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 4302-4306, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Johannah O'Mahony, Catherine Lai, Simon King 0001 |
Combining conversational speech with read speech to improve prosody in Text-to-Speech synthesis. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 3388-3392, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Yu Suzuki, Tsuneo Kato, Akihiro Tamura |
Automatic Prosody Evaluation of L2 English Read Speech in Reference to Accent Dictionary with Transformer Encoder. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 4466-4470, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Yukun Peng, Zhenhua Ling |
Decoupled Pronunciation and Prosody Modeling in Meta-Learning-based Multilingual Speech Synthesis. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 4257-4261, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Yeonjin Cho, Sara Ng, Trang Tran 0001, Mari Ostendorf |
Leveraging Prosody for Punctuation Prediction of Spontaneous Speech. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 555-559, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Marzena Zygis, Sarah Wesolek, Nina Hosseini-Kivanani, Manfred Krifka |
The Prosody of Cheering in Sport Events. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 5283-5287, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Sri Karlapati, Penny Karanasou, Mateusz Lajszczak, Syed Ammar Abbas, Alexis Moinet, Peter Makarov, Ray Li, Arent van Korlaar, Simon Slangen, Thomas Drugman |
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 3363-3367, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Peter Makarov, Syed Ammar Abbas, Mateusz Lajszczak, Arnaud Joly, Sri Karlapati, Alexis Moinet, Thomas Drugman, Penny Karanasou |
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 3368-3372, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Haoquan Yang, Liqun Deng, Yu Ting Yeung, Nianzu Zheng, Yong Xu |
Streamable Speech Representation Disentanglement and Multi-Level Prosody Modeling for Live One-Shot Voice Conversion. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 2578-2582, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Ziqian Dai, Jianwei Yu, Yan Wang, Nuo Chen, Yanyao Bian, Guangzhi Li, Deng Cai 0002, Dong Yu 0001 |
Automatic Prosody Annotation with Pre-Trained Text-Speech Model. |
INTERSPEECH ![In: Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022., pp. 5513-5517, 2022, ISCA. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Antonio Galiza Cerdeira Gonzalez 0001, Wing Sum Lo, Ikuo Mizuuchi |
Talk to Kotaro: a web crowdsourcing study on the impact of phone and prosody choice for synthesized speech on human impression. |
RO-MAN ![In: 31st IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2022, Napoli, Italy, August 29 - Sept. 2, 2022, pp. 244-251, 2022, IEEE, 978-1-7281-8859-1. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Xintao Zhao, Feng Liu, Changhe Song, Zhiyong Wu 0001, Shiyin Kang, Deyi Tuo, Helen Meng |
Disentangling Content and Fine-Grained Prosody Information Via Hybrid ASR Bottleneck Features for Voice Conversion. |
ICASSP ![In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022, pp. 7022-7026, 2022, IEEE, 978-1-6654-0541-6. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Yi Ren 0006, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen 0003, Zhijie Yan, Zhou Zhao |
Prosospeech: Enhancing Prosody with Quantized Vector Pre-Training in Text-To-Speech. |
ICASSP ![In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022, pp. 7577-7581, 2022, IEEE, 978-1-6654-0541-6. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Cheng-I Jeff Lai, Erica Cooper, Yang Zhang 0001, Shiyu Chang, Kaizhi Qian, Yi-Lun Liao, Yung-Sung Chuang, Alexander H. Liu, Junichi Yamagishi, David D. Cox, James R. Glass |
On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis. |
ICASSP ![In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022, pp. 8447-8451, 2022, IEEE, 978-1-6654-0541-6. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Yiwei Guo, Chenpeng Du, Kai Yu 0004 |
Unsupervised Word-Level Prosody Tagging for Controllable Speech Synthesis. |
ICASSP ![In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022, pp. 7597-7601, 2022, IEEE, 978-1-6654-0541-6. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Keiko Ochi, Nobutaka Ono, Keiho Owada, Miho Kuroda, Shigeki Sagayama, Hidenori Yamasue |
Entrainment Analysis for Assessment of Autistic Speech Prosody Using Bottleneck Features of Deep Neural Network. |
ICASSP ![In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022, pp. 8492-8496, 2022, IEEE, 978-1-6654-0541-6. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Tobias Cornille, Fengna Wang, Jessa Bekker |
Interactive Multi-Level Prosody Control for Expressive Speech Synthesis. |
ICASSP ![In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022, pp. 8312-8316, 2022, IEEE, 978-1-6654-0541-6. The full citation details ...](Pics/full.jpeg) |
2022 |
DBLP DOI BibTeX RDF |
|
14 | George Sammit, Zhongjie Wu, Yihao Wang, Zhongdi Wu, Akihito Kamata, Joseph Nese, Eric C. Larson |
Automated Prosody Classification for Oral Reading Fluency with Quadratic Kappa Loss and Attentive X-Vectors. |
ICASSP [In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022, pp. 3613-3617, 2022, IEEE, 978-1-6654-0541-6] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Tuomo Raitio, Jiangchuan Li, Shreyas Seshadri |
Hierarchical Prosody Modeling and Control in Non-Autoregressive Parallel Neural TTS. |
ICASSP [In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022, pp. 7587-7591, 2022, IEEE, 978-1-6654-0541-6] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Yuanhao Yi, Lei He 0005, Shifeng Pan, Xi Wang 0016, Yujia Xiao |
Prosodyspeech: Towards Advanced Prosody Model for Neural Text-to-Speech. |
ICASSP [In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022, pp. 7582-7586, 2022, IEEE, 978-1-6654-0541-6] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Ning-Qian Wu, Zhaoci Liu, Zhen-Hua Ling |
Discourse-Level Prosody Modeling with a Variational Autoencoder for Non-Autoregressive Expressive Speech Synthesis. |
ICASSP [In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022, pp. 7592-7596, 2022, IEEE, 978-1-6654-0541-6] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Kai Wei, Dillon Knox, Martin Radfar, Thanh Tran, Markus Müller, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris, Maurizio Omologo |
A Neural Prosody Encoder for End-to-End Dialogue Act Classification. |
ICASSP [In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022, pp. 7047-7051, 2022, IEEE, 978-1-6654-0541-6] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Luigi Attorresi, Davide Salvi, Clara Borrelli, Paolo Bestagini, Stefano Tubaro |
Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection. |
ICPR Workshops (2) [In: Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges - Montreal, QC, Canada, August 21-25, 2022, Proceedings, Part II, pp. 247-263, 2022, Springer, 978-3-031-37741-9] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Igor Bascandziev, Michael LaSorsa, Patrick Shafto, Elizabeth Bonawitz |
Can children recognize pedagogical intent in the prosody of speech? |
CogSci [In: Proceedings of the 44th Annual Meeting of the Cognitive Science Society, CogSci 2022, Toronto, ON, Canada, July 27-30, 2022, 2022, cognitivesciencesociety.org] |
2022 |
DBLP BibTeX RDF |
|
14 | Guan-Ting Lin, Chi-Luen Feng, Wei-Ping Huang, Yuan Tseng, Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Nigel G. Ward |
On the Utility of Self-Supervised Models for Prosody-Related Tasks. |
SLT [In: IEEE Spoken Language Technology Workshop, SLT 2022, Doha, Qatar, January 9-12, 2023, pp. 1104-1111, 2022, IEEE, 979-8-3503-9690-4] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Florian Lux, Julia Koch, Ngoc Thang Vu |
Exact Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech. |
SLT [In: IEEE Spoken Language Technology Workshop, SLT 2022, Doha, Qatar, January 9-12, 2023, pp. 962-969, 2022, IEEE, 979-8-3503-9690-4] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Xinlei Zhang, Zixiong Su, Jun Rekimoto |
Aware: Intuitive Device Activation Using Prosody for Natural Voice Interactions. |
CHI [In: CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022 - 5 May 2022, pp. 432:1-432:16, 2022, ACM, 978-1-4503-9157-3] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Erik Ekstedt, Gabriel Skantze |
How Much Does Prosody Help Turn-taking? Investigations using Voice Activity Projection Models. |
SIGDIAL [In: Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, SIGDIAL 2022, Edinburgh, UK, 07-09 September 2022, pp. 541-551, 2022, Association for Computational Linguistics] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Yutian Wang, Yuankun Xie, Kun Zhao, Hui Wang 0070, Qin Zhang |
Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis. |
ICME [In: IEEE International Conference on Multimedia and Expo, ICME 2022, Taipei, Taiwan, July 18-22, 2022, pp. 1-6, 2022, IEEE, 978-1-6654-8563-0] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Michael Hassid, Michelle Tadmor Ramanovich, Brendan Shillingford, Miaosen Wang, Ye Jia, Tal Remez |
More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech. |
CVPR [In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022, pp. 10577-10587, 2022, IEEE, 978-1-6654-6946-3] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Chatri Chuangulueam, Boonserm Kijsirikul, Nuttakorn Thubthong |
Voice Impersonation for Thai Speech Using CycleGAN over Prosody. |
MSIE [In: MSIE 2022: 4th International Conference on Management Science and Industrial Engineering, Chiang Mai Thailand, April 28 - 30, 2022, pp. 443-447, 2022, ACM, 978-1-4503-9581-6] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Ayush Agarwal, Amitabh Swain, Jagabandhu Mishra, S. R. Mahadeva Prasanna |
Significance of Prosody Modification in Privacy Preservation on speaker verification. |
NCC [In: 27th National Conference on Communications, NCC 2022, Mumbai, India, May 24-27, 2022, pp. 245-249, 2022, IEEE, 978-1-6654-5136-9] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Shijun Wang, Damian Borth |
Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning. |
IJCNN [In: International Joint Conference on Neural Networks, IJCNN 2022, Padua, Italy, July 18-23, 2022, pp. 1-8, 2022, IEEE, 978-1-7281-8671-9] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Anna Leonteva, Tatiana Sokoreva |
Nonverbal Constituents of Argumentative Discourse: Gesture and Prosody Interaction. |
SPECOM [In: Speech and Computer - 24th International Conference, SPECOM 2022, Gurugram, India, November 14-16, 2022, Proceedings, pp. 416-425, 2022, Springer, 978-3-031-20979-6] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Xiaolong Lu |
Semantic Prosody: The Study of Gei in BA and BEI Constructions. |
CLSW (2) [In: Chinese Lexical Semantics - 23rd Workshop, CLSW 2022, Virtual Event, May 14-15, 2022, Revised Selected Papers, Part II, pp. 3-15, 2022, Springer, 978-3-031-28955-2] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Lichao Zhang, Zhou Zhao, Yi Ren 0006, Liqun Deng |
EditSinger: Zero-Shot Text-Based Singing Voice Editing System with Diverse Prosody Modeling. |
IJCAI [In: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, IJCAI 2022, Vienna, Austria, 23-29 July 2022, pp. 4503-4509, 2022, ijcai.org] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Ishita Nag, Salman Azeez Syed, Shreya Basu, Suvra Shaw, Barnali Gupta Banik |
Telegram Bot for Emotion Recognition Using Acoustic Cues and Prosody. |
CICBA [In: Computational Intelligence in Communications and Business Analytics - 4th International Conference, CICBA 2022, Silchar, India, January 7-8, 2022, Revised Selected Papers, pp. 389-402, 2022, Springer, 978-3-031-10765-8] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Evan Krisdityawan, Sho Yokota, Akihiro Matsumoto, Daisuke Chugo, Satoshi Muramatsu, Hiroshi Hashimoto |
Effect of Embodiment and Improving Japanese Students' English Pronunciation and Prosody with Humanoid Robot. |
HSI [In: 15th International Conference on Human System Interaction, HSI 2022, Melbourne, Australia, July 28-31, 2022, pp. 1-6, 2022, IEEE, 978-1-6654-6822-0] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Eugene Kharitonov, Ann Lee 0001, Adam Polyak, Yossi Adi, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Morgane Rivière, Abdelrahman Mohamed, Emmanuel Dupoux, Wei-Ning Hsu |
Text-Free Prosody-Aware Generative Spoken Language Modeling. |
ACL (1) [In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2022, Dublin, Ireland, May 22-27, 2022, pp. 8666-8681, 2022, Association for Computational Linguistics, 978-1-955917-21-6] |
2022 |
DBLP DOI BibTeX RDF |
|
14 | Vivek Bhardwaj, Vinay Kukreja, Amitoj Singh |
Usage of Prosody Modification and Acoustic Adaptation for Robust Automatic Speech Recognition (ASR) System. |
Rev. d'Intelligence Artif. [In: Rev. d'Intelligence Artif. 35(3), pp. 235-242, 2021] |
2021 |
DBLP DOI BibTeX RDF |
|
Displaying results #201 - #300 of 1467 (100 per page) |
|