The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Publications at "INTERSPEECH"( http://dblp.L3S.de/Venues/INTERSPEECH )

URL (DBLP): http://dblp.uni-trier.de/db/conf/interspeech

Publication years (Num. hits)
2000 (923) 2001 (671) 2002 (679) 2003 (799) 2004 (776) 2005 (870) 2006 (660) 2007 (752) 2008 (764) 2009 (766) 2010 (782) 2011 (852) 2012 (678) 2013 (789) 2014 (641) 2015 (793) 2016 (819) 2017 (845) 2018 (792) 2019 (971) 2020 (1036) 2021 (1000) 2022 (1124)
Publication types (Num. hits)
inproceedings(18759) proceedings(23)
Venues (Conferences, Journals, ...)
Interspeech(18782)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
No Growbag Graphs found.

Results
Found 18782 publication records. Showing 18782 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
1Salah Zaiem, Titouan Parcollet, Slim Essid Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Hanbin Bae, Young-Sun Joo Enhancement of Pitch Controllability using Timbre-Preserving Pitch Augmentation in FastPitch. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Sreyan Ghosh, Sonal Kumar, Yaman Kumar 0001, Rajiv Ratn Shah, Srinivasan Umesh Span Classification with Structured Information for Disfluency Detection in Spoken Utterances. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Minho Jin, Chelsea Ju, Zeya Chen, Yi-Chieh Liu, Jasha Droppo, Andreas Stolcke Adversarial Reweighting for Speaker Verification Fairness. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Michel Cardoso Meneses, Rafael Bérgamo Holanda, Luis Vasconcelos Peres, Gabriela Dantas Rocha SiDi KWS: A Large-Scale Multilingual Dataset for Keyword Spotting. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Haodong Zhao, Wei Du, Junjie Guo, Gongshen Liu A Universal Identity Backdoor Attack against Speaker Verification based on Siamese Network. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Nan Li, Meng Ge, Longbiao Wang, Masashi Unoki, Sheng Li 0010, Jianwu Dang 0001 Global Signal-to-noise Ratio Estimation Based on Multi-subband Processing Using Convolutional Neural Network. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Vishal Sunder, Eric Fosler-Lussier, Samuel Thomas 0001, Hong-Kwang Kuo, Brian Kingsbury Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Kay Peterson, Audrey Tong, Yan Yu OpenASR21: The Second Open Challenge for Automatic Speech Recognition of Low-Resource Languages. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Danilo de Oliveira, Tal Peer, Timo Gerkmann Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ngoc-Quan Pham, Alexander Waibel, Jan Niehues Adaptive multilingual speech recognition with pretrained models. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Chao Zhang, Bo Li 0028, Tara N. Sainath, Trevor Strohman, Sepand Mavandadi, Shuo-Yiin Chang, Parisa Haghani Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Franziska Braun, Markus Förstel, Bastian Oppermann, Andreas Erzigkeit, Hartmut Lehfeld, Thomas Hillemacher, Korbinian Riedhammer Automated Evaluation of Standardized Dementia Screening Tests. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yiwen Shao, Jesús Villalba 0001, Sonal Joshi, Saurabh Kataria, Sanjeev Khudanpur, Najim Dehak Chunking Defense for Adversarial Attacks on ASR. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Eunkyung Yoo, Hyeonseop Song, Taehyeong Kim, Chul Lee Online Learning of Open-set Speaker Identification by Active User-registration. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Tao Liu, Shuai Fan 0005, Xu Xiang, Hongbo Song, Shaoxiong Lin, Jiaqi Sun, Tianyuan Han, Siyuan Chen, Binwei Yao, Sen Liu, Yifei Wu, Yanmin Qian, Kai Yu 0004 MSDWild: Multi-modal Speaker Diarization Dataset in the Wild. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Shuai Zhang 0014, Jiangyan Yi, Zhengkun Tian, Jianhua Tao 0001, Yu Ting Yeung, Liqun Deng reducing multilingual context confusion for end-to-end code-switching automatic speech recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Nik Vaessen, David A. van Leeuwen Training speaker recognition systems with limited data. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Hao Tan, Junjian Zhang, Huan Zhang, Le Wang 0008, Yaguan Qian, Zhaoquan Gu NRI-FGSM: An Efficient Transferable Adversarial Attack for Speaker Recognition Systems. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yutaro Sanada, Takumi Nakagawa, Yuichiro Wada, Kosaku Takanashi, Yuhui Zhang, Kiichi Tokuyama, Takafumi Kanamori, Tomonori Yamada Deep Self-Supervised Learning of Speech Denoising from Noisy Speeches. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ruida Li, Shuo Fang, Chenguang Ma, Liang Li Adaptive Rectangle Loss for Speaker Verification. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Zhan Zhang, Yuehai Wang, Jianyi Yang End-to-end Mispronunciation Detection with Simulated Error Distance. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ye Wang, Baishun Ling, Yanmeng Wang, Junhao Xue, Shaojun Wang, Jing Xiao 0006 Adversarial Knowledge Distillation For Robust Spoken Language Understanding. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ryo Masumura, Yoshihiro Yamazaki, Saki Mizuno, Naoki Makishima, Mana Ihori, Mihiro Uchida, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Shota Orihashi, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Long Chen, Yixiong Meng, Venkatesh Ravichandran, Andreas Stolcke Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Sarthak Yadav, Neil Zeghidour Learning neural audio features without supervision. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Jieun Song, Hae-Sung Jeon, Jieun Kiaer Use of prosodic and lexical cues for disambiguating wh-words in Korean. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Zeyang Song, Qi Liu, Qu Yang, Haizhou Li 0001 Knowledge distillation for In-memory keyword spotting model. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Xian Li, Xiaofei Li ATST: Audio Representation Learning with Teacher-Student Transformer. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Takafumi Moriya, Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Takahiro Shinozaki Streaming Target-Speaker ASR with Neural Transducer. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Aku Rouhe, Anja Virkkunen, Juho Leinonen 0002, Mikko Kurimo Low Resource Comparison of Attention-based and Hybrid ASR Exploiting wav2vec 2.0. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Guangzhi Sun, Chao Zhang 0031, Philip C. Woodland Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yanyan Yue, Jun Du, Mao-Kui He, Yu Ting Yeung, Renyu Wang Online Speaker Diarization with Core Samples Selection. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Mingqiong Luo Mandarin nasal place assimilation revisited: an acoustic study. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Benjamin O'Brien, Christine Meunier, Alain Ghio Evaluating the effects of modified speech on perceptual speaker identification performance. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yeonjin Cho, Sara Ng, Trang Tran 0001, Mari Ostendorf Leveraging Prosody for Punctuation Prediction of Spontaneous Speech. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Neha Reddy, Yoonjeong Lee, Zhaoyan Zhang, Dinesh K. Chhetri Optimal thyroplasty implant shape and stiffness for treatment of acute unilateral vocal fold paralysis: Evidence from a canine in vivo phonation model. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Jenthe Thienpondt, Kris Demuynck Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1A. Arunkumar, Srinivasan Umesh Joint Encoder-Decoder Self-Supervised Pre-training for ASR. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Puneet Mathur, Franck Dernoncourt, Quan Hung Tran, Jiuxiang Gu, Ani Nenkova, Vlad I. Morariu, Rajiv Jain, Dinesh Manocha DocLayoutTTS: Dataset and Baselines for Layout-informed Document-level Neural Speech Synthesis. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Qiongqiong Wang, Kong Aik Lee, Tianchi Liu 0004 Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA? Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka End-to-End Spontaneous Speech Recognition Using Disfluency Labeling. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Byeonggeun Kim, Seunghan Yang, Jangho Kim, Hyunsin Park, Juntae Lee, Simyung Chang Domain Generalization with Relaxed Instance Frequency-wise Normalization for Multi-device Acoustic Scene Classification. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Debarpan Bhattacharya, Debottam Dutta, Neeraj Kumar Sharma 0001, Srikanth Raj Chetupalli, Pravin Mote, Sriram Ganapathy, Chandrakiran C, Sahiti Nori, Suhail K. K, Sadhana Gonuguntla, Murali Alagesan Coswara: A website application enabling COVID-19 screening by analysing respiratory sound samples and health symptoms. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  BibTeX  RDF
1Ruizhe Cao, Sherif Abdulatif, Bin Yang 0009 CMGAN: Conformer-based Metric GAN for Speech Enhancement. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Danni Liu, Changhan Wang, Hongyu Gong, Xutai Ma, Yun Tang 0002, Juan Miguel Pino From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yizhou Wang, Rikke L. Bundgaard-Nielsen, Brett Baker, Olga Maxwell Native phonotactic interference in L2 vowel processing: Mouse-tracking reveals cognitive conflicts during identification. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Carolina Lins Machado, Volker Dellwo, Lei He 0021 Idiosyncratic lingual articulation of American English /æ/ and /ɑ/ using network analysis. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Cassia Valentini-Botinhao, Manuel Sam Ribeiro, Oliver Watts, Korin Richmond, Gustav Eje Henter Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Jisi Zhang, Catalin Zorila, Rama Doddipatla, Jon Barker On monoaural speech enhancement for automatic recognition of real noisy speech using mixture invariant training. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Chiori Hori, Takaaki Hori, Jonathan Le Roux Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Dongchao Yang, Helin Wang, Zhongjie Ye, Yuexian Zou, Wenwu Wang 0001 RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yogesh Virkar, Marcello Federico, Robert Enyedi, Roberto Barra-Chicote Prosodic alignment for off-screen automatic dubbing. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Sathvik Udupa, Aravind Illa, Prasanta Kumar Ghosh Streaming model for Acoustic to Articulatory Inversion with transformer networks. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Werner van der Merwe, Herman Kamper, Johan Adam du Preez A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Hyun-Wook Yoon, Ohsung Kwon, Hoyeon Lee, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim, Min-Jae Hwang Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ronit Damania, Christopher Homan, Emily Prud'hommeaux Combining Simple but Novel Data Augmentation Methods for Improving Conformer ASR. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Gordon Rennie, Olga Perepelkina, Alessandro Vinciarelli Which Model is Best: Comparing Methods and Metrics for Automatic Laughter Detection in a Naturalistic Conversational Dataset. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Hoang Thi Thu Uyen, Nguyen Anh Tu, Ta Duc Huy Vietnamese Capitalization and Punctuation Recovery Models. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Weixin Meng, Chengshi Zheng, Xiaodong Li 0002 Fully Automatic Balance between Directivity Factor and White Noise Gain for Large-scale Microphone Arrays in Diffuse Noise Fields. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Sebastian Peter Bayerl, Dominik Wagner, Elmar Nöth, Korbinian Riedhammer Detecting Dysfluencies in Stuttering Therapy Using wav2vec 2.0. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Chenyu Yang, Yu Wang Robust End-to-end Speaker Diarization with Generic Neural Clustering. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Jinchao Li, Shuai Wang, Yang Chao, Xunying Liu, Helen Meng Context-aware Multimodal Fusion for Emotion Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Marc-Antoine Georges, Jean-Luc Schwartz, Thomas Hueber Self-supervised speech unit discovery from articulatory and acoustic features using VQ-VAE. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Jirí Martínek, Christophe Cerisara, Pavel Král, Ladislav Lenc, Josef Baloun Weak supervision for Question Type Detection with large language models. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yusuke Shinohara, Shinji Watanabe 0001 Minimum latency training of sequence transducers for streaming end-to-end speech recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Kelvin Tran, Lingfeng Xu, Gabriela Stegmann, Julie Liss, Visar Berisha, Rene Utianski Investigating the Impact of Speech Compression on the Acoustics of Dysarthric Speech. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Christin Jose, Joe Wang, Grant P. Strimel, Mohammad Omar Khursheed, Yuriy Mishchenko, Brian Kulis Latency Control for Keyword Spotting. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Muqiao Yang, Joseph Konan, David Bick, Anurag Kumar 0003, Shinji Watanabe 0001, Bhiksha Raj Improving Speech Enhancement through Fine-Grained Speech Characteristics. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ruibin Yuan, Yuxuan Wu, Jacob Li, Jaxter Kim DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Lev Finkelstein, Heiga Zen, Norman Casagrande, Chun-an Chan, Ye Jia, Tom Kenter, Alexey Petelin, Jonathan Shen, Vincent Wan, Yu Zhang 0033, Yonghui Wu, Rob Clark Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Haaris Mehmood, Agnieszka Dobrowolska, Karthikeyan Saravanan, Mete Ozay FedNST: Federated Noisy Student Training for Automatic Speech Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Junyong Hao, Shunzhou Ye, Cheng Lu, Fei Dong, Jingang Liu, Dong Pi Soft-label Learn for No-Intrusive Speech Quality Assessment. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Zehui Yang, Yifan Chen, Lei Luo, Runyan Yang, Lingxuan Ye, Gaofeng Cheng, Ji Xu, Yaohui Jin, Qingqing Zhang, Pengyuan Zhang, Lei Xie, Yonghong Yan 0002 Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Mostafa Sadeghi, Paul Magron A Sparsity-promoting Dictionary Model for Variational Autoencoders. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Mengnan He, Tingwei Guo, Zhenxing Lu, Ruixiong Zhang, Caixia Gong Improving GAN-based vocoder for fast and high-quality speech synthesis. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Zalan Borsos, Matthew Sharifi, Marco Tagliasacchi SpeechPainter: Text-conditioned Speech Inpainting. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Shinnosuke Takamichi, Wataru Nakata, Naoko Tanji, Hiroshi Saruwatari J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Jeong-Hwan Choi, Joon-Young Yang, Ye-Rin Jeoung, Joon-Hyuk Chang Improved CNN-Transformer using Broadcasted Residual Learning for Text-Independent Speaker Verification. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Guodong Ma, Pengfei Hu 0004, Nurmemet Yolwas, Shen Huang, Hao Huang PM-MMUT: Boosted Phone-mask Data Augmentation using Multi-Modeling Unit Training for Phonetic-Reduction-Robust E2E Speech Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yeonghyeon Lee, Kangwook Jang, Jahyun Goo, Youngmoon Jung, Hoi Rin Kim FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Models. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ke Hu, Tara N. Sainath, Yanzhang He, Rohit Prabhavalkar, Trevor Strohman, Sepand Mavandadi, Weiran Wang Improving Deliberation by Text-Only and Semi-Supervised Training. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1L. L. Chamara Kasun, Chung Soo Ahn, Jagath C. Rajapakse, Zhiping Lin 0001, Guang-Bin Huang Discriminative Adversarial Learning for Speaker Independent Emotion Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Takeru Gorai, Daisuke Saito, Nobuaki Minematsu Text-to-speech synthesis using spectral modeling based on non-negative autoencoder. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Shuai Guo, Jiatong Shi, Tao Qian, Shinji Watanabe 0001, Qin Jin SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training Strategy. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Bahman Mirheidari, Daniel Blackburn, Heidi Christensen Automatic cognitive assessment: Combining sparse datasets with disparate cognitive scores. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ashutosh Pandey 0004, Buye Xu, Anurag Kumar 0003, Jacob Donley, Paul Calamia, DeLiang Wang Time-domain Ad-hoc Array Speech Enhancement Using a Triple-path Network. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Suliang Bu, Yunxin Zhao, Tuo Zhao Steering vector correction in MVDR beamformer for speech enhancement. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Sichen Zhang, Aijun Li Acquisition of Two Consecutive Neutral Tones in Mandarin-Speaking Preschoolers: Phonological Representation and Phonetic Realization. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Dan Lim, Sunghee Jung, Eesung Kim JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yuna Lee, Seung Jun Baek Keyword Spotting with Synthetic Data using Heterogeneous Knowledge Distillation. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Minseung Kim, Hyungchan Song, Sein Cheong, Jong Won Shin iDeepMMSE: An improved deep learning approach to MMSE speech and noise power spectrum estimation for speech enhancement. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Mohammad Zeineldeen, Jingjing Xu, Christoph Lüscher, Ralf Schlüter, Hermann Ney Improving the Training Recipe for a Robust Conformer-based Hybrid Model. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Dmitriy Serdyuk, Otavio Braga, Olivier Siohan Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Muti-Person Video. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Zvi Kons, Hagai Aronowitz, Edmilson da Silva Morais, Matheus Damasceno, Hong-Kwang Kuo, Samuel Thomas 0001, George Saon Extending RNN-T-based speech recognition systems with emotion and language classification. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Magdalena Rybicka, Jesús Villalba 0001, Najim Dehak, Konrad Kowalczyk End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Nobuyasu Itoh, George Saon Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Pranav Dheram, Murugesan Ramakrishnan, Anirudh Raju, I-Fan Chen, Brian King, Katherine Powell, Melissa Saboowala, Karan Shetty, Andreas Stolcke Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
Displaying result #601 - #700 of 18782 (100 per page; Change: )
Pages: [<<][1][2][3][4][5][6][7][8][9][10][11][12][13][14][15][16][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license