The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Publications at "INTERSPEECH"( http://dblp.L3S.de/Venues/INTERSPEECH )

URL (DBLP): http://dblp.uni-trier.de/db/conf/interspeech

Publication years (Num. hits)
2000 (923) 2001 (671) 2002 (679) 2003 (799) 2004 (776) 2005 (870) 2006 (660) 2007 (752) 2008 (764) 2009 (766) 2010 (782) 2011 (852) 2012 (678) 2013 (789) 2014 (641) 2015 (793) 2016 (819) 2017 (845) 2018 (792) 2019 (971) 2020 (1036) 2021 (1000) 2022 (1124)
Publication types (Num. hits)
inproceedings(18759) proceedings(23)
Venues (Conferences, Journals, ...)
INTERSPEECH(18782)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
No Growbag Graphs found.

Results
Found 18782 publication records. Showing 18782 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
1Alon Levkovitch, Eliya Nachmani, Lior Wolf Zero-Shot Voice Conditioning for Denoising Diffusion TTS Models. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Anish Bhanushali, Grant Bridgman, Deekshitha G, Prasanta Kumar Ghosh, Pratik Kumar, Saurabh Kumar, Adithya Raj Kolladath, Nithya Ravi, Aaditeshwar Seth, Ashish Seth, Abhayjeet Singh, Vrunda N. Sukhadia, Srinivasan Umesh, Sathvik Udupa, Lodagala V. S. V. Durga Prasad Gram Vaani ASR Challenge on spontaneous telephone speech recordings in regional variations of Hindi. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Talia Ben Simon, Felix Kreuk, Faten Awwad, Jacob T. Cohen, Joseph Keshet Correcting Mispronunciations in Speech using Spectrogram Inpainting. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Christian Bergler, Alexander Barnhill, Dominik Perrin, Manuel Schmitt, Andreas K. Maier, Elmar Nöth ORCA-WHISPER: An Automatic Killer Whale Sound Type Generation Toolkit Using Deep Learning. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Bowen Zhang, Songjun Cao, Xiaoming Zhang, Yike Zhang, Long Ma, Takahiro Shinozaki Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Lars Rumberg, Christopher Gebauer, Hanna Ehlert, Maren Wallbaum, Lena Bornholt, Jörn Ostermann, Ulrike Lüdtke kidsTALC: A Corpus of 3- to 11-year-old German Children's Connected Natural Speech. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Chengfei Li, Shuhao Deng, Yaoping Wang, Guangjing Wang 0004, Yaguang Gong, Changbin Chen, Jinfeng Bai TALCS: An open-source Mandarin-English code-switching corpus and a speech recognition baseline. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Sankaran Panchapagesan, Arun Narayanan, Turaj Zakizadeh Shabestary, Shuai Shao, Nathan Howard, Alex Park 0001, James Walker, Alexander Gruenstein A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Jonathan Him Nok Lee, Dehua Tao, Harold Chui, Tan Lee, Sarah Luk, Nicolette Wing Tung Lee, Koonkan Fung Durational Patterning at Discourse Boundaries in Relation to Therapist Empathy in Psychotherapy. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Saurabh Kataria, Jesús Villalba 0001, Laureano Moro-Velázquez, Najim Dehak Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Mudit D. Batra, M. K. Jayesh, C. S. Ramalingam Robust Pitch Estimation Using Multi-Branch CNN-LSTM and 1-Norm LP Residual. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Zhong Meng, Yashesh Gaur, Naoyuki Kanda, Jinyu Li 0001, Xie Chen 0001, Yu Wu 0012, Yifan Gong 0001 Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Zihan Wang, Christer Gobl Contribution of the glottal flow residual in affect-related voice transformation. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ashtosh Sapru Using Data Augmentation and Consistency Regularization to Improve Semi-supervised Speech Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Deepak Baby, Pasquale D'Alterio, Valentin Mendelev Incremental learning for RNN-Transducer based speech recognition models. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Mahir Morshed, Mark Hasegawa-Johnson Cross-lingual articulatory feature information transfer for speech recognition using recurrent progressive neural networks. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Shrutina Agarwal, Naoya Takahashi, Sriram Ganapathy Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Keqi Deng, Shinji Watanabe 0001, Jiatong Shi, Siddhant Arora Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Chen Chen 0075, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng Interactive Auido-text Representation for Automated Audio Captioning with Contrastive Learning. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Amber Afshan, Abeer Alwan Learning from human perception to improve automatic speaker verification in style-mismatched conditions. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Teena tom Dieck, Paula Andrea Pérez-Toro, Tomas Arias, Elmar Nöth, Philipp Klumpp Wav2vec behind the Scenes: How end2end Models learn Phonetics. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Linjuan Cheng, Chengshi Zheng, Andong Li, Yuquan Wu, Renhua Peng, Xiaodong Li 0002 A deep complex multi-frame filtering network for stereophonic acoustic echo cancellation. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Hai-tao Xu, Jie Zhang 0042, Li-Rong Dai 0001 Differential Time-frequency Log-mel Spectrogram Features for Vision Transformer Based Infant Cry Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Nathan Joel Young, David Britain, Adrian Leemann A blueprint for using deepfakes in sociolinguistic matched-guise experiments. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Michael Chinen, Jan Skoglund, Chandan K. A. Reddy, Alessandro Ragano, Andrew Hines Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Tuan-Nam Nguyen, Ngoc-Quan Pham, Alexander Waibel Accent Conversion using Pre-trained Model and Synthesized Data from Voice Conversion. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yu Nakagome, Tatsuya Komatsu, Yusuke Fujita, Shuta Ichimura, Yusuke Kida InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yijie Lou, Shiliang Pu, Jianfeng Zhou, Xin Qi, Qinbo Dong, Hongwei Zhou A Deep One-Class Learning Method for Replay Attack Detection. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yang Zhang, Zhiqiang Lv, Haibin Wu, Shanshan Zhang, Pengfei Hu 0004, Zhiyong Wu 0001, Hung-yi Lee, Helen Meng MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Zhuoya Liu, Mark A. Huckvale, Julian McGlashan Automated Voice Pathology Discrimination from Continuous Speech Benefits from Analysis by Phonetic Context. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Zexu Pan, Meng Ge, Haizhou Li 0001 A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon, Chan-Ho Song, Min-Jae Hwang, Suhyeon Oh, Hyun-Wook Yoon, Jin-Seob Kim, Jae-Min Kim TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Srikanth Raj Chetupalli, Emanuël A. P. Habets Speech Separation for an Unknown Number of Speakers Using Transformers With Encoder-Decoder Attractors. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Zuheng Kang, Junqing Peng, Jianzong Wang, Jing Xiao 0006 SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Jian Zhu, Cong Zhang, David Jurgens ByT5 model for massively multilingual grapheme-to-phoneme conversion. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ying Hu, Yuwu Tang, Hao Huang, Liang He 0003 A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Giuseppe Magistro, Claudia Crocco Phonetic erosion and information structure in function words: the case of mia. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Daniel R. van Niekerk, Anqi Xu, Branislav Gerazov, Paul Konstantin Krug, Peter Birkholz, Yi Xu 0007 Exploration strategies for articulatory synthesis of complex syllable onsets. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Weiyi Zheng, Alex Xiao, Gil Keren, Duc Le, Frank Zhang 0001, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed Scaling ASR Improves Zero and Few Shot Learning. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Akihiko Takashima, Ryo Masumura, Atsushi Ando, Yoshihiro Yamazaki, Mihiro Uchida, Shota Orihashi Interactive Co-Learning with Cross-Modal Transformer for Audio-Visual Emotion Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Baihan Lin Voice2Alliance: Automatic Speaker Diarization and Quality Assurance of Conversational Alignment. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  BibTeX  RDF
1W. Ronny Huang, Steve Chien, Om Dipakbhai Thakkar, Rajiv Mathews Detecting Unintended Memorization in Language-Model-Fused ASR. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Taejin Park, Nithin Rao Koluguri, Jagadeesh Balam, Boris Ginsburg Multi-scale Speaker Diarization with Dynamic Scale Weighting. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Xiao Wei, Yuke Si, Shiquan Wang, Longbiao Wang, Jianwu Dang 0001 Hierarchical Tagger with Multi-task Learning for Cross-domain Slot Filling. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Zifeng Zhao, Rongzhi Gu, Dongchao Yang, Jinchuan Tian, Yuexian Zou Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Xu Li, Shansong Liu, Ying Shan A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yi Chang, Zhao Ren, Thanh Tam Nguyen, Wolfgang Nejdl, Björn W. Schuller Example-based Explanations with Adversarial Attacks for Respiratory Sound Analysis. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Dan Wells, Hao Tang, Korin Richmond Phonetic Analysis of Self-supervised Representations of English Speech. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Pranay Manocha, Zeyu Jin, Adam Finkelstein Audio Similarity is Unreliable as a Proxy for Audio Quality. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ali Siahkoohi, Michael Chinen, Tom Denton, W. Bastiaan Kleijn, Jan Skoglund Ultra-Low-Bitrate Speech Coding with Pretrained Transformers. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Seunghan Yang, Byeonggeun Kim, Inseop Chung, Simyung Chang Personalized Keyword Spotting through Multi-task Learning. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Shinimol Salim, Syed Shahnawazuddin, Waquar Ahmad Automatic Speaker Verification System for Dysarthria Patients. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Wenbin Jiang, Tao Liu, Kai Yu Efficient Speech Enhancement with Neural Homomorphic Synthesis. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Zhengyuan Liu, Nancy F. Chen Dynamic Sliding Window Modeling for Abstractive Meeting Summarization. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Qiang Xu, Tongtong Song, Longbiao Wang, Hao Shi, Yuqin Lin, Yongjie Lv, Meng Ge, Qiang Yu 0005, Jianwu Dang 0001 Self-Distillation Based on High-level Information Supervision for Compressing End-to-End ASR Model. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Puyuan Peng, David Harwath Word Discovery in Visually Grounded, Self-Supervised Speech Models. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Felix Weninger, Marco Gaudesi, Md. Akmal Haidar, Nicola Ferri, Jesús Andrés-Ferrer, Puming Zhan Conformer with dual-mode chunked attention for joint online and offline ASR. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Byeonggeun Kim, Seunghan Yang, Inseop Chung, Simyung Chang Dummy Prototypical Networks for Few-Shot Open-Set Keyword Spotting. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Mayank Sharma, Tarun Gupta, Kenny Qiu, Xiang Hao, Raffay Hamid CNN-based Audio Event Recognition for Automated Violence Classification and Rating for Prime Video Content. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Jun Chen 0024, Wei Rao, Zilin Wang, Zhiyong Wu 0001, Yannan Wang, Tao Yu, Shidong Shang, Helen Meng Speech Enhancement with Fullband-Subband Cross-Attention Network. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang 0033, Nicolás Serrano Reducing Domain mismatch in Self-supervised speech pre-training. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Marcely Zanon Boito, Laurent Besacier, Natalia A. Tomashenko, Yannick Estève A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Junjie Li, Meng Ge, Zexu Pan, Longbiao Wang, Jianwu Dang 0001 VCSE: Time-Domain Visual-Contextual Speaker Extraction Network. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Pu Wang, Hugo Van hamme Bottleneck Low-rank Transformers for Low-resource Spoken Language Understanding. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Muqiao Yang, Ian R. Lane, Shinji Watanabe 0001 Online Continual Learning of End-to-End Speech Recognition Models. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Kartik Audhkhasi, Yinghui Huang, Bhuvana Ramabhadran, Pedro J. Moreno 0001 Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Rosanna Turrisi, Leonardo Badino Interpretable dysarthric speaker adaptation based on optimal-transport. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Timm Koppelmann, Luca Becker, Alexandru Nelus, Rene Glitza, Lea Schönherr, Rainer Martin 0001 Clustering-based Wake Word Detection in Privacy-aware Acoustic Sensor Networks. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Nilaksh Das, Polo Chau Hear No Evil: Towards Adversarial Robustness of Automatic Speech Recognition via Multi-Task Learning. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Szu-Jui Chen, Jiamin Xie, John H. L. Hansen FeaRLESS: Feature Refinement Loss for Ensembling Self-Supervised Learning Features in Robust End-to-end Speech Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Wei-Ping Huang, Po-Chun Chen, Sung-Feng Huang, Hung-yi Lee Few Shot Cross-Lingual TTS Using Transferable Phoneme Embedding. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Boram Lee, Naomi Yamaguchi, Cécile Fougeron Why is Korean lenis stop difficult to perceive for L2 Korean learners? Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Xianchao Wu Deep Sparse Conformer for Speech Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Wiebke Toussaint, Lauriane Gorce, Aaron Yi Ding Design Guidelines for Inclusive Speaker Verification Evaluation Datasets. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Mohamed Maouche, Brij Mohan Lal Srivastava, Nathalie Vauquier, Aurélien Bellet, Marc Tommasi, Emmanuel Vincent 0001 Enhancing Speech Privacy with Slicing. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Hyeonuk Nam, Seong-Hu Kim, Byeong-Yun Ko, Yong-Hwa Park Frequency Dynamic Convolution: Frequency-Adaptive Pattern Recognition for Sound Event Detection. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L. Seltzer Streaming parallel transducer beam search with fast slow cascaded encoders. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ehsan Amid, Om Dipakbhai Thakkar, Arun Narayanan, Rajiv Mathews, Françoise Beaufays Extracting Targeted Training Data from ASR Models, and How to Mitigate It. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Rodrigo Schoburg Carrillo de Mira, Alexandros Haliassos, Stavros Petridis, Björn W. Schuller, Maja Pantic SVTS: Scalable Video-to-Speech Synthesis. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Gaoxiong Yi, Wei Xiao, Yiming Xiao, Babak Naderi, Sebastian Möller 0001, Wafaa Wardah, Gabriel Mittag, Ross Cutler, Zhuohuang Zhang, Donald S. Williamson, Fei Chen 0011, Fuzheng Yang, Shidong Shang ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yanjie Fu, Meng Ge, Haoran Yin, Xinyuan Qian, Longbiao Wang, Gaoyan Zhang, Jianwu Dang 0001 Iterative Sound Source Localization for Unknown Number of Sources. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Xiaofeng Shu, Yanjie Chen, Chuxiang Shang, Yan Zhao 0010, Chengshuai Zhao, Yehang Zhu, Chuanzeng Huang, Yuxuan Wang 0002 Non-intrusive Speech Quality Assessment with a Multi-Task Learning based Subband Adaptive Attention Temporal Convolutional Neural Network. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Helin Wang, Dongchao Yang, Chao Weng, Jianwei Yu, Yuexian Zou Improving Target Sound Extraction with Timestamp Information. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Felix Meyer, Wilfried Michel, Mohammad Zeineldeen, Ralf Schlüter, Hermann Ney Automatic Learning of Subword Dependent Model Scales. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Naoaki Suzuki, Satoshi Nakamura Representing 'how you say' with 'what you say': English corpus of focused speech and text reflecting corresponding implications. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Zijian Yang, Yingbo Gao, Alexander Gerstenberger, Jintao Jiang, Ralf Schlüter, Hermann Ney Self-Normalized Importance Sampling for Neural Language Modeling. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Marise Neijman, Femke Hof, Noelle Oosterom, Roland Pfau, Bertus van Rooy, Rob J. J. H. van Son, Michiel W. M. van den Brekel Compensation in Verbal and Nonverbal Communication after Total Laryngectomy. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Vinay Kothapally, Yong Xu 0004, Meng Yu 0003, Shi-Xiong Zhang, Dong Yu 0001 Joint Neural AEC and Beamforming with Double-Talk Detection. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Boris Bergsma, Minhao Yang, Milos Cernak PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Martin Radfar, Rohit Barnwal, Rupak Vignesh Swaminathan, Feng-Ju Chang, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris ConvRNN-T: Convolutional Augmented Recurrent Neural Network Transducers for Streaming Speech Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ya-Hsin Chang, Yun-Nung Chen Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Ayush Kumar, Vijit Malik, Jithendra Vepa Does Utterance entails Intent?: Evaluating Natural Language Inference Based Setup for Few-Shot Intent Detection. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Samuel Hollands, Daniel Blackburn, Heidi Christensen Evaluating the Performance of State-of-the-Art ASR Systems on Non-Native English using Corpora with Extensive Language Background Variation. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Sri Karlapati, Penny Karanasou, Mateusz Lajszczak, Syed Ammar Abbas, Alexis Moinet, Peter Makarov, Ray Li, Arent van Korlaar, Simon Slangen, Thomas Drugman CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Tuan-Duy H. Nguyen, Duy Phung, Duy Tran-Cong Nguyen, Hieu Minh Tran, Manh Luong, Tin Duy Vo, Hung Hai Bui, Dinh Q. Phung, Dat Quoc Nguyen A Vietnamese-English Neural Machine Translation System. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  BibTeX  RDF
1Heyang Xue, Xinsheng Wang, Yongmao Zhang, Lei Xie 0001, Pengcheng Zhu 0004, Mengxiao Bi Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Kevin Meng, Seo-Hyun Lee, Farhad Goodarzy, Simon J. Vogrin, Mark J. Cook, Seong-Whan Lee, David B. Grayden Evidence of Onset and Sustained Neural Responses to Isolated Phonemes from Intracranial Recordings in a Voice-based Cursor Control Task. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Yunhao Liang, Yanhua Long, Yijie Li, Jiaen Liang Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Hanseok Ko, John H. L. Hansen (eds.) Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
1Zhiheng Ouyang, Miao Wang, Wei-Ping Zhu 0001 Small Footprint Neural Networks for Acoustic Direction of Arrival Estimation. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
Displaying result #801 - #900 of 18782 (100 per page; Change: )
Pages: [<<][1][2][3][4][5][6][7][8][9][10][11][12][13][14][15][16][17][18][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license