|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
No Growbag Graphs found.
|
|
|
Results
Found 18782 publication records. Showing 18782 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | Alon Levkovitch, Eliya Nachmani, Lior Wolf |
Zero-Shot Voice Conditioning for Denoising Diffusion TTS Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Anish Bhanushali, Grant Bridgman, Deekshitha G, Prasanta Kumar Ghosh, Pratik Kumar, Saurabh Kumar, Adithya Raj Kolladath, Nithya Ravi, Aaditeshwar Seth, Ashish Seth, Abhayjeet Singh, Vrunda N. Sukhadia, Srinivasan Umesh, Sathvik Udupa, Lodagala V. S. V. Durga Prasad |
Gram Vaani ASR Challenge on spontaneous telephone speech recordings in regional variations of Hindi. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Talia Ben Simon, Felix Kreuk, Faten Awwad, Jacob T. Cohen, Joseph Keshet |
Correcting Mispronunciations in Speech using Spectrogram Inpainting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Christian Bergler, Alexander Barnhill, Dominik Perrin, Manuel Schmitt, Andreas K. Maier, Elmar Nöth |
ORCA-WHISPER: An Automatic Killer Whale Sound Type Generation Toolkit Using Deep Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Bowen Zhang, Songjun Cao, Xiaoming Zhang, Yike Zhang, Long Ma, Takahiro Shinozaki |
Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Lars Rumberg, Christopher Gebauer, Hanna Ehlert, Maren Wallbaum, Lena Bornholt, Jörn Ostermann, Ulrike Lüdtke |
kidsTALC: A Corpus of 3- to 11-year-old German Children's Connected Natural Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chengfei Li, Shuhao Deng, Yaoping Wang, Guangjing Wang 0004, Yaguang Gong, Changbin Chen, Jinfeng Bai |
TALCS: An open-source Mandarin-English code-switching corpus and a speech recognition baseline. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sankaran Panchapagesan, Arun Narayanan, Turaj Zakizadeh Shabestary, Shuai Shao, Nathan Howard, Alex Park 0001, James Walker, Alexander Gruenstein |
A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jonathan Him Nok Lee, Dehua Tao, Harold Chui, Tan Lee, Sarah Luk, Nicolette Wing Tung Lee, Koonkan Fung |
Durational Patterning at Discourse Boundaries in Relation to Therapist Empathy in Psychotherapy. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Saurabh Kataria, Jesús Villalba 0001, Laureano Moro-Velázquez, Najim Dehak |
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mudit D. Batra, M. K. Jayesh, C. S. Ramalingam |
Robust Pitch Estimation Using Multi-Branch CNN-LSTM and 1-Norm LP Residual. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhong Meng, Yashesh Gaur, Naoyuki Kanda, Jinyu Li 0001, Xie Chen 0001, Yu Wu 0012, Yifan Gong 0001 |
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zihan Wang, Christer Gobl |
Contribution of the glottal flow residual in affect-related voice transformation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ashtosh Sapru |
Using Data Augmentation and Consistency Regularization to Improve Semi-supervised Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Deepak Baby, Pasquale D'Alterio, Valentin Mendelev |
Incremental learning for RNN-Transducer based speech recognition models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mahir Morshed, Mark Hasegawa-Johnson |
Cross-lingual articulatory feature information transfer for speech recognition using recurrent progressive neural networks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shrutina Agarwal, Naoya Takahashi, Sriram Ganapathy |
Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Keqi Deng, Shinji Watanabe 0001, Jiatong Shi, Siddhant Arora |
Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chen Chen 0075, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng |
Interactive Auido-text Representation for Automated Audio Captioning with Contrastive Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Amber Afshan, Abeer Alwan |
Learning from human perception to improve automatic speaker verification in style-mismatched conditions. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Teena tom Dieck, Paula Andrea Pérez-Toro, Tomas Arias, Elmar Nöth, Philipp Klumpp |
Wav2vec behind the Scenes: How end2end Models learn Phonetics. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Linjuan Cheng, Chengshi Zheng, Andong Li, Yuquan Wu, Renhua Peng, Xiaodong Li 0002 |
A deep complex multi-frame filtering network for stereophonic acoustic echo cancellation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hai-tao Xu, Jie Zhang 0042, Li-Rong Dai 0001 |
Differential Time-frequency Log-mel Spectrogram Features for Vision Transformer Based Infant Cry Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nathan Joel Young, David Britain, Adrian Leemann |
A blueprint for using deepfakes in sociolinguistic matched-guise experiments. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Michael Chinen, Jan Skoglund, Chandan K. A. Reddy, Alessandro Ragano, Andrew Hines |
Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tuan-Nam Nguyen, Ngoc-Quan Pham, Alexander Waibel |
Accent Conversion using Pre-trained Model and Synthesized Data from Voice Conversion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yu Nakagome, Tatsuya Komatsu, Yusuke Fujita, Shuta Ichimura, Yusuke Kida |
InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yijie Lou, Shiliang Pu, Jianfeng Zhou, Xin Qi, Qinbo Dong, Hongwei Zhou |
A Deep One-Class Learning Method for Replay Attack Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yang Zhang, Zhiqiang Lv, Haibin Wu, Shanshan Zhang, Pengfei Hu 0004, Zhiyong Wu 0001, Hung-yi Lee, Helen Meng |
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhuoya Liu, Mark A. Huckvale, Julian McGlashan |
Automated Voice Pathology Discrimination from Continuous Speech Benefits from Analysis by Phonetic Context. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zexu Pan, Meng Ge, Haizhou Li 0001 |
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon, Chan-Ho Song, Min-Jae Hwang, Suhyeon Oh, Hyun-Wook Yoon, Jin-Seob Kim, Jae-Min Kim |
TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Srikanth Raj Chetupalli, Emanuël A. P. Habets |
Speech Separation for an Unknown Number of Speakers Using Transformers With Encoder-Decoder Attractors. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zuheng Kang, Junqing Peng, Jianzong Wang, Jing Xiao 0006 |
SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jian Zhu, Cong Zhang, David Jurgens |
ByT5 model for massively multilingual grapheme-to-phoneme conversion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ying Hu, Yuwu Tang, Hao Huang, Liang He 0003 |
A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Giuseppe Magistro, Claudia Crocco |
Phonetic erosion and information structure in function words: the case of mia. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Daniel R. van Niekerk, Anqi Xu, Branislav Gerazov, Paul Konstantin Krug, Peter Birkholz, Yi Xu 0007 |
Exploration strategies for articulatory synthesis of complex syllable onsets. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Weiyi Zheng, Alex Xiao, Gil Keren, Duc Le, Frank Zhang 0001, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed |
Scaling ASR Improves Zero and Few Shot Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Akihiko Takashima, Ryo Masumura, Atsushi Ando, Yoshihiro Yamazaki, Mihiro Uchida, Shota Orihashi |
Interactive Co-Learning with Cross-Modal Transformer for Audio-Visual Emotion Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Baihan Lin |
Voice2Alliance: Automatic Speaker Diarization and Quality Assurance of Conversational Alignment. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | W. Ronny Huang, Steve Chien, Om Dipakbhai Thakkar, Rajiv Mathews |
Detecting Unintended Memorization in Language-Model-Fused ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Taejin Park, Nithin Rao Koluguri, Jagadeesh Balam, Boris Ginsburg |
Multi-scale Speaker Diarization with Dynamic Scale Weighting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xiao Wei, Yuke Si, Shiquan Wang, Longbiao Wang, Jianwu Dang 0001 |
Hierarchical Tagger with Multi-task Learning for Cross-domain Slot Filling. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zifeng Zhao, Rongzhi Gu, Dongchao Yang, Jinchuan Tian, Yuexian Zou |
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xu Li, Shansong Liu, Ying Shan |
A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yi Chang, Zhao Ren, Thanh Tam Nguyen, Wolfgang Nejdl, Björn W. Schuller |
Example-based Explanations with Adversarial Attacks for Respiratory Sound Analysis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dan Wells, Hao Tang, Korin Richmond |
Phonetic Analysis of Self-supervised Representations of English Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Pranay Manocha, Zeyu Jin, Adam Finkelstein |
Audio Similarity is Unreliable as a Proxy for Audio Quality. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ali Siahkoohi, Michael Chinen, Tom Denton, W. Bastiaan Kleijn, Jan Skoglund |
Ultra-Low-Bitrate Speech Coding with Pretrained Transformers. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Seunghan Yang, Byeonggeun Kim, Inseop Chung, Simyung Chang |
Personalized Keyword Spotting through Multi-task Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shinimol Salim, Syed Shahnawazuddin, Waquar Ahmad |
Automatic Speaker Verification System for Dysarthria Patients. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Wenbin Jiang, Tao Liu, Kai Yu |
Efficient Speech Enhancement with Neural Homomorphic Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhengyuan Liu, Nancy F. Chen |
Dynamic Sliding Window Modeling for Abstractive Meeting Summarization. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Qiang Xu, Tongtong Song, Longbiao Wang, Hao Shi, Yuqin Lin, Yongjie Lv, Meng Ge, Qiang Yu 0005, Jianwu Dang 0001 |
Self-Distillation Based on High-level Information Supervision for Compressing End-to-End ASR Model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Puyuan Peng, David Harwath |
Word Discovery in Visually Grounded, Self-Supervised Speech Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Felix Weninger, Marco Gaudesi, Md. Akmal Haidar, Nicola Ferri, Jesús Andrés-Ferrer, Puming Zhan |
Conformer with dual-mode chunked attention for joint online and offline ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Byeonggeun Kim, Seunghan Yang, Inseop Chung, Simyung Chang |
Dummy Prototypical Networks for Few-Shot Open-Set Keyword Spotting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mayank Sharma, Tarun Gupta, Kenny Qiu, Xiang Hao, Raffay Hamid |
CNN-based Audio Event Recognition for Automated Violence Classification and Rating for Prime Video Content. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jun Chen 0024, Wei Rao, Zilin Wang, Zhiyong Wu 0001, Yannan Wang, Tao Yu, Shidong Shang, Helen Meng |
Speech Enhancement with Fullband-Subband Cross-Attention Network. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang 0033, Nicolás Serrano |
Reducing Domain mismatch in Self-supervised speech pre-training. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Marcely Zanon Boito, Laurent Besacier, Natalia A. Tomashenko, Yannick Estève |
A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Junjie Li, Meng Ge, Zexu Pan, Longbiao Wang, Jianwu Dang 0001 |
VCSE: Time-Domain Visual-Contextual Speaker Extraction Network. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Pu Wang, Hugo Van hamme |
Bottleneck Low-rank Transformers for Low-resource Spoken Language Understanding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Muqiao Yang, Ian R. Lane, Shinji Watanabe 0001 |
Online Continual Learning of End-to-End Speech Recognition Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kartik Audhkhasi, Yinghui Huang, Bhuvana Ramabhadran, Pedro J. Moreno 0001 |
Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Rosanna Turrisi, Leonardo Badino |
Interpretable dysarthric speaker adaptation based on optimal-transport. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Timm Koppelmann, Luca Becker, Alexandru Nelus, Rene Glitza, Lea Schönherr, Rainer Martin 0001 |
Clustering-based Wake Word Detection in Privacy-aware Acoustic Sensor Networks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nilaksh Das, Polo Chau |
Hear No Evil: Towards Adversarial Robustness of Automatic Speech Recognition via Multi-Task Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Szu-Jui Chen, Jiamin Xie, John H. L. Hansen |
FeaRLESS: Feature Refinement Loss for Ensembling Self-Supervised Learning Features in Robust End-to-end Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Wei-Ping Huang, Po-Chun Chen, Sung-Feng Huang, Hung-yi Lee |
Few Shot Cross-Lingual TTS Using Transferable Phoneme Embedding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Boram Lee, Naomi Yamaguchi, Cécile Fougeron |
Why is Korean lenis stop difficult to perceive for L2 Korean learners? |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xianchao Wu |
Deep Sparse Conformer for Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Wiebke Toussaint, Lauriane Gorce, Aaron Yi Ding |
Design Guidelines for Inclusive Speaker Verification Evaluation Datasets. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mohamed Maouche, Brij Mohan Lal Srivastava, Nathalie Vauquier, Aurélien Bellet, Marc Tommasi, Emmanuel Vincent 0001 |
Enhancing Speech Privacy with Slicing. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hyeonuk Nam, Seong-Hu Kim, Byeong-Yun Ko, Yong-Hwa Park |
Frequency Dynamic Convolution: Frequency-Adaptive Pattern Recognition for Sound Event Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L. Seltzer |
Streaming parallel transducer beam search with fast slow cascaded encoders. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ehsan Amid, Om Dipakbhai Thakkar, Arun Narayanan, Rajiv Mathews, Françoise Beaufays |
Extracting Targeted Training Data from ASR Models, and How to Mitigate It. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Rodrigo Schoburg Carrillo de Mira, Alexandros Haliassos, Stavros Petridis, Björn W. Schuller, Maja Pantic |
SVTS: Scalable Video-to-Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Gaoxiong Yi, Wei Xiao, Yiming Xiao, Babak Naderi, Sebastian Möller 0001, Wafaa Wardah, Gabriel Mittag, Ross Cutler, Zhuohuang Zhang, Donald S. Williamson, Fei Chen 0011, Fuzheng Yang, Shidong Shang |
ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yanjie Fu, Meng Ge, Haoran Yin, Xinyuan Qian, Longbiao Wang, Gaoyan Zhang, Jianwu Dang 0001 |
Iterative Sound Source Localization for Unknown Number of Sources. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xiaofeng Shu, Yanjie Chen, Chuxiang Shang, Yan Zhao 0010, Chengshuai Zhao, Yehang Zhu, Chuanzeng Huang, Yuxuan Wang 0002 |
Non-intrusive Speech Quality Assessment with a Multi-Task Learning based Subband Adaptive Attention Temporal Convolutional Neural Network. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Helin Wang, Dongchao Yang, Chao Weng, Jianwei Yu, Yuexian Zou |
Improving Target Sound Extraction with Timestamp Information. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Felix Meyer, Wilfried Michel, Mohammad Zeineldeen, Ralf Schlüter, Hermann Ney |
Automatic Learning of Subword Dependent Model Scales. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Naoaki Suzuki, Satoshi Nakamura |
Representing 'how you say' with 'what you say': English corpus of focused speech and text reflecting corresponding implications. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zijian Yang, Yingbo Gao, Alexander Gerstenberger, Jintao Jiang, Ralf Schlüter, Hermann Ney |
Self-Normalized Importance Sampling for Neural Language Modeling. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Marise Neijman, Femke Hof, Noelle Oosterom, Roland Pfau, Bertus van Rooy, Rob J. J. H. van Son, Michiel W. M. van den Brekel |
Compensation in Verbal and Nonverbal Communication after Total Laryngectomy. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Vinay Kothapally, Yong Xu 0004, Meng Yu 0003, Shi-Xiong Zhang, Dong Yu 0001 |
Joint Neural AEC and Beamforming with Double-Talk Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Boris Bergsma, Minhao Yang, Milos Cernak |
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Martin Radfar, Rohit Barnwal, Rupak Vignesh Swaminathan, Feng-Ju Chang, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris |
ConvRNN-T: Convolutional Augmented Recurrent Neural Network Transducers for Streaming Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ya-Hsin Chang, Yun-Nung Chen |
Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ayush Kumar, Vijit Malik, Jithendra Vepa |
Does Utterance entails Intent?: Evaluating Natural Language Inference Based Setup for Few-Shot Intent Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Samuel Hollands, Daniel Blackburn, Heidi Christensen |
Evaluating the Performance of State-of-the-Art ASR Systems on Non-Native English using Corpora with Extensive Language Background Variation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sri Karlapati, Penny Karanasou, Mateusz Lajszczak, Syed Ammar Abbas, Alexis Moinet, Peter Makarov, Ray Li, Arent van Korlaar, Simon Slangen, Thomas Drugman |
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tuan-Duy H. Nguyen, Duy Phung, Duy Tran-Cong Nguyen, Hieu Minh Tran, Manh Luong, Tin Duy Vo, Hung Hai Bui, Dinh Q. Phung, Dat Quoc Nguyen |
A Vietnamese-English Neural Machine Translation System. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | Heyang Xue, Xinsheng Wang, Yongmao Zhang, Lei Xie 0001, Pengcheng Zhu 0004, Mengxiao Bi |
Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kevin Meng, Seo-Hyun Lee, Farhad Goodarzy, Simon J. Vogrin, Mark J. Cook, Seong-Whan Lee, David B. Grayden |
Evidence of Onset and Sustained Neural Responses to Isolated Phonemes from Intracranial Recordings in a Voice-based Cursor Control Task. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yunhao Liang, Yanhua Long, Yijie Li, Jiaen Liang |
Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hanseok Ko, John H. L. Hansen (eds.) |
Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhiheng Ouyang, Miao Wang, Wei-Ping Zhu 0001 |
Small Footprint Neural Networks for Acoustic Direction of Arrival Estimation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
|
|