FacetedDBLP

Publications at "INTERSPEECH" (http://dblp.L3S.de/Venues/INTERSPEECH)

URL (DBLP): http://dblp.uni-trier.de/db/conf/interspeech

Publication years (Num. hits)
2000 (923) 2001 (671) 2002 (679) 2003 (799) 2004 (776) 2005 (870) 2006 (660) 2007 (752) 2008 (764) 2009 (766) 2010 (782) 2011 (852) 2012 (678) 2013 (789) 2014 (641) 2015 (793) 2016 (819) 2017 (845) 2018 (792) 2019 (971) 2020 (1036) 2021 (1000) 2022 (1124)
Publication types (Num. hits)
inproceedings(18759) proceedings(23)
Venues (Conferences, Journals, ...)
INTERSPEECH(18782)

Results
Found 18782 publication records. Showing 18782 according to the selection in the facets
Bahman Mirheidari, André Bittar, Nicholas Cummins, Johnny Downs, Helen L. Fisher, Heidi Christensen: Automatic Detection of Expressed Emotion from Five-Minute Speech Samples: Challenges and Opportunities. INTERSPEECH 2022
Kuan-Po Huang, Yu-Kuan Fu, Yu Zhang 0033, Hung-yi Lee: Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation. INTERSPEECH 2022
Sebastião Quintas, Julie Mauclair, Virginie Woisard, Julien Pinquier: Automatic Assessment of Speech Intelligibility using Consonant Similarity for Head and Neck Cancer. INTERSPEECH 2022
Xin Wang, Chuan Xie, Qiang Wu, Huayi Zhan, Ying Wu: A Novel Phoneme-based Modeling for Text-independent Speaker Identification. INTERSPEECH 2022
Naokazu Uchida, Takeshi Homma, Makoto Iwayama, Yasuhiro Sogawa: Reducing Offensive Replies in Open Domain Dialogue Systems. INTERSPEECH 2022
Shichao Hu, Bin Zhang, Jinhong Lu, Yiliang Jiang, Wucheng Wang, Lingcheng Kong, Weifeng Zhao, Tao Jiang: WideResNet with Joint Representation Learning and Data Augmentation for Cover Song Identification. INTERSPEECH 2022
Zhongwei Teng, Quchen Fu, Jules White, Maria E. Powell, Douglas C. Schmidt: SA-SASV: An End-to-End Spoof-Aggregated Spoofing-Aware Speaker Verification System. INTERSPEECH 2022
Raphaël Olivier, Bhiksha Raj: Recent improvements of ASR models in the face of adversarial attacks. INTERSPEECH 2022
Dimitrios Stoidis, Andrea Cavallaro: Generating gender-ambiguous voices for privacy-preserving speech recognition. INTERSPEECH 2022
Cal Peyser, W. Ronny Huang, Andrew Rosenberg, Tara N. Sainath, Michael Picheny, Kyunghyun Cho: Towards Disentangled Speech Representations. INTERSPEECH 2022
Murchana Baruah, Bonny Banerjee: Speech Emotion Recognition via Generation using an Attention-based Variational Recurrent Neural Network. INTERSPEECH 2022
Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao 0001, Yanmin Qian, Shinji Watanabe 0001: ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding. INTERSPEECH 2022
Cheng Yu, Szu-Wei Fu, Tsun-An Hsieh, Yu Tsao 0001, Mirco Ravanelli: OSSEM: one-shot speaker adaptive speech enhancement using meta learning. INTERSPEECH 2022
Davis Nicmanis, Askars Salimbajevs: Spoken Dialogue System for Call Centers with Expressive Speech Synthesis. INTERSPEECH 2022
Weidong Chen, Xiaofen Xing, Xiangmin Xu, Jianxin Pang, Lan Du: SpeechFormer: A Hierarchical Efficient Framework Incorporating the Characteristics of Speech. INTERSPEECH 2022
Véronique Delvaux, Audrey Lavallée, Fanny Degouis, Xavier Saloppe, Jean-Louis Nandrino, Thierry Pham: Telling self-defining memories: An acoustic study of natural emotional speech productions. INTERSPEECH 2022
Yan Li, Ying Chen 0015, Xinya Zhang, Yanyang Chen, Jiazheng Wang: Effects of Language Contact on Vowel Nasalization in Wenzhou and Rugao Dialects. INTERSPEECH 2022
Satwik Dutta, Sarah Anne Tao, Jacob C. Reyna, Rebecca Elizabeth Hacker, Dwight W. Irvin, Jay F. Buzhardt, John H. L. Hansen: Challenges remain in Building ASR for Spontaneous Preschool Children Speech in Naturalistic Educational Environments. INTERSPEECH 2022
Fangjun Kuang, Liyong Guo, Wei Kang 0006, Long Lin, Mingshuang Luo, Zengwei Yao, Daniel Povey: Pruned RNN-T for fast, memory-efficient ASR training. INTERSPEECH 2022
Youngsik Eom, Yeonghyeon Lee, Ji Sub Um, Hoi Rin Kim: Anti-Spoofing Using Transfer Learning with Variational Information Bottleneck. INTERSPEECH 2022
Marzena Zygis, Sarah Wesolek, Nina Hosseini-Kivanani, Manfred Krifka: The Prosody of Cheering in Sport Events. INTERSPEECH 2022
Jumon Nozaki, Tatsuya Kawahara, Kenkichi Ishizuka, Taiichi Hashimoto: End-to-end Speech-to-Punctuated-Text Recognition. INTERSPEECH 2022
Jiaming Cheng, Ruiyu Liang, Yue Xie, Li Zhao 0003, Björn W. Schuller, Jie Jia, Yiyuan Peng: Cross-Layer Similarity Knowledge Distillation for Speech Enhancement. INTERSPEECH 2022
Yingying Gao, Junlan Feng, Chao Deng, Shilei Zhang: Meta Auxiliary Learning for Low-resource Spoken Language Understanding. INTERSPEECH 2022
João Vítor Menezes, Pouriya Amini Digehsara, Christoph Wagner, Marco Mütze, Michael Bärhold, Petr Schaffer, Dirk Plettemeier, Peter Birkholz: Evaluation of different antenna types and positions in a stepped frequency continuous-wave radar-based silent speech interface. INTERSPEECH 2022
Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Katerina Zmolíková, Hiroshi Sato, Tomohiro Nakatani: Listen only to me! How well can target speech extraction handle false alarms? INTERSPEECH 2022
Sukanya Sonowal, Anish Tamse: Novel Augmentation Schemes for Device Robust Acoustic Scene Classification. INTERSPEECH 2022
Zhiyun Lu, Yongqiang Wang, Yu Zhang 0033, Wei Han, Zhehuai Chen, Parisa Haghani: Unsupervised Data Selection via Discrete Speech Representation for ASR. INTERSPEECH 2022
Digvijay Ingle, Ayush Kumar, Krishnachaitanya Gogineni, Jithendra Vepa: Real-Time Monitoring of Silences in Contact Center Conversations. INTERSPEECH 2022
Wangyou Zhang, Zhuo Chen 0006, Naoyuki Kanda, Shujie Liu 0001, Jinyu Li 0001, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei: Separating Long-Form Speech with Group-wise Permutation Invariant Training. INTERSPEECH 2022
Sonal Joshi, Saurabh Kataria, Yiwen Shao, Piotr Zelasko, Jesús Villalba 0001, Sanjeev Khudanpur, Najim Dehak: Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser. INTERSPEECH 2022
Longfei Yang, Wenqing Wei, Sheng Li 0010, Jiyi Li, Takahiro Shinozaki: Augmented Adversarial Self-Supervised Learning for Early-Stage Alzheimer's Speech Detection. INTERSPEECH 2022
Beiming Cao, Kristin Teplansky, Nordine Sebkhi, Arpan Bhavsar, Omer T. Inan, Robin Samlan, Ted Mau, Jun Wang 0037: Data Augmentation for End-to-end Silent Speech Recognition for Laryngectomees. INTERSPEECH 2022
Zilu Guo, Xu Xu 0003, Zhongfu Ye: Joint Optimization of the Module and Sign of the Spectral Real Part Based on CRN for Speech Denoising. INTERSPEECH 2022
Badr M. Abdullah, Bernd Möbius, Dietrich Klakow: Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings. INTERSPEECH 2022
Corentin Puffay, Jana Van Canneyt, Jonas Vanthornhout, Hugo Van hamme, Tom Francart: Relating the fundamental frequency of speech with EEG using a dilated convolutional network. INTERSPEECH 2022
Mieszko Fras, Marcin Witkowski, Konrad Kowalczyk: Convolutive Weighted Multichannel Wiener Filter Front-end for Distant Automatic Speech Recognition in Reverberant Multispeaker Scenarios. INTERSPEECH 2022
Jason Fong, Daniel Lyth, Gustav Eje Henter, Hao Tang, Simon King 0001: Speech Audio Corrector: using speech from non-target speakers for one-off correction of mispronunciations in grapheme-input text-to-speech. INTERSPEECH 2022
Cong-Thanh Do, Mohan Li, Rama Doddipatla: Multiple-hypothesis RNN-T Loss for Unsupervised Fine-tuning and Self-training of Neural Transducer. INTERSPEECH 2022
Jing Zhou, Changchun Bao: Multi-source wideband DOA estimation method by frequency focusing and error weighting. INTERSPEECH 2022
Seonwoo Lee, Sunhee Kim, Minhwa Chung: A Study on the Phonetic Inventory Development of Children with Cochlear Implants for 5 Years after Implantation. INTERSPEECH 2022
Martin Kocour, Katerina Zmolíková, Lucas Ondel, Jan Svec, Marc Delcroix, Tsubasa Ochiai, Lukás Burget, Jan Cernocký: Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model. INTERSPEECH 2022
Jiamin Xie, John H. L. Hansen: DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition. INTERSPEECH 2022
Toshio Irino, Honoka Tamaru, Ayako Yamamoto: Speech intelligibility of simulated hearing loss sounds and its prediction using the Gammachirp Envelope Similarity Index (GESI). INTERSPEECH 2022
Martin Lebourdais, Marie Tahon, Antoine Laurent, Sylvain Meignier: Overlapped speech and gender detection with WavLM pre-trained features. INTERSPEECH 2022
Xianchao Wu: Attention Enhanced Citrinet for Speech Recognition. INTERSPEECH 2022
Darshana Priyasad, Andi Partovi, Sridha Sridharan, Maryam Kashefpoor, Tharindu Fernando, Simon Denman, Clinton Fookes, Jia Tang, David Kaye: Detecting Heart Failure Through Voice Analysis using Self-Supervised Mode-Based Memory Fusion. INTERSPEECH 2022
Han Lei, Ning Chen: Audio-Visual Scene Classification Based on Multi-modal Graph Fusion. INTERSPEECH 2022
M. K. Jayesh, Mukesh Sharma, Praneeth Vonteddu, Mahaboob Ali Basha Shaik, Sriram Ganapathy: Transformer Networks for Non-Intrusive Speech Quality Prediction. INTERSPEECH 2022
Evelina Bakhturina, Yang Zhang 0089, Boris Ginsburg: Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text Normalization. INTERSPEECH 2022
Siddhant Arora, Siddharth Dalmia, Xuankai Chang, Brian Yan, Alan W. Black, Shinji Watanabe 0001: Two-Pass Low Latency End-to-End Spoken Language Understanding. INTERSPEECH 2022
Arlo Faria, Adam Janin, Sidhi Adkoli, Korbinian Riedhammer: Toward Zero Oracle Word Error Rate on the Switchboard Benchmark. INTERSPEECH 2022
Liang Xu, Jing Wang 0037, Lizhong Wang, Sijun Bi, Jianqian Zhang, Qiuyue Ma: Human Sound Classification based on Feature Fusion Method with Air and Bone Conducted Signal. INTERSPEECH 2022
Jihyun Lee, Gary Geunbae Lee: SF-DST: Few-Shot Self-Feeding Reading Comprehension Dialogue State Tracking with Auxiliary Task. INTERSPEECH 2022
Wei-Cheng Tseng, Wei-Tsung Kao, Hung-yi Lee: Membership Inference Attacks Against Self-supervised Speech Models. INTERSPEECH 2022
Lucas Goncalves, Carlos Busso: Improving Speech Emotion Recognition Using Self-Supervised Learning with Domain-Specific Audiovisual Tasks. INTERSPEECH 2022
Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara: Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM. INTERSPEECH 2022
Apoorv Vyas, Wei-Ning Hsu, Michael Auli, Alexei Baevski: On-demand compute reduction with stochastic wav2vec 2.0. INTERSPEECH 2022
Moakala Tzudir, Priyankoo Sarmah, S. R. Mahadeva Prasanna: Prosodic Information in Dialect Identification of a Tonal Language: The case of Ao. INTERSPEECH 2022
Myunghun Jung, Hoi Rin Kim: Asymmetric Proxy Loss for Multi-View Acoustic Word Embeddings. INTERSPEECH 2022
Syed Ammar Abbas, Thomas Merritt, Alexis Moinet, Sri Karlapati, Ewa Muszynska, Simon Slangen, Elia Gatti, Thomas Drugman: Expressive, Variable, and Controllable Duration Modelling in TTS. INTERSPEECH 2022
Pouriya Amini Digehsara, João Vítor Possamai de Menezes, Christoph Wagner, Michael Bärhold, Petr Schaffer, Dirk Plettemeier, Peter Birkholz: A user-friendly headset for radar-based silent speech recognition. INTERSPEECH 2022
Rajeev Rajan, Ananya Ayasi: Oktoechos Classification in Liturgical Music Using SBU-LSTM/GRU. INTERSPEECH 2022
Ayimnisagul Ablimit, Karen Scholz, Tanja Schultz: Deep Learning Approaches for Detecting Alzheimer's Dementia from Conversational Speech of ILSE Study. INTERSPEECH 2022
Shaojin Ding, Phoenix Meadowlark, Yanzhang He, Lukasz Lew, Shivani Agrawal, Oleg Rybakov: 4-bit Conformer with Native Quantization Aware Training for Speech Recognition. INTERSPEECH 2022
Alexandre Bittar, Philip N. Garner: Bayesian Recurrent Units and the Forward-Backward Algorithm. INTERSPEECH 2022
Nianzu Zheng, Liqun Deng, Wenyong Huang, Yu Ting Yeung, Baohua Xu, Yuanyuan Guo, Yasheng Wang, Xiao Chen 0012, Xin Jiang 0002, Qun Liu 0001: CoCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation Detection and Diagnosis. INTERSPEECH 2022
Yu Wang 0105, Mark Cartwright, Juan Pablo Bello: Active Few-Shot Learning for Sound Event Detection. INTERSPEECH 2022
Gasser Elbanna, Alice Biryukov, Neil Scheidwasser-Clow, Lara Orlandic, Pablo Mainar, Mikolaj Kegler, Pierre Beckmann, Milos Cernak: Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load. INTERSPEECH 2022
Manh Luong, Viet-Anh Tran: FlowVocoder: A small Footprint Neural Vocoder based Normalizing Flow for Speech Synthesis. INTERSPEECH 2022
Xiaofeng Ge, Jiangyu Han, Yanhua Long, Haixin Guan: PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement. INTERSPEECH 2022
Xun Gong 0005, Zhikai Zhou, Yanmin Qian: Knowledge Transfer and Distillation from Autoregressive to Non-Autoregessive Speech Recognition. INTERSPEECH 2022
Manthan Thakker, Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang: Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation. INTERSPEECH 2022
Xiang Li 0105, Changhe Song, Xianhao Wei, Zhiyong Wu 0001, Jia Jia 0001, Helen Meng: Towards Cross-speaker Reading Style Transfer on Audiobook Dataset. INTERSPEECH 2022
Zijiang Yang 0007, Xin Jing, Andreas Triantafyllopoulos, Meishu Song, Ilhan Aslan, Björn W. Schuller: An Overview & Analysis of Sequence-to-Sequence Emotional Voice Conversion. INTERSPEECH 2022
Hao Zhang, Ashutosh Pandey 0004, DeLiang Wang: Attentive Recurrent Network for Low-Latency Active Noise Control. INTERSPEECH 2022
Denis Ivanko, Dmitry Ryumin, Alexey M. Kashevnik, Alexandr Axyonov, Andrey Kitenko, Igor Lashkov, Alexey Karpov 0001: DAVIS: Driver's Audio-Visual Speech recognition. INTERSPEECH 2022
Tuomo Raitio, Petko Petkov, Jiangchuan Li, P. V. Muhammed Shifas, Andrea Davis, Yannis Stylianou: Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise. INTERSPEECH 2022
Siqing Qin, Longbiao Wang, Sheng Li 0010, Yuqin Lin, Jianwu Dang 0001: Finer-grained Modeling units-based Meta-Learning for Low-resource Tibetan Speech Recognition. INTERSPEECH 2022
Kenta Udagawa, Yuki Saito, Hiroshi Saruwatari: Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS. INTERSPEECH 2022
Song Zhang, Ken Zheng, Xiaoxu Zhu, Baoxiang Li: A polyphone BERT for Polyphone Disambiguation in Mandarin Chinese. INTERSPEECH 2022
Zixiu Wu, Rim Helaoui, Diego Reforgiato Recupero, Daniele Riboni: Towards Automated Counselling Decision-Making: Remarks on Therapist Action Forecasting on the AnnoMI Dataset. INTERSPEECH 2022
Naoki Makishima, Satoshi Suzuki, Atsushi Ando, Ryo Masumura: Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data. INTERSPEECH 2022
Jan Lehecka, Jan Svec, Ales Prazák, Josef Psutka: Exploring Capabilities of Monolingual Audio Transformers using Large Datasets in Automatic Speech Recognition of Czech. INTERSPEECH 2022
Samik Sadhu, Hynek Hermansky: Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech. INTERSPEECH 2022
Wo Jae Lee, Emanuele Coviello: A Multimodal Strategy for Singing Language Identification. INTERSPEECH 2022
Binu Nisal Abeysinghe, Jesin James, Catherine I. Watson, Felix Marattukalam: Visualising Model Training via Vowel Space for Text-To-Speech Systems. INTERSPEECH 2022
Marvin Borsdorf, Kevin Scheck, Haizhou Li 0001, Tanja Schultz: Blind Language Separation: Disentangling Multilingual Cocktail Party Voices by Language. INTERSPEECH 2022
Arun Babu, Changhan Wang, Andros Tjandra, Kushal Lakhotia, Qiantong Xu, Naman Goyal, Kritika Singh, Patrick von Platen, Yatharth Saraf, Juan Pino 0001, Alexei Baevski, Alexis Conneau, Michael Auli: XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale. INTERSPEECH 2022
Karan Singla, Shahab Jalalvand, Yeon-Jun Kim, Ryan Price, Daniel Pressel, Srinivas Bangalore: Seq-2-Seq based Refinement of ASR Output for Spoken Name Capture. INTERSPEECH 2022
Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang 0001: Separate What You Describe: Language-Queried Audio Source Separation. INTERSPEECH 2022
Jing Zhao, Haoyu Wang, Jinpeng Li, Shuzhou Chai, Guanbo Wang, Guoguo Chen, Wei-Qiang Zhang 0001: The THUEE System Description for the IARPA OpenASR21 Challenge. INTERSPEECH 2022
Dario Albesano, Jesús Andrés-Ferrer, Nicola Ferri, Puming Zhan: On the Prediction Network Architecture in RNN-T for ASR. INTERSPEECH 2022
Haoyu Li, Junichi Yamagishi: DDS: A new device-degraded speech dataset for speech enhancement. INTERSPEECH 2022
Alexandra Antonova, Evelina Bakhturina, Boris Ginsburg: Thutmose Tagger: Single-pass neural model for Inverse Text Normalization. INTERSPEECH 2022
Pierre-Michel Bousquet, Mickael Rouvier, Jean-François Bonastre: Reliability criterion based on learning-phase entropy for speaker recognition with neural network. INTERSPEECH 2022
Yooncheol Ju, Ilhwan Kim, Hongsun Yang, Ji-Hoon Kim, Byeongyeol Kim, Soumi Maiti, Shinji Watanabe 0001: TriniTTS: Pitch-controllable End-to-end TTS without External Aligner. INTERSPEECH 2022
Weiqing Wang, Ming Li 0026, Qingjian Lin: Online Target Speaker Voice Activity Detection for Speaker Diarization. INTERSPEECH 2022
Laura Spinu, Ioana Vasilescu, Lori Lamel, Jason Lilley: Voicing neutralization in Romanian fricatives across different speech styles. INTERSPEECH 2022
Zewang Zhang, Yibin Zheng, Xinhui Li, Li Lu: WeSinger: Data-augmented Singing Voice Synthesis with Auxiliary Losses. INTERSPEECH 2022
Displaying results #701 - #800 of 18782 (100 per page)
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
Data released under the ODC-BY 1.0 license.