|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
No Growbag Graphs found.
|
|
|
Results
Found 18782 publication records. Showing 18782 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | Bahman Mirheidari, André Bittar, Nicholas Cummins, Johnny Downs, Helen L. Fisher, Heidi Christensen |
Automatic Detection of Expressed Emotion from Five-Minute Speech Samples: Challenges and Opportunities. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kuan-Po Huang, Yu-Kuan Fu, Yu Zhang 0033, Hung-yi Lee |
Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sebastião Quintas, Julie Mauclair, Virginie Woisard, Julien Pinquier |
Automatic Assessment of Speech Intelligibility using Consonant Similarity for Head and Neck Cancer. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xin Wang, Chuan Xie, Qiang Wu, Huayi Zhan, Ying Wu |
A Novel Phoneme-based Modeling for Text-independent Speaker Identification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Naokazu Uchida, Takeshi Homma, Makoto Iwayama, Yasuhiro Sogawa |
Reducing Offensive Replies in Open Domain Dialogue Systems. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shichao Hu, Bin Zhang, Jinhong Lu, Yiliang Jiang, Wucheng Wang, Lingcheng Kong, Weifeng Zhao, Tao Jiang |
WideResNet with Joint Representation Learning and Data Augmentation for Cover Song Identification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhongwei Teng, Quchen Fu, Jules White, Maria E. Powell, Douglas C. Schmidt |
SA-SASV: An End-to-End Spoof-Aggregated Spoofing-Aware Speaker Verification System. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Raphaël Olivier, Bhiksha Raj |
Recent improvements of ASR models in the face of adversarial attacks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dimitrios Stoidis, Andrea Cavallaro |
Generating gender-ambiguous voices for privacy-preserving speech recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Cal Peyser, W. Ronny Huang, Andrew Rosenberg, Tara N. Sainath, Michael Picheny, Kyunghyun Cho |
Towards Disentangled Speech Representations. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Murchana Baruah, Bonny Banerjee |
Speech Emotion Recognition via Generation using an Attention-based Variational Recurrent Neural Network. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao 0001, Yanmin Qian, Shinji Watanabe 0001 |
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Cheng Yu, Szu-Wei Fu, Tsun-An Hsieh, Yu Tsao 0001, Mirco Ravanelli |
OSSEM: one-shot speaker adaptive speech enhancement using meta learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Davis Nicmanis, Askars Salimbajevs |
Spoken Dialogue System for Call Centers with Expressive Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | Weidong Chen, Xiaofen Xing, Xiangmin Xu, Jianxin Pang, Lan Du |
SpeechFormer: A Hierarchical Efficient Framework Incorporating the Characteristics of Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Véronique Delvaux, Audrey Lavallée, Fanny Degouis, Xavier Saloppe, Jean-Louis Nandrino, Thierry Pham |
Telling self-defining memories: An acoustic study of natural emotional speech productions. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yan Li, Ying Chen 0015, Xinya Zhang, Yanyang Chen, Jiazheng Wang |
Effects of Language Contact on Vowel Nasalization in Wenzhou and Rugao Dialects. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Satwik Dutta, Sarah Anne Tao, Jacob C. Reyna, Rebecca Elizabeth Hacker, Dwight W. Irvin, Jay F. Buzhardt, John H. L. Hansen |
Challenges remain in Building ASR for Spontaneous Preschool Children Speech in Naturalistic Educational Environments. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Fangjun Kuang, Liyong Guo, Wei Kang 0006, Long Lin, Mingshuang Luo, Zengwei Yao, Daniel Povey |
Pruned RNN-T for fast, memory-efficient ASR training. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Youngsik Eom, Yeonghyeon Lee, Ji Sub Um, Hoi Rin Kim |
Anti-Spoofing Using Transfer Learning with Variational Information Bottleneck. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Marzena Zygis, Sarah Wesolek, Nina Hosseini-Kivanani, Manfred Krifka |
The Prosody of Cheering in Sport Events. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jumon Nozaki, Tatsuya Kawahara, Kenkichi Ishizuka, Taiichi Hashimoto |
End-to-end Speech-to-Punctuated-Text Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jiaming Cheng, Ruiyu Liang, Yue Xie, Li Zhao 0003, Björn W. Schuller, Jie Jia, Yiyuan Peng |
Cross-Layer Similarity Knowledge Distillation for Speech Enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yingying Gao, Junlan Feng, Chao Deng, Shilei Zhang |
Meta Auxiliary Learning for Low-resource Spoken Language Understanding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | João Vítor Menezes, Pouriya Amini Digehsara, Christoph Wagner, Marco Mütze, Michael Bärhold, Petr Schaffer, Dirk Plettemeier, Peter Birkholz |
Evaluation of different antenna types and positions in a stepped frequency continuous-wave radar-based silent speech interface. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Katerina Zmolíková, Hiroshi Sato, Tomohiro Nakatani |
Listen only to me! How well can target speech extraction handle false alarms? |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sukanya Sonowal, Anish Tamse |
Novel Augmentation Schemes for Device Robust Acoustic Scene Classification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhiyun Lu, Yongqiang Wang, Yu Zhang 0033, Wei Han, Zhehuai Chen, Parisa Haghani |
Unsupervised Data Selection via Discrete Speech Representation for ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Digvijay Ingle, Ayush Kumar, Krishnachaitanya Gogineni, Jithendra Vepa |
Real-Time Monitoring of Silences in Contact Center Conversations. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | Wangyou Zhang, Zhuo Chen 0006, Naoyuki Kanda, Shujie Liu 0001, Jinyu Li 0001, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei |
Separating Long-Form Speech with Group-wise Permutation Invariant Training. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sonal Joshi, Saurabh Kataria, Yiwen Shao, Piotr Zelasko, Jesús Villalba 0001, Sanjeev Khudanpur, Najim Dehak |
Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Longfei Yang, Wenqing Wei, Sheng Li 0010, Jiyi Li, Takahiro Shinozaki |
Augmented Adversarial Self-Supervised Learning for Early-Stage Alzheimer's Speech Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Beiming Cao, Kristin Teplansky, Nordine Sebkhi, Arpan Bhavsar, Omer T. Inan, Robin Samlan, Ted Mau, Jun Wang 0037 |
Data Augmentation for End-to-end Silent Speech Recognition for Laryngectomees. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zilu Guo, Xu Xu 0003, Zhongfu Ye |
Joint Optimization of the Module and Sign of the Spectral Real Part Based on CRN for Speech Denoising. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Badr M. Abdullah, Bernd Möbius, Dietrich Klakow |
Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Corentin Puffay, Jana Van Canneyt, Jonas Vanthornhout, Hugo Van hamme, Tom Francart |
Relating the fundamental frequency of speech with EEG using a dilated convolutional network. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mieszko Fras, Marcin Witkowski, Konrad Kowalczyk |
Convolutive Weighted Multichannel Wiener Filter Front-end for Distant Automatic Speech Recognition in Reverberant Multispeaker Scenarios. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jason Fong, Daniel Lyth, Gustav Eje Henter, Hao Tang, Simon King 0001 |
Speech Audio Corrector: using speech from non-target speakers for one-off correction of mispronunciations in grapheme-input text-to-speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Cong-Thanh Do, Mohan Li, Rama Doddipatla |
Multiple-hypothesis RNN-T Loss for Unsupervised Fine-tuning and Self-training of Neural Transducer. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jing Zhou, Changchun Bao |
Multi-source wideband DOA estimation method by frequency focusing and error weighting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Seonwoo Lee, Sunhee Kim, Minhwa Chung |
A Study on the Phonetic Inventory Development of Children with Cochlear Implants for 5 Years after Implantation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Martin Kocour, Katerina Zmolíková, Lucas Ondel, Jan Svec, Marc Delcroix, Tsubasa Ochiai, Lukás Burget, Jan Cernocký |
Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jiamin Xie, John H. L. Hansen |
DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Toshio Irino, Honoka Tamaru, Ayako Yamamoto |
Speech intelligibility of simulated hearing loss sounds and its prediction using the Gammachirp Envelope Similarity Index (GESI). |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Martin Lebourdais, Marie Tahon, Antoine Laurent, Sylvain Meignier |
Overlapped speech and gender detection with WavLM pre-trained features. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xianchao Wu |
Attention Enhanced Citrinet for Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Darshana Priyasad, Andi Partovi, Sridha Sridharan, Maryam Kashefpoor, Tharindu Fernando, Simon Denman, Clinton Fookes, Jia Tang, David Kaye |
Detecting Heart Failure Through Voice Analysis using Self-Supervised Mode-Based Memory Fusion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Han Lei, Ning Chen |
Audio-Visual Scene Classification Based on Multi-modal Graph Fusion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | M. K. Jayesh, Mukesh Sharma, Praneeth Vonteddu, Mahaboob Ali Basha Shaik, Sriram Ganapathy |
Transformer Networks for Non-Intrusive Speech Quality Prediction. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Evelina Bakhturina, Yang Zhang 0089, Boris Ginsburg |
Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text Normalization. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Siddhant Arora, Siddharth Dalmia, Xuankai Chang, Brian Yan, Alan W. Black, Shinji Watanabe 0001 |
Two-Pass Low Latency End-to-End Spoken Language Understanding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Arlo Faria, Adam Janin, Sidhi Adkoli, Korbinian Riedhammer |
Toward Zero Oracle Word Error Rate on the Switchboard Benchmark. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Liang Xu, Jing Wang 0037, Lizhong Wang, Sijun Bi, Jianqian Zhang, Qiuyue Ma |
Human Sound Classification based on Feature Fusion Method with Air and Bone Conducted Signal. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jihyun Lee, Gary Geunbae Lee |
SF-DST: Few-Shot Self-Feeding Reading Comprehension Dialogue State Tracking with Auxiliary Task. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Wei-Cheng Tseng, Wei-Tsung Kao, Hung-yi Lee |
Membership Inference Attacks Against Self-supervised Speech Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Lucas Goncalves, Carlos Busso |
Improving Speech Emotion Recognition Using Self-Supervised Learning with Domain-Specific Audiovisual Tasks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara |
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Apoorv Vyas, Wei-Ning Hsu, Michael Auli, Alexei Baevski |
On-demand compute reduction with stochastic wav2vec 2.0. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Moakala Tzudir, Priyankoo Sarmah, S. R. Mahadeva Prasanna |
Prosodic Information in Dialect Identification of a Tonal Language: The case of Ao. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Myunghun Jung, Hoi Rin Kim |
Asymmetric Proxy Loss for Multi-View Acoustic Word Embeddings. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Syed Ammar Abbas, Thomas Merritt, Alexis Moinet, Sri Karlapati, Ewa Muszynska, Simon Slangen, Elia Gatti, Thomas Drugman |
Expressive, Variable, and Controllable Duration Modelling in TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Pouriya Amini Digehsara, João Vítor Possamai de Menezes, Christoph Wagner, Michael Bärhold, Petr Schaffer, Dirk Plettemeier, Peter Birkholz |
A user-friendly headset for radar-based silent speech recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Rajeev Rajan, Ananya Ayasi |
Oktoechos Classification in Liturgical Music Using SBU-LSTM/GRU. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ayimnisagul Ablimit, Karen Scholz, Tanja Schultz |
Deep Learning Approaches for Detecting Alzheimer's Dementia from Conversational Speech of ILSE Study. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shaojin Ding, Phoenix Meadowlark, Yanzhang He, Lukasz Lew, Shivani Agrawal, Oleg Rybakov |
4-bit Conformer with Native Quantization Aware Training for Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Alexandre Bittar, Philip N. Garner |
Bayesian Recurrent Units and the Forward-Backward Algorithm. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nianzu Zheng, Liqun Deng, Wenyong Huang, Yu Ting Yeung, Baohua Xu, Yuanyuan Guo, Yasheng Wang, Xiao Chen 0012, Xin Jiang 0002, Qun Liu 0001 |
CoCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation Detection and Diagnosis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yu Wang 0105, Mark Cartwright, Juan Pablo Bello |
Active Few-Shot Learning for Sound Event Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Gasser Elbanna, Alice Biryukov, Neil Scheidwasser-Clow, Lara Orlandic, Pablo Mainar, Mikolaj Kegler, Pierre Beckmann, Milos Cernak |
Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Manh Luong, Viet-Anh Tran |
FlowVocoder: A small Footprint Neural Vocoder based Normalizing Flow for Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xiaofeng Ge, Jiangyu Han, Yanhua Long, Haixin Guan |
PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xun Gong 0005, Zhikai Zhou, Yanmin Qian |
Knowledge Transfer and Distillation from Autoregressive to Non-Autoregessive Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Manthan Thakker, Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang |
Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xiang Li 0105, Changhe Song, Xianhao Wei, Zhiyong Wu 0001, Jia Jia 0001, Helen Meng |
Towards Cross-speaker Reading Style Transfer on Audiobook Dataset. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zijiang Yang 0007, Xin Jing, Andreas Triantafyllopoulos, Meishu Song, Ilhan Aslan, Björn W. Schuller |
An Overview & Analysis of Sequence-to-Sequence Emotional Voice Conversion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hao Zhang, Ashutosh Pandey 0004, DeLiang Wang |
Attentive Recurrent Network for Low-Latency Active Noise Control. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Denis Ivanko, Dmitry Ryumin, Alexey M. Kashevnik, Alexandr Axyonov, Andrey Kitenko, Igor Lashkov, Alexey Karpov 0001 |
DAVIS: Driver's Audio-Visual Speech recognition. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | Tuomo Raitio, Petko Petkov, Jiangchuan Li, P. V. Muhammed Shifas, Andrea Davis, Yannis Stylianou |
Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Siqing Qin, Longbiao Wang, Sheng Li 0010, Yuqin Lin, Jianwu Dang 0001 |
Finer-grained Modeling units-based Meta-Learning for Low-resource Tibetan Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kenta Udagawa, Yuki Saito, Hiroshi Saruwatari |
Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Song Zhang, Ken Zheng, Xiaoxu Zhu, Baoxiang Li |
A polyphone BERT for Polyphone Disambiguation in Mandarin Chinese. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zixiu Wu, Rim Helaoui, Diego Reforgiato Recupero, Daniele Riboni |
Towards Automated Counselling Decision-Making: Remarks on Therapist Action Forecasting on the AnnoMI Dataset. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Naoki Makishima, Satoshi Suzuki, Atsushi Ando, Ryo Masumura |
Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jan Lehecka, Jan Svec, Ales Prazák, Josef Psutka |
Exploring Capabilities of Monolingual Audio Transformers using Large Datasets in Automatic Speech Recognition of Czech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Samik Sadhu, Hynek Hermansky |
Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Wo Jae Lee, Emanuele Coviello |
A Multimodal Strategy for Singing Language Identification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Binu Nisal Abeysinghe, Jesin James, Catherine I. Watson, Felix Marattukalam |
Visualising Model Training via Vowel Space for Text-To-Speech Systems. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Marvin Borsdorf, Kevin Scheck, Haizhou Li 0001, Tanja Schultz |
Blind Language Separation: Disentangling Multilingual Cocktail Party Voices by Language. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Arun Babu, Changhan Wang, Andros Tjandra, Kushal Lakhotia, Qiantong Xu, Naman Goyal, Kritika Singh, Patrick von Platen, Yatharth Saraf, Juan Pino 0001, Alexei Baevski, Alexis Conneau, Michael Auli |
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Karan Singla, Shahab Jalalvand, Yeon-Jun Kim, Ryan Price, Daniel Pressel, Srinivas Bangalore |
Seq-2-Seq based Refinement of ASR Output for Spoken Name Capture. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang 0001 |
Separate What You Describe: Language-Queried Audio Source Separation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jing Zhao, Haoyu Wang, Jinpeng Li, Shuzhou Chai, Guanbo Wang, Guoguo Chen, Wei-Qiang Zhang 0001 |
The THUEE System Description for the IARPA OpenASR21 Challenge. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dario Albesano, Jesús Andrés-Ferrer, Nicola Ferri, Puming Zhan |
On the Prediction Network Architecture in RNN-T for ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Haoyu Li, Junichi Yamagishi |
DDS: A new device-degraded speech dataset for speech enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Alexandra Antonova, Evelina Bakhturina, Boris Ginsburg |
Thutmose Tagger: Single-pass neural model for Inverse Text Normalization. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Pierre-Michel Bousquet, Mickael Rouvier, Jean-François Bonastre |
Reliability criterion based on learning-phase entropy for speaker recognition with neural network. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yooncheol Ju, Ilhwan Kim, Hongsun Yang, Ji-Hoon Kim, Byeongyeol Kim, Soumi Maiti, Shinji Watanabe 0001 |
TriniTTS: Pitch-controllable End-to-end TTS without External Aligner. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Weiqing Wang, Ming Li 0026, Qingjian Lin |
Online Target Speaker Voice Activity Detection for Speaker Diarization. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Laura Spinu, Ioana Vasilescu, Lori Lamel, Jason Lilley |
Voicing neutralization in Romanian fricatives across different speech styles. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zewang Zhang, Yibin Zheng, Xinhui Li, Li Lu |
WeSinger: Data-augmented Singing Voice Synthesis with Auxiliary Losses. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
|
|