|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
No Growbag Graphs found.
|
|
|
Results
Found 18782 publication records. Showing 18782 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | Salah Zaiem, Titouan Parcollet, Slim Essid |
Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hanbin Bae, Young-Sun Joo |
Enhancement of Pitch Controllability using Timbre-Preserving Pitch Augmentation in FastPitch. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sreyan Ghosh, Sonal Kumar, Yaman Kumar 0001, Rajiv Ratn Shah, Srinivasan Umesh |
Span Classification with Structured Information for Disfluency Detection in Spoken Utterances. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Minho Jin, Chelsea Ju, Zeya Chen, Yi-Chieh Liu, Jasha Droppo, Andreas Stolcke |
Adversarial Reweighting for Speaker Verification Fairness. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Michel Cardoso Meneses, Rafael Bérgamo Holanda, Luis Vasconcelos Peres, Gabriela Dantas Rocha |
SiDi KWS: A Large-Scale Multilingual Dataset for Keyword Spotting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Haodong Zhao, Wei Du, Junjie Guo, Gongshen Liu |
A Universal Identity Backdoor Attack against Speaker Verification based on Siamese Network. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nan Li, Meng Ge, Longbiao Wang, Masashi Unoki, Sheng Li 0010, Jianwu Dang 0001 |
Global Signal-to-noise Ratio Estimation Based on Multi-subband Processing Using Convolutional Neural Network. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Vishal Sunder, Eric Fosler-Lussier, Samuel Thomas 0001, Hong-Kwang Kuo, Brian Kingsbury |
Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kay Peterson, Audrey Tong, Yan Yu |
OpenASR21: The Second Open Challenge for Automatic Speech Recognition of Low-Resource Languages. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Danilo de Oliveira, Tal Peer, Timo Gerkmann |
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ngoc-Quan Pham, Alexander Waibel, Jan Niehues |
Adaptive multilingual speech recognition with pretrained models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chao Zhang, Bo Li 0028, Tara N. Sainath, Trevor Strohman, Sepand Mavandadi, Shuo-Yiin Chang, Parisa Haghani |
Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Franziska Braun, Markus Förstel, Bastian Oppermann, Andreas Erzigkeit, Hartmut Lehfeld, Thomas Hillemacher, Korbinian Riedhammer |
Automated Evaluation of Standardized Dementia Screening Tests. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yiwen Shao, Jesús Villalba 0001, Sonal Joshi, Saurabh Kataria, Sanjeev Khudanpur, Najim Dehak |
Chunking Defense for Adversarial Attacks on ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Eunkyung Yoo, Hyeonseop Song, Taehyeong Kim, Chul Lee |
Online Learning of Open-set Speaker Identification by Active User-registration. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tao Liu, Shuai Fan 0005, Xu Xiang, Hongbo Song, Shaoxiong Lin, Jiaqi Sun, Tianyuan Han, Siyuan Chen, Binwei Yao, Sen Liu, Yifei Wu, Yanmin Qian, Kai Yu 0004 |
MSDWild: Multi-modal Speaker Diarization Dataset in the Wild. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shuai Zhang 0014, Jiangyan Yi, Zhengkun Tian, Jianhua Tao 0001, Yu Ting Yeung, Liqun Deng |
reducing multilingual context confusion for end-to-end code-switching automatic speech recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nik Vaessen, David A. van Leeuwen |
Training speaker recognition systems with limited data. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hao Tan, Junjian Zhang, Huan Zhang, Le Wang 0008, Yaguan Qian, Zhaoquan Gu |
NRI-FGSM: An Efficient Transferable Adversarial Attack for Speaker Recognition Systems. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yutaro Sanada, Takumi Nakagawa, Yuichiro Wada, Kosaku Takanashi, Yuhui Zhang, Kiichi Tokuyama, Takafumi Kanamori, Tomonori Yamada |
Deep Self-Supervised Learning of Speech Denoising from Noisy Speeches. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana |
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ruida Li, Shuo Fang, Chenguang Ma, Liang Li |
Adaptive Rectangle Loss for Speaker Verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhan Zhang, Yuehai Wang, Jianyi Yang |
End-to-end Mispronunciation Detection with Simulated Error Distance. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ye Wang, Baishun Ling, Yanmeng Wang, Junhao Xue, Shaojun Wang, Jing Xiao 0006 |
Adversarial Knowledge Distillation For Robust Spoken Language Understanding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ryo Masumura, Yoshihiro Yamazaki, Saki Mizuno, Naoki Makishima, Mana Ihori, Mihiro Uchida, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Shota Orihashi, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando |
End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Long Chen, Yixiong Meng, Venkatesh Ravichandran, Andreas Stolcke |
Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sarthak Yadav, Neil Zeghidour |
Learning neural audio features without supervision. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jieun Song, Hae-Sung Jeon, Jieun Kiaer |
Use of prosodic and lexical cues for disambiguating wh-words in Korean. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zeyang Song, Qi Liu, Qu Yang, Haizhou Li 0001 |
Knowledge distillation for In-memory keyword spotting model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xian Li, Xiaofei Li |
ATST: Audio Representation Learning with Teacher-Student Transformer. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Takafumi Moriya, Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Takahiro Shinozaki |
Streaming Target-Speaker ASR with Neural Transducer. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Aku Rouhe, Anja Virkkunen, Juho Leinonen 0002, Mikko Kurimo |
Low Resource Comparison of Attention-based and Hybrid ASR Exploiting wav2vec 2.0. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Guangzhi Sun, Chao Zhang 0031, Philip C. Woodland |
Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yanyan Yue, Jun Du, Mao-Kui He, Yu Ting Yeung, Renyu Wang |
Online Speaker Diarization with Core Samples Selection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mingqiong Luo |
Mandarin nasal place assimilation revisited: an acoustic study. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Benjamin O'Brien, Christine Meunier, Alain Ghio |
Evaluating the effects of modified speech on perceptual speaker identification performance. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yeonjin Cho, Sara Ng, Trang Tran 0001, Mari Ostendorf |
Leveraging Prosody for Punctuation Prediction of Spontaneous Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Neha Reddy, Yoonjeong Lee, Zhaoyan Zhang, Dinesh K. Chhetri |
Optimal thyroplasty implant shape and stiffness for treatment of acute unilateral vocal fold paralysis: Evidence from a canine in vivo phonation model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jenthe Thienpondt, Kris Demuynck |
Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | A. Arunkumar, Srinivasan Umesh |
Joint Encoder-Decoder Self-Supervised Pre-training for ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Puneet Mathur, Franck Dernoncourt, Quan Hung Tran, Jiuxiang Gu, Ani Nenkova, Vlad I. Morariu, Rajiv Jain, Dinesh Manocha |
DocLayoutTTS: Dataset and Baselines for Layout-informed Document-level Neural Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Qiongqiong Wang, Kong Aik Lee, Tianchi Liu 0004 |
Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA? |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka |
End-to-End Spontaneous Speech Recognition Using Disfluency Labeling. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Byeonggeun Kim, Seunghan Yang, Jangho Kim, Hyunsin Park, Juntae Lee, Simyung Chang |
Domain Generalization with Relaxed Instance Frequency-wise Normalization for Multi-device Acoustic Scene Classification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Debarpan Bhattacharya, Debottam Dutta, Neeraj Kumar Sharma 0001, Srikanth Raj Chetupalli, Pravin Mote, Sriram Ganapathy, Chandrakiran C, Sahiti Nori, Suhail K. K, Sadhana Gonuguntla, Murali Alagesan |
Coswara: A website application enabling COVID-19 screening by analysing respiratory sound samples and health symptoms. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | Ruizhe Cao, Sherif Abdulatif, Bin Yang 0009 |
CMGAN: Conformer-based Metric GAN for Speech Enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Danni Liu, Changhan Wang, Hongyu Gong, Xutai Ma, Yun Tang 0002, Juan Miguel Pino |
From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yizhou Wang, Rikke L. Bundgaard-Nielsen, Brett Baker, Olga Maxwell |
Native phonotactic interference in L2 vowel processing: Mouse-tracking reveals cognitive conflicts during identification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Carolina Lins Machado, Volker Dellwo, Lei He 0021 |
Idiosyncratic lingual articulation of American English /æ/ and /ɑ/ using network analysis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Cassia Valentini-Botinhao, Manuel Sam Ribeiro, Oliver Watts, Korin Richmond, Gustav Eje Henter |
Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jisi Zhang, Catalin Zorila, Rama Doddipatla, Jon Barker |
On monoaural speech enhancement for automatic recognition of real noisy speech using mixture invariant training. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chiori Hori, Takaaki Hori, Jonathan Le Roux |
Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dongchao Yang, Helin Wang, Zhongjie Ye, Yuexian Zou, Wenwu Wang 0001 |
RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yogesh Virkar, Marcello Federico, Robert Enyedi, Roberto Barra-Chicote |
Prosodic alignment for off-screen automatic dubbing. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sathvik Udupa, Aravind Illa, Prasanta Kumar Ghosh |
Streaming model for Acoustic to Articulatory Inversion with transformer networks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Werner van der Merwe, Herman Kamper, Johan Adam du Preez |
A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hyun-Wook Yoon, Ohsung Kwon, Hoyeon Lee, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim, Min-Jae Hwang |
Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ronit Damania, Christopher Homan, Emily Prud'hommeaux |
Combining Simple but Novel Data Augmentation Methods for Improving Conformer ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber |
BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Gordon Rennie, Olga Perepelkina, Alessandro Vinciarelli |
Which Model is Best: Comparing Methods and Metrics for Automatic Laughter Detection in a Naturalistic Conversational Dataset. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hoang Thi Thu Uyen, Nguyen Anh Tu, Ta Duc Huy |
Vietnamese Capitalization and Punctuation Recovery Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Weixin Meng, Chengshi Zheng, Xiaodong Li 0002 |
Fully Automatic Balance between Directivity Factor and White Noise Gain for Large-scale Microphone Arrays in Diffuse Noise Fields. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sebastian Peter Bayerl, Dominik Wagner, Elmar Nöth, Korbinian Riedhammer |
Detecting Dysfluencies in Stuttering Therapy Using wav2vec 2.0. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chenyu Yang, Yu Wang |
Robust End-to-end Speaker Diarization with Generic Neural Clustering. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jinchao Li, Shuai Wang, Yang Chao, Xunying Liu, Helen Meng |
Context-aware Multimodal Fusion for Emotion Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Marc-Antoine Georges, Jean-Luc Schwartz, Thomas Hueber |
Self-supervised speech unit discovery from articulatory and acoustic features using VQ-VAE. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jirí Martínek, Christophe Cerisara, Pavel Král, Ladislav Lenc, Josef Baloun |
Weak supervision for Question Type Detection with large language models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yusuke Shinohara, Shinji Watanabe 0001 |
Minimum latency training of sequence transducers for streaming end-to-end speech recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kelvin Tran, Lingfeng Xu, Gabriela Stegmann, Julie Liss, Visar Berisha, Rene Utianski |
Investigating the Impact of Speech Compression on the Acoustics of Dysarthric Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Christin Jose, Joe Wang, Grant P. Strimel, Mohammad Omar Khursheed, Yuriy Mishchenko, Brian Kulis |
Latency Control for Keyword Spotting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Muqiao Yang, Joseph Konan, David Bick, Anurag Kumar 0003, Shinji Watanabe 0001, Bhiksha Raj |
Improving Speech Enhancement through Fine-Grained Speech Characteristics. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ruibin Yuan, Yuxuan Wu, Jacob Li, Jaxter Kim |
DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Lev Finkelstein, Heiga Zen, Norman Casagrande, Chun-an Chan, Ye Jia, Tom Kenter, Alexey Petelin, Jonathan Shen, Vincent Wan, Yu Zhang 0033, Yonghui Wu, Rob Clark |
Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Haaris Mehmood, Agnieszka Dobrowolska, Karthikeyan Saravanan, Mete Ozay |
FedNST: Federated Noisy Student Training for Automatic Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Junyong Hao, Shunzhou Ye, Cheng Lu, Fei Dong, Jingang Liu, Dong Pi |
Soft-label Learn for No-Intrusive Speech Quality Assessment. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zehui Yang, Yifan Chen, Lei Luo, Runyan Yang, Lingxuan Ye, Gaofeng Cheng, Ji Xu, Yaohui Jin, Qingqing Zhang, Pengyuan Zhang, Lei Xie, Yonghong Yan 0002 |
Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mostafa Sadeghi, Paul Magron |
A Sparsity-promoting Dictionary Model for Variational Autoencoders. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mengnan He, Tingwei Guo, Zhenxing Lu, Ruixiong Zhang, Caixia Gong |
Improving GAN-based vocoder for fast and high-quality speech synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zalan Borsos, Matthew Sharifi, Marco Tagliasacchi |
SpeechPainter: Text-conditioned Speech Inpainting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shinnosuke Takamichi, Wataru Nakata, Naoko Tanji, Hiroshi Saruwatari |
J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jeong-Hwan Choi, Joon-Young Yang, Ye-Rin Jeoung, Joon-Hyuk Chang |
Improved CNN-Transformer using Broadcasted Residual Learning for Text-Independent Speaker Verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Guodong Ma, Pengfei Hu 0004, Nurmemet Yolwas, Shen Huang, Hao Huang |
PM-MMUT: Boosted Phone-mask Data Augmentation using Multi-Modeling Unit Training for Phonetic-Reduction-Robust E2E Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yeonghyeon Lee, Kangwook Jang, Jahyun Goo, Youngmoon Jung, Hoi Rin Kim |
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ke Hu, Tara N. Sainath, Yanzhang He, Rohit Prabhavalkar, Trevor Strohman, Sepand Mavandadi, Weiran Wang |
Improving Deliberation by Text-Only and Semi-Supervised Training. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | L. L. Chamara Kasun, Chung Soo Ahn, Jagath C. Rajapakse, Zhiping Lin 0001, Guang-Bin Huang |
Discriminative Adversarial Learning for Speaker Independent Emotion Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Takeru Gorai, Daisuke Saito, Nobuaki Minematsu |
Text-to-speech synthesis using spectral modeling based on non-negative autoencoder. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shuai Guo, Jiatong Shi, Tao Qian, Shinji Watanabe 0001, Qin Jin |
SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training Strategy. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Bahman Mirheidari, Daniel Blackburn, Heidi Christensen |
Automatic cognitive assessment: Combining sparse datasets with disparate cognitive scores. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ashutosh Pandey 0004, Buye Xu, Anurag Kumar 0003, Jacob Donley, Paul Calamia, DeLiang Wang |
Time-domain Ad-hoc Array Speech Enhancement Using a Triple-path Network. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Suliang Bu, Yunxin Zhao, Tuo Zhao |
Steering vector correction in MVDR beamformer for speech enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sichen Zhang, Aijun Li |
Acquisition of Two Consecutive Neutral Tones in Mandarin-Speaking Preschoolers: Phonological Representation and Phonetic Realization. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dan Lim, Sunghee Jung, Eesung Kim |
JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuna Lee, Seung Jun Baek |
Keyword Spotting with Synthetic Data using Heterogeneous Knowledge Distillation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Minseung Kim, Hyungchan Song, Sein Cheong, Jong Won Shin |
iDeepMMSE: An improved deep learning approach to MMSE speech and noise power spectrum estimation for speech enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mohammad Zeineldeen, Jingjing Xu, Christoph Lüscher, Ralf Schlüter, Hermann Ney |
Improving the Training Recipe for a Robust Conformer-based Hybrid Model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dmitriy Serdyuk, Otavio Braga, Olivier Siohan |
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Muti-Person Video. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zvi Kons, Hagai Aronowitz, Edmilson da Silva Morais, Matheus Damasceno, Hong-Kwang Kuo, Samuel Thomas 0001, George Saon |
Extending RNN-T-based speech recognition systems with emotion and language classification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Magdalena Rybicka, Jesús Villalba 0001, Najim Dehak, Konrad Kowalczyk |
End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Nobuyasu Itoh, George Saon |
Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Pranav Dheram, Murugesan Ramakrishnan, Anirudh Raju, I-Fan Chen, Brian King, Katherine Powell, Melissa Saboowala, Karan Shetty, Andreas Stolcke |
Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
Displaying result #601 - #700 of 18782 (100 per page; Change: ) Pages: [ <<][ 1][ 2][ 3][ 4][ 5][ 6][ 7][ 8][ 9][ 10][ 11][ 12][ 13][ 14][ 15][ 16][ >>] |
|