|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
No Growbag Graphs found.
|
|
|
Results
Found 18782 publication records. Showing 18782 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | Takaaki Saeki, Kentaro Tachibana, Ryuichi Yamamoto |
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Takashi Maekaku, Yuya Fujita, Yifan Peng, Shinji Watanabe 0001 |
Attention Weight Smoothing Using Prior Distributions for Transformer-Based End-to-End ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jan Svec, Jan Lehecka, Lubos Smídl |
Deep LSTM Spoken Term Detection using Wav2Vec 2.0 Recognizer. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Thi Thu Trang Nguyen, Trung Duc Anh Dang, Quoc Viet Vu, Woomyoung Park |
Building Vietnamese Conversational Smart Home Dataset and Natural Language Understanding Model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Keiko Ochi, Nobutaka Ono, Keiho Owada, Miho Kuroda, Shigeki Sagayama, Hidenori Yamasue |
Use of Nods Less Synchronized with Turn-Taking and Prosody During Conversations in Adults with Autism. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hyeon-Kyeong Shin, Hyewon Han, Doyeon Kim, Soo-Whan Chung, Hong-Goo Kang |
Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Themos Stafylakis, Ladislav Mosner, Oldrich Plchot, Johan Rohdin, Anna Silnova, Lukás Burget, Jan Cernocký |
Training speaker embedding extractors using multi-speaker audio with unknown speaker boundaries. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Li Fu, Xiaoxiao Li, Runyu Wang, Lu Fan, Zhengchen Zhang, Meng Chen 0006, Youzheng Wu, Xiaodong He 0001 |
SCaLa: Supervised Contrastive Learning for End-to-End Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jeong-Hwan Choi, Joon-Young Yang, Ye-Rin Jeoung, Joon-Hyuk Chang |
HYU Submission for the SASV Challenge 2022: Reforming Speaker Embeddings with Spoofing-Aware Conditioning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yajian Wang, Jun Du, Hang Chen, Qing Wang 0008, Chin-Hui Lee 0001 |
Deep Segment Model for Acoustic Scene Classification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Li Zhang 0084, Yue Li, Huan Zhao, Qing Wang 0039, Lei Xie 0001 |
Backend Ensemble for Speaker Verification and Spoofing Countermeasure. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Juliana N. Saba, John H. L. Hansen |
Speech Modification for Intelligibility in Cochlear Implant Listeners: Individual Effects of Vowel- and Consonant-Boosting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Panagiotis Kakoulidis, Nikolaos Ellinas, Georgios Vamvoukakis, Konstantinos Markopoulos, June Sig Sung, Gunu Jho, Pirros Tsiakoulis, Aimilios Chalamandaris |
Karaoker: Alignment-free singing voice synthesis with speech training data. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Alexander Alenin, Nikita Torgashov, Anton Okhotnikov, Rostislav Makarov, Ivan Yakovlev |
A Subnetwork Approach for Spoofing Aware Speaker Verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Adriana Stan |
The ZevoMOS entry to VoiceMOS Challenge 2022. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hagen Soltau, Izhak Shafran, Mingqiu Wang, Laurent El Shafey |
RNN Transducers for Named Entity Recognition with constraints on alignment for understanding medical conversations. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mateusz Guzik, Konrad Kowalczyk |
NTF of Spectral and Spatial Features for Tracking and Separation of Moving Sound Sources in Spherical Harmonic Domain. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Helen Gent, Chase Adams, Yan Tang, Chilin Shih |
Deep Learning for Prosody-Based Irony Classification in Spontaneous Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Soky Kak, Sheng Li 0010, Masato Mimura, Chenhui Chu, Tatsuya Kawahara |
Leveraging Simultaneous Translation for Enhancing Transcription of Low-resource Language via Cross Attention Mechanism. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Pol van Rijn, Silvan Mertes, Dominik Schiller, Piotr Dura, Hubert Siuzdak, Peter M. C. Harrison, Elisabeth André, Nori Jacoby |
VoiceMe: Personalized voice generation in TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kye Min Tan, Richeng Duan, Xin Huang, Bowei Zou, Xuan Long Do |
A Deep Learning Platform for Language Education Research and Development. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | C. Siddarth, Sathvik Udupa, Prasanta Kumar Ghosh |
Watch Me Speak: 2D Visualization of Human Mouth during Speech. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | Zhiyuan Peng, Xuanji He, Ke Ding, Tan Lee, Guanglu Wan |
Unifying Cosine and PLDA Back-ends for Speaker Verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Pranay Manocha, Anurag Kumar 0003, Buye Xu, Anjali Menon, Israel Dejene Gebru, Vamsi Krishna Ithapu, Paul Calamia |
SAQAM: Spatial Audio Quality Assessment Metric. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Biswaranjan Pattanayak, Gayadhar Pradhan |
Significance of single frequency filter for the development of children's KWS system. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Christina Sartzetaki, Georgios Paraskevopoulos, Alexandros Potamianos |
Extending Compositional Attention Networks for Social Reasoning in Videos. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sanae Matsui, Kyoji Iwamoto, Reiko Mazuka |
Development of allophonic realization until adolescence: A production study of the affricate-fricative variation of /z/ among Japanese children. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | George Close, Samuel Hollands, Stefan Goetze, Thomas Hain |
Non-intrusive Speech Intelligibility Metric Prediction for Hearing Impaired Individuals. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jingwen Cheng, Yuchen Yan, Yingming Gao, Xiaoli Feng, Yannan Wang, Jinsong Zhang 0001 |
A study of production error analysis for Mandarin-speaking Children with Hearing Impairment. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Arne-Lukas Fietkau, Simon Stone, Peter Birkholz |
Relationship between the acoustic time intervals and tongue movements of German diphthongs. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kentaro Mitsui, Tianyu Zhao, Kei Sawada, Yukiya Hono, Yoshihiko Nankaku, Keiichi Tokuda |
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Keisuke Kinoshita, Thilo von Neumann, Marc Delcroix, Christoph Böddeker, Reinhold Haeb-Umbach |
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jiaxu He, Cheng Gong, Longbiao Wang, Di Jin 0001, Xiaobao Wang, Junhai Xu, Jianwu Dang 0001 |
Improve emotional speech synthesis quality by learning explicit and implicit representations with semi-supervised training. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hideyuki Tachibana, Muneyoshi Inahara, Mocho Go, Yotaro Katayama, Yotaro Watanabe |
Diffusion Generative Vocoder for Fullband Speech Synthesis Based on Weak Third-order SDE Solver. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Seong-Hwan Heo, WonKee Lee, Jong-Hyeok Lee |
mcBERT: Momentum Contrastive Learning with BERT for Zero-Shot Slot Filling. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zikai Chen, Lin Wu, Junjie Pan, Xiang Yin 0006 |
An Automatic Soundtracking System for Text-to-Speech Audiobooks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jiahui Pan, Shuai Nie, Hui Zhang 0031, Shulin He, Kanghao Zhang, Shan Liang, Xueliang Zhang 0001, Jianhua Tao 0001 |
Speaker recognition-assisted robust audio deepfake detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kimiko Tsukada, Yurong Yurong |
Non-native Perception of Japanese Singleton/Geminate Contrasts: Comparison of Mandarin and Mongolian Speakers Differing in Japanese Experience. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuanhao Yi, Lei He 0005, Shifeng Pan, Xi Wang 0016, Yuchao Zhang |
SoftSpeech: Unsupervised Duration Model in FastSpeech 2. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sangwook Park, Sandeep Reddy Kothinti, Mounya Elhilali |
Temporal coding with magnitude-phase regularization for sound event detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Julitta Bartolewska, Stanislaw Kacprzak, Konrad Kowalczyk |
Refining DNN-based Mask Estimation using CGMM-based EM Algorithm for Multi-channel Noise Reduction. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Changsheng Quan, Xiaofei Li |
Multichannel Speech Separation with Narrow-band Conformer. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hiroaki Sato, Tomoyasu Komori, Takeshi Mishima, Yoshihiko Kawai, Takahiro Mochizuki, Shoei Sato, Tetsuji Ogawa |
Text-Only Domain Adaptation Based on Intermediate CTC. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nikko Strom, Haidar Khan, Wael Hamza |
Squashed Weight Distribution for Low Bit Quantization of Deep Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shuta Taniguchi, Tsuneo Kato, Akihiro Tamura, Keiji Yasuda |
Transformer-Based Automatic Speech Recognition with Auxiliary Input of Source Language Text Toward Transcribing Simultaneous Interpretation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tatsuya Kitamura, Naoki Kunimoto, Hideki Kawahara, Shigeaki Amano |
Perceptual Evaluation of Penetrating Voices through a Semantic Differential Method. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yu Bai, Ferdy Hubers, Catia Cucchiarini, Roeland van Hout, Helmer Strik |
The Effects of Implicit and Explicit Feedback in an ASR-based Reading Tutor for Dutch First-graders. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuma Koizumi, Heiga Zen, Kohei Yatabe, Nanxin Chen, Michiel Bacchiani |
SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tarun Sai Bandarupalli, Shakti Rath, Nirmesh Shah, Naoyuki Onoe, Sriram Ganapathy |
Semi-supervised Acoustic and Language Modeling for Hindi ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhan Zhang, Yuehai Wang, Jianyi Yang |
BiCAPT: Bidirectional Computer-Assisted Pronunciation Training with Normalizing Flows. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nathaniel Romney Robinson, Perez Ogayo, Swetha R. Gangu, David R. Mortensen, Shinji Watanabe 0001 |
When Is TTS Augmentation Through a Pivot Language Useful? |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki |
MISRNet: Lightweight Neural Vocoder Using Multi-Input Single Shared Residual Blocks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Bin Gu |
Deep speaker embedding with frame-constrained training strategy for speaker verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dominika Woszczyk, Anna Hlédiková, Alican Akman, Soteris Demetriou, Björn W. Schuller |
Data Augmentation for Dementia Detection in Spoken Language. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Lorenz Diener, Sten Sootla, Solomiya Branets, Ando Saabas, Robert Aichner, Ross Cutler |
INTERSPEECH 2022 Audio Deep Packet Loss Concealment Challenge. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ansen Antony, Sumanth Reddy Kota, Akhilesh Lade, Spoorthy V, Shashidhar G. Koolagudi |
An Improved Transformer Transducer Architecture for Hindi-English Code Switched Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Maxim Markitantov, Elena Ryumina, Dmitry Ryumin, Alexey Karpov 0001 |
Biometric Russian Audio-Visual Extended MASKS (BRAVE-MASKS) Corpus: Multimodal Mask Type Recognition Task. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Han Zhu, Li Wang, Gaofeng Cheng, Jindong Wang 0001, Pengyuan Zhang, Yonghong Yan 0002 |
Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Heting Gao, Junrui Ni, Kaizhi Qian, Yang Zhang 0001, Shiyu Chang, Mark Hasegawa-Johnson |
WavPrompt: Towards Few-Shot Spoken Language Understanding with Frozen Language Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Karolos Nikitaras, Georgios Vamvoukakis, Nikolaos Ellinas, Konstantinos Klapsas, Konstantinos Markopoulos, Spyros Raptis, June Sig Sung, Gunu Jho, Aimilios Chalamandaris, Pirros Tsiakoulis |
Fine-grained Noise Control for Multispeaker Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Takeshi Kishiyama, Chuyu Huang, Yuki Hirose |
One-step models in pitch perception: Experimental evidence from Japanese. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Junpeng Liu, Yanyan Zou, Yuxuan Xi, Shengjie Li, Mian Ma, Zhuoye Ding, Bo Long |
Negative Guided Abstractive Dialogue Summarization. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tsiky Rakotomalala, Pierre Baraduc, Pascal Perrier |
Trajectories predicted by optimal speech motor control using LSTM networks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xiaoxiao Miao, Xin Wang 0037, Erica Cooper, Junichi Yamagishi, Natalia A. Tomashenko |
Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yui Sudo, Muhammad Shakeel 0001, Kazuhiro Nakadai, Jiatong Shi, Shinji Watanabe 0001 |
Streaming Automatic Speech Recognition with Re-blocking Processing Based on Integrated Voice Activity Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Iona Gessinger, Michelle Cohn, Georgia Zellou, Bernd Möbius |
Cross-Cultural Comparison of Gradient Emotion Perception: Human vs. Alexa TTS Voices. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dong Wang, Yanhui Ding, Qing Zhao, Peilin Yang, Shuping Tan, Ya Li |
ECAPA-TDNN Based Depression Detection from Clinical Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Axel Berg, Mark O'Connor, Kalle Åström, Magnus Oskarsson |
Extending GCC-PHAT using Shift Equivariant Neural Networks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Thomas R. O'Malley, Arun Narayanan, Quan Wang |
A universally-deployable ASR frontend for joint acoustic echo cancellation, speech enhancement, and voice separation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Asahi Ogushi, Toshiki Onishi, Yohei Tahara, Ryo Ishii, Atsushi Fukayama, Takao Nakamura, Akihiro Miyata |
Analysis of praising skills focusing on utterance contents. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Helard Becerra Martinez, Alessandro Ragano, Andrew Hines |
Exploring the influence of fine-tuning data on wav2vec 2.0 model for blind speech quality prediction. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sarenne Carrol Wallbridge, Catherine Lai, Peter Bell 0001 |
Investigating perception of spoken dialogue acceptability through surprisal. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Claus M. Larsen, Peter Koch 0001, Zheng-Hua Tan |
Adversarial Multi-Task Deep Learning for Noise-Robust Voice Activity Detection with Low Algorithmic Delay. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jian Xue, Peidong Wang, Jinyu Li 0001, Matt Post, Yashesh Gaur |
Large-Scale Streaming End-to-End Speech Translation with Neural Transducers. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Haohan Guo, Hui Lu, Xixin Wu, Helen Meng |
A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kentaro Mitsui, Kei Sawada |
MSR-NV: Neural Vocoder Using Multiple Sampling Rates. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Anirudh Raju, Milind Rao, Gautam Tiwari, Pranav Dheram, Bryan Anderson, Zhe Zhang, Chul Lee, Bach Bui, Ariya Rastrow |
On joint training with interfaces for spoken language understanding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Julian Zaïdi, Hugo Seuté, Benjamin van Niekerk, Marc-André Carbonneau |
Daft-Exprt: Cross-Speaker Prosody Transfer on Any Text for Expressive Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhengjun Yue, Erfan Loweimi, Heidi Christensen, Jon Barker, Zoran Cvetkovic |
Dysarthric Speech Recognition From Raw Waveform with Parametric CNNs. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang 0001 |
On Metric Learning for Audio-Text Cross-Modal Retrieval. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dorina De Jong, Aldo Pastore, Noël Nguyen, Alessandro D'Ausilio |
Speech imitation skills predict automatic phonetic convergence: a GMM-UBM study on L2. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Suyoun Kim, Duc Le, Weiyi Zheng, Tarun Singh, Abhinav Arora, Xiaoyu Zhai, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer |
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jingyu Li, Wei Liu, Tan Lee |
EDITnet: A Lightweight Network for Unsupervised Domain Adaptation in Speaker Verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Raul Fernandez, David Haws, Guy Lorberbom, Slava Shechtman, Alexander Sorin |
Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jeremy Heng Meng Wong, Huayun Zhang, Nancy F. Chen |
Variations of multi-task learning for spoken language assessment. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Linh The Nguyen, Nguyen Luong Tran, Long Doan, Manh Luong, Dat Quoc Nguyen |
A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Federico Landini, Alicia Lozano-Diez, Mireia Díez, Lukás Burget |
From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nimshi Venkat Meripo, Sandeep Konam |
ASR Error Detection via Audio-Transcript entailment. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chengyi Wang 0002, Yiming Wang, Yu Wu 0012, Sanyuan Chen, Jinyu Li 0001, Shujie Liu 0001, Furu Wei |
Supervision-Guided Codebooks for Masked Prediction in Speech Pre-training. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | P. Schäfer, Paula Andrea Pérez-Toro, Philipp Klumpp, Juan Rafael Orozco-Arroyave, Elmar Nöth, Andreas K. Maier, A. Abad, Maria Schuster, Tomás Arias-Vergara |
CoachLea: an Android Application to Evaluate the Speech Production and Perception of Children with Hearing Loss. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | Peng Shen, Xugang Lu, Hisashi Kawai |
Transducer-based language embedding for spoken language identification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | John H. L. Hansen, Zhenyu Wang |
Audio Anti-spoofing Using Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ashutosh Pandey 0004, DeLiang Wang |
Attentive Training: A New Training Framework for Talker-independent Speaker Extraction. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xuyi Zhuang, Lu Zhang, Zehua Zhang, Yukun Qian, Mingjiang Wang |
Coarse-Grained Attention Fusion With Joint Training Framework for Complex Speech Enhancement and End-to-End Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik |
COVID-19 detection based on respiratory sensing from speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jean-Marc Valin, Ahmed Mustafa, Christopher Montgomery, Timothy B. Terriberry, Michael Klingbeil, Paris Smaragdis, Arvindh Krishnaswamy |
Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dehua Tao, Tan Lee, Harold Chui, Sarah Luk |
Hierarchical Attention Network for Evaluating Therapist Empathy in Counseling Session. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yoshiaki Bando, Takahiro Aizawa, Katsutoshi Itoyama, Kazuhiro Nakadai |
Weakly-Supervised Neural Full-Rank Spatial Covariance Analysis for a Front-End System of Distant Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Philipp Buech, Simon Roessig, Lena Pagel, Doris Mücke, Anne Hermes |
ema2wav: doing articulation by Praat. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ting-Wei Wu, I-Fan Chen, Ankur Gandhe |
Learning to rank with BERT-based confidence models in ASR rescoring. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
Displaying result #301 - #400 of 18782 (100 per page; Change: ) Pages: [ <<][ 1][ 2][ 3][ 4][ 5][ 6][ 7][ 8][ 9][ 10][ 11][ 12][ 13][ >>] |
|