|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
No Growbag Graphs found.
|
|
|
Results
Found 18782 publication records. Showing 18782 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | Xiaohai Tian, Kaiqi Fu, Shaojun Gao, Yiwei Gu, Kai Wang, Wei Li, Zejun Ma |
A Transfer and Multi-Task Learning based Approach for MOS Prediction. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Peter Wu, Shinji Watanabe 0001, Louis Goldstein, Alan W. Black, Gopala Krishna Anumanchipalli |
Deep Speech Synthesis from Articulatory Representations. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Wei Zhou 0043, Wilfried Michel, Ralf Schlüter, Hermann Ney |
Efficient Training of Neural Transducer for Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Farhat Jabeen, Simon Betz |
Hesitations in Urdu/Hindi: Distribution and Properties of Fillers & Silences. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Konstantinos Klapsas, Nikolaos Ellinas, Karolos Nikitaras, Georgios Vamvoukakis, Panagiotis Kakoulidis, Konstantinos Markopoulos, Spyros Raptis, June Sig Sung, Gunu Jho, Aimilios Chalamandaris, Pirros Tsiakoulis |
Self supervised learning for robust voice cloning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Anderson R. Avila, Khalil Bibi, Rui Heng Yang, Xinlin Li, Chao Xing, Xiao Chen |
Low-bit Shift Network for End-to-End Spoken Language Understanding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chu-Xiao Zuo, Jia-Yi Leng, Wu-Jun Li |
Speaker-Specific Utterance Ensemble based Transfer Attack on Speaker Identification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Khaled Koutini, Jan Schlüter, Hamid Eghbal-zadeh, Gerhard Widmer |
Efficient Training of Audio Transformers with Patchout. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tzeviya Fuchs, Yedid Hoshen, Yossi Keshet |
Unsupervised Word Segmentation using K Nearest Neighbors. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Guolong Zhong, Hongyu Song, Ruoyu Wang 0029, Lei Sun 0010, Diyuan Liu, Jia Pan, Xin Fang, Jun Du, Jie Zhang 0042, Lirong Dai |
External Text Based Data Augmentation for Low-Resource Speech Recognition in the Constrained Condition of OpenASR21 Challenge. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Tatsuya Kitamura, Hideki Banno, Masanori Morise |
An objective test tool for pitch extractors' response attributes. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sarina Meyer, Florian Lux, Pavel Denisov, Julia Koch, Pascal Tilli, Ngoc Thang Vu |
Speaker Anonymization with Phonetic Intermediate Representations. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chiang-Jen Peng, Yun-Ju Chan, Yih-Liang Shen, Cheng Yu, Yu Tsao 0001, Tai-Shih Chi |
Perceptual Characteristics Based Multi-objective Model for Speech Enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Théo Lepage, Réda Dehak |
Label-Efficient Self-Supervised Speaker Verification With Information Maximization and Contrastive Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Abner Hernandez, Paula Andrea Pérez-Toro, Elmar Nöth, Juan Rafael Orozco-Arroyave, Andreas K. Maier, Seung Hee Yang |
Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kai Zhen, Hieu Duy Nguyen, Raviteja Chinta, Nathan Susanj, Athanasios Mouchtaris, Tariq Afzal, Ariya Rastrow |
Sub-8-Bit Quantization Aware Training for 8-Bit Neural Network Accelerator with On-Device Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Deebha Mumtaz, Ajit Jena, Vinit Jakhetiya, Karan Nathwani, Sharath Chandra Guntuku |
Transformer-based quality assessment model for generalized user-generated multimedia audio content. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Andreas Weise, Rivka Levitan |
Investigating the influence of personality on acoustic-prosodic entrainment. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Georgia Maniati, Alexandra Vioni, Nikolaos Ellinas, Karolos Nikitaras, Konstantinos Klapsas, June Sig Sung, Gunu Jho, Aimilios Chalamandaris, Pirros Tsiakoulis |
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xinjian Li, Florian Metze, David R. Mortensen, Alan W. Black, Shinji Watanabe 0001 |
ASR2K: Speech Recognition for Around 2000 Languages without Audio. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zifeng Zhao, Dongchao Yang, Rongzhi Gu, Haoran Zhang, Yuexian Zou |
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Venkatesh Shenoy Kadandale, Juan F. Montesinos, Gloria Haro |
VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Takashi Fukuda, Samuel Thomas 0001, Masayuki Suzuki, Gakuto Kurata, George Saon, Brian Kingsbury |
Global RNN Transducer Models For Multi-dialect Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jinhan Wang, Vijay Ravi, Jonathan Flint, Abeer Alwan |
Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Alena Velichko, Maxim Markitantov, Heysem Kaya, Alexey Karpov 0001 |
Complex Paralinguistic Analysis of Speech: Predicting Gender, Emotions and Deception in a Hierarchical Framework. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tao Li, Xinsheng Wang, Qicong Xie, Zhichao Wang, Mingqi Jiang, Lei Xie 0001 |
Cross-speaker Emotion Transfer Based On Prosody Compensation for End-to-End Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Huy Nguyen, Kai Li 0018, Masashi Unoki |
Automatic Mean Opinion Score Estimation with Temporal Modulation Features on Gammatone Filterbank for Speech Assessment. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ayushi Pandey, Sébastien Le Maguer, Julie Carson-Berndsen, Naomi Harte |
Production characteristics of obstruents in WaveNet and older TTS systems. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jin Sakuma, Shinya Fujie, Tetsunori Kobayashi |
Response Timing Estimation for Spoken Dialog System using Dialog Act Estimation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Rishabh Kumar, Devaraja Adiga, Mayank Kothyari, Jatin Dalal, Ganesh Ramakrishnan, Preethi Jyothi |
VAgyojaka: An Annotating and Post-Editing Tool for Automatic Speech Recognition. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | Nabarun Goswami, Tatsuya Harada |
SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuanyuan Zhang, Yixuan Zhang, Bence Mark Halpern, Tanvina Patel, Odette Scharenborg |
Mitigating bias against non-native accents. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kevin Kilgour, Beat Gfeller, Qingqing Huang, Aren Jansen, Scott Wisdom, Marco Tagliasacchi |
Text-Driven Separation of Arbitrary Sounds. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Vincent Hughes, Carmen Llamas, Thomas Kettig |
Eliciting and evaluating likelihood ratios for speaker recognition by human listeners under forensically realistic channel-mismatched conditions. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuhong Yang 0001, Xufeng Chen, Qingmu Liu, Weiping Tu, Hongyang Chen, Linjun Cai |
Mandarin Lombard Grid: a Lombard-grid-like corpus of Standard Chinese. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Soha A. Nossier, Julie A. Wall, Mansour Moniri, Cornelius Glackin, Nigel Cannings |
Convolutional Recurrent Smart Speech Enhancement Architecture for Hearing Aids. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jennifer Drexler Fox, Natalie Delworth |
Improving Contextual Recognition of Rare Words with an Alternate Spelling Prediction Model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | May Pik Yu Chan, June Choe, Aini Li, Yiran Chen 0017, Xin Gao, Nicole R. Holliday |
Training and typological bias in ASR performance for world Englishes. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Haibin Wu, Lingwei Meng, Jiawen Kang, Jinchao Li, Xu Li, Xixin Wu, Hung-yi Lee, Helen Meng |
Spoofing-Aware Speaker Verification by Multi-Level Fusion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yoshiki Masuyama, Kouei Yamaoka, Nobutaka Ono |
Joint Optimization of Sampling Rate Offsets Based on Entire Signal Relationship Among Distributed Microphones. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sreeram Manghat, Sreeja Manghat, Tanja Schultz |
Normalization of code-switched text for speech synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nicolae-Catalin Ristea, Radu Tudor Ionescu, Fahad Shahbaz Khan |
SepTr: Separable Transformer for Audio Spectrogram Processing. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Desheng Hu, Xinhui Hu, Xinkang Xu |
Multiple Enhancements to LSTM for Learning Emotion-Salient Features in Speech Emotion Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zixia Fan, Jing Shao, Weigong Pan, Lan Wang |
Revisiting visuo-spatial processing in individuals with congenital amusia. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kei Furukawa, Takeshi Kishiyama, Satoshi Nakamura 0001 |
Applying Syntax-Prosody Mapping Hypothesis and Prosodic Well-Formedness Constraints to Neural Sequence-to-Sequence Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shreyas Seshadri, Tuomo Raitio, Dan Castellani, Jiangchuan Li |
Emphasis Control for Parallel Neural TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jun Wang 0090 |
ESSumm: Extractive Speech Summarization from Untranscribed Meeting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Avamarie Brueggeman, John H. L. Hansen |
Speaker Trait Enhancement for Cochlear Implant Users: A Case Study for Speaker Emotion Perception. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ting-Wei Wu, Biing-Hwang Juang |
Induce Spoken Dialog Intents via Deep Unsupervised Context Contrastive Clustering. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kristina Tesch, Nils-Hendrik Mohrmann, Timo Gerkmann |
On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech Enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Peng Liu, Songbin Li, Jigang Tang |
An End-to-End Macaque Voiceprint Verification Method Based on Channel Fusion Mechanism. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ingo Langheinrich, Simon Stone, Xinyu Zhang, Peter Birkholz |
Glottal inverse filtering based on articulatory synthesis and deep learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Joosung Lee |
The Emotion is Not One-hot Encoding: Learning with Grayscale Label for Emotion Recognition in Conversation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yufei Liu, Rao Ma, Haihua Xu, Yi He, Zejun Ma, Weibin Zhang |
Internal Language Model Estimation Through Explicit Context Vector Learning for Attention-based Encoder-decoder ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Prateeth Nayak, Takuya Higuchi, Anmol Gupta, Shivesh Ranjan, Stephen Shum, Siddharth Sigtia, Erik Marchi, Varun Lakshminarasimhan, Minsik Cho, Saurabh Adya, Chandra Dhir, Ahmed H. Tewfik |
Improving Voice Trigger Detection with Metric Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Taejin Park, Nithin Rao Koluguri, Fei Jia, Jagadeesh Balam, Boris Ginsburg |
NeMo Open Source Speaker Diarization System. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | Baiyun Liu, Qi Song, Mingxue Yang, Wuwen Yuan, Tianbao Wang |
PLCNet: Real-time Packet Loss Concealment with Semi-supervised Generative Adversarial Network. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xiao Wang 0022, Song Cheng, Jun Li, Shushan Qiao, Yumei Zhou, Yi Zhan |
Low-complex and Highly-performed Binary Residual Neural Network for Small-footprint Keyword Spotting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso |
Towards End-to-End Private Automatic Speaker Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Daniel Zhang, Ashwinkumar Ganesan, Sarah Campbell, Daniel Korzekwa |
L2-GEN: A Neural Phoneme Paraphrasing Approach to L2 Speech Synthesis for Mispronunciation Diagnosis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Debottam Dutta, Debarpan Bhattacharya, Sriram Ganapathy, Amir Hossein Poorjam, Deepak Mittal, Maneesh Singh 0001 |
Acoustic Representation Learning on Breathing and Speech Signals for COVID-19 Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Junrui Ni, Liming Wang, Heting Gao, Kaizhi Qian, Yang Zhang 0001, Shiyu Chang, Mark Hasegawa-Johnson |
Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yifan Sun, Qinlong Huang, Xihong Wu |
Unsupervised Acoustic-to-Articulatory Inversion with Variable Vocal Tract Anatomy. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Qibing Bai, Tom Ko, Yu Zhang 0006 |
A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dan Berrebbi, Jiatong Shi, Brian Yan, Osbel López-Francisco, Jonathan D. Amith, Shinji Watanabe 0001 |
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nan Li, Xiguang Zheng, Chen Zhang, Liang Guo, Bing Yu |
End-to-End Multi-Loss Training for Low Delay Packet Loss Concealment. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jie Wei, Guanyu Hu, Xinyu Yang, Anh Tuan Luu, Yizhuo Dong |
Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shuo-Yiin Chang, Guru Prakash, Zelin Wu, Tara N. Sainath, Bo Li 0028, Qiao Liang 0001, Adam Stambler, Shyam Upadhyay, Manaal Faruqui, Trevor Strohman |
Streaming Intended Query Detection using E2E Modeling for Continued Conversation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xue Yang, Changchun Bao |
Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Eklavya Sarkar, RaviShankar Prasad, Mathew Magimai-Doss |
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yang Xiao, Nana Hou, Eng Siong Chng |
Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nicola Pia, Kishan Gupta, Srikanth Korse, Markus Multrus, Guillaume Fuchs |
NESC: Robust Neural End-2-End Speech Coding with GANs. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Arindam Ghosh, Mark C. Fuhs, Deblin Bagchi, Bahman Farahani, Monika Woszczyna |
Low-resource Low-footprint Wake-word Detection using Knowledge Distillation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kun Wei, Yike Zhang, Sining Sun, Lei Xie 0001, Long Ma |
Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kun Wei, Pengcheng Guo, Ning Jiang |
Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Youngdo Ahn, Sung Joo Lee, Jong Won Shin |
Multi-Corpus Speech Emotion Recognition for Unseen Corpus Using Corpus-Wise Weights in Classification Loss. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuto Nishimura, Yuki Saito, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari |
Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Rui Tao, Long Yan, Kazushige Ouchi, Xiangdong Wang |
Couple learning for semi-supervised sound event detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Okan Köpüklü, Maja Taseska |
ResectNet: An Efficient Architecture for Voice Activity Detection on Mobile Devices. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tanya Talkar, Christina Manxhari, James J. Williamson, Kara M. Smith, Thomas F. Quatieri |
Speech Acoustics in Mild Cognitive Impairment and Parkinson's Disease With and Without Concurrent Drawing Tasks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Piotr Kawa, Marcin Plata, Piotr Syga |
Attack Agnostic Dataset: Towards Generalization and Stabilization of Audio DeepFake Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Fumio Nihei, Ryo Ishii, Yukiko I. Nakano, Kyosuke Nishida, Ryo Masumura, Atsushi Fukayama, Takao Nakamura |
Dialogue Acts Aided Important Utterance Detection Based on Multiparty and Multimodal Information. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nicolas Audibert, Cécile Fougeron |
Intra-speaker phonetic variation in read speech: comparison with inter-speaker variability in a controlled population. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Woo Hyun Kang, Md. Jahangir Alam, Abderrahim Fathan |
Mixup regularization strategies for spoofing countermeasure system. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Duc Le, Akshat Shrivastava, Paden D. Tomasello, Suyoun Kim, Aleksandr Livshits, Ozlem Kalinli, Michael L. Seltzer |
Deliberation Model for On-Device Spoken Language Understanding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Cécile Fougeron, Nicolas Audibert, Ina Kodrasi, Parvaneh Janbakhshi, Michaela Pernon, Nathalie Lévêque, Stephanie Borel, Marina Laganaro, Hervé Bourlard, Frédéric Assal |
Comparison of 5 methods for the evaluation of intelligibility in mild to moderate French dysarthric speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yaroslav Getman, Ragheb Al-Ghezi, Katja Voskoboinik, Tamás Grósz, Mikko Kurimo, Giampiero Salvi, Torbjørn Svendsen, Sofia Strömbergsson |
wav2vec2-based Speech Rating System for Children with Speech Sound Disorder. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shuo-Yiin Chang, Bo Li 0028, Tara N. Sainath, Chao Zhang, Trevor Strohman, Qiao Liang 0001, Yanzhang He |
Turn-Taking Prediction for Natural Conversational Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | W. Ronny Huang, Shuo-Yiin Chang, David Rybach, Tara N. Sainath, Rohit Prabhavalkar, Cal Peyser, Zhiyun Lu, Cyril Allauzen |
E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Junghun Kim, Yoojin An, Jihie Kim |
Improving Speech Emotion Recognition Through Focus and Calibration Attention Mechanisms. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xin-Chun Li, Jin-Lin Tang, Shaoming Song, Bingshuai Li, Yinchuan Li, Yunfeng Shao 0001, Le Gan, De-Chuan Zhan |
Avoid Overfitting User Specific Information in Federated Keyword Spotting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yan Gao, Javier Fernández-Marqués, Titouan Parcollet, Abhinav Mehrotra, Nicholas D. Lane |
Federated Self-supervised Speech Representations: Are We There Yet? |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Pranay Manocha, Anurag Kumar 0003 |
Speech Quality Assessment through MOS using Non-Matching References. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ryu Takeda, Yui Sudo, Kazuhiro Nakadai, Kazunori Komatani |
Empirical Sampling from Latent Utterance-wise Evidence Model for Missing Data ASR based on Neural Encoder-Decoder Model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Louise Coppieters de Gibson, Philip N. Garner |
Low-Level Physiological Implications of End-to-End Learning for Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Naoyuki Kanda, Jian Wu 0027, Yu Wu 0012, Xiong Xiao, Zhong Meng, Xiaofei Wang 0009, Yashesh Gaur, Zhuo Chen 0006, Jinyu Li 0001, Takuya Yoshioka |
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chengdong Liang, Yijiang Chen, Jiadi Yao, Xiao-Lei Zhang 0001 |
Multi-Channel Far-Field Speaker Verification with Large-Scale Ad-hoc Microphone Arrays. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chung Soo Ahn, L. L. Chamara Kasun, Sunil Sivadas, Jagath C. Rajapakse |
Recurrent multi-head attention fusion network for combining audio and text for speech emotion recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Marco Dinarelli, Marco Naguib, François Portet |
Toward Low-Cost End-to-End Spoken Language Understanding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Takayuki Nagamine |
Acquisition of allophonic variation in second language speech: An acoustic and articulatory study of English laterals by Japanese speakers. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
Displaying result #101 - #200 of 18782 (100 per page; Change: ) Pages: [ <<][ 1][ 2][ 3][ 4][ 5][ 6][ 7][ 8][ 9][ 10][ 11][ >>] |
|