Results
Found 18782 publication records. Showing 18782 according to the selection in the facets.
Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | Shahar Lutati, Eliya Nachmani, Lior Wolf |
SepIt: Approaching a Single Channel Speech Separation Bound. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tuan Vu Ho, Quoc Huy Nguyen, Masato Akagi, Masashi Unoki |
Vector-quantized Variational Autoencoder for Phase-aware Speech Enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ashishkumar Prabhakar Gudmalwar, Biplove Basel, Anirban Dutta, Ch V. Rama Rao |
The Magnitude and Phase based Speech Representation Learning using Autoencoder for Classifying Speech Emotions using Deep Canonical Correlation Analysis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Paul Konstantin Krug, Peter Birkholz, Branislav Gerazov, Daniel Rudolph van Niekerk, Anqi Xu, Yi Xu 0007 |
Articulatory Synthesis for Data Augmentation in Phoneme Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chang Zeng, Lin Zhang, Meng Liu, Junichi Yamagishi |
Spoofing-Aware Attention based ASV Back-end with Multiple Enrollment Utterances and a Sampling Strategy for the SASV Challenge 2022. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Rini A. Sharon, Heet Shah, Debdoot Mukherjee, Vikram Gupta |
Multilingual and Multimodal Abuse Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Leticia Arco, Carlos Mosquera, Fabjola Braho, Yisel Clavel Quintero, Johan Loeckx |
Evaluation of call centre conversations based on a high-level symbolic representation. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | W. Ronny Huang, Cal Peyser, Tara N. Sainath, Ruoming Pang, Trevor D. Strohman, Shankar Kumar |
Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yi Xu 0007, Anqi Xu, Daniel R. van Niekerk, Branislav Gerazov, Peter Birkholz, Paul Konstantin Krug, Santitham Prom-on, Lorna F. Halliday |
Evoc-Learn - High quality simulation of early vocal learning. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | Chandan K. A. Reddy, Vishak Gopal, Harishchandra Dubey, Ross Cutler, Sergiy Matusevych, Robert Aichner |
MusicNet: Compact Convolutional Neural Network for Real-time Background Music Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Maureen de Seyssel, Marvin Lavechin, Yossi Adi, Emmanuel Dupoux, Guillaume Wisniewski |
Probing phoneme, language and speaker information in unsupervised speech representations. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Wenjing Liu, Chuan Xie |
Gated Convolutional Fusion for Time-Domain Target Speaker Extraction Network. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Seongkyu Mun, Dhananjaya Gowda, Jihwan Lee, Changwoo Han, Dokyun Lee, Chanwoo Kim 0001 |
Prototypical speaker-interference loss for target voice separation using non-parallel audio samples. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Farhad Javanmardi, Sudarsana Reddy Kadiri, Manila Kodali, Paavo Alku |
Comparing 1-dimensional and 2-dimensional spectral feature representations in voice pathology detection using machine learning and deep learning classifiers. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura 0001 |
Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yehoshua Dissen, Felix Kreuk, Joseph Keshet |
Self-supervised Speaker Diarization. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Donghyeon Kim, Bowon Lee |
Phase Vocoder For Time Stretch Based On Center Frequency Estimation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Matthew Baas, Herman Kamper |
Voice Conversion Can Improve ASR in Very Low-Resource Settings. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dino Rattcliffe, You Wang, Alex Mansbridge, Penny Karanasou, Alexis Moinet, Marius Cotescu |
Cross-lingual Style Transfer with Conditional Prior VAE and Style Loss. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sashi Novitasari, Takashi Fukuda, Gakuto Kurata |
Improving ASR Robustness in Noisy Condition Through VAD Integration. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Minyue Zhang, Hongwei Ding |
Impact of Background Noise and Contribution of Visual Information in Emotion Identification by Native Mandarin Speakers. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mohd Abbas Zaidi, Beomseok Lee, Sangha Kim 0002, Chanwoo Kim 0001 |
Cross-Modal Decision Regularization for Simultaneous Speech Translation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jianjing Kuang, May Pik Yu Chan, Nari Rhee, Mark Liberman, Hongwei Ding |
The mapping between syntactic and prosodic phrasing in English and Mandarin. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, W. Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach |
Improving Rare Word Recognition with LM-aware MWER Training. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sicheng Yang, Methawee Tantrawenith, Haolin Zhuang, Zhiyong Wu 0001, Aolan Sun, Jianzong Wang, Ning Cheng 0001, Huaizhen Tang, Xintao Zhao, Jie Wang, Helen Meng |
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hubert Siuzdak, Piotr Dura, Pol van Rijn, Nori Jacoby |
WavThruVec: Latent speech representation as intermediate features for neural speech synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Adrian Leemann, Péter Jeszenszky, Carina Steiner, Corinne Lanthemann |
Factors affecting the percept of Yanny v. Laurel (or mixed): Insights from a large-scale study on Swiss German listeners. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hang Chen, Jun Du, Yusheng Dai, Chin-Hui Lee 0001, Sabato Marco Siniscalchi, Shinji Watanabe 0001, Odette Scharenborg, Jingdong Chen, Baocai Yin, Jia Pan |
Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Wei Yang, Satoru Fukayama, Panikos Heracleous, Jun Ogata |
Exploiting Fine-tuning of Self-supervised Learning Models for Improving Bi-modal Sentiment Analysis and Emotion Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shahaf Bassan, Yossi Adi, Jeffrey S. Rosenschein |
Unsupervised Symbolic Music Segmentation using Ensemble Temporal Prediction Errors. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Rui Liu 0008, Berrak Sisman, Björn W. Schuller, Guanglai Gao, Haizhou Li 0001 |
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xin Yuan, Robin Feng, Mingming Ye, Cheng Tuo, Minghang Zhang |
AdaVocoder: Adaptive Vocoder for Custom Voice. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jie Chen, Changhe Song, Deyi Tuo, Xixin Wu, Shiyin Kang, Zhiyong Wu 0001, Helen Meng |
Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Franklin Alvarez Cardinale, Waldo Nogueira |
Predicting Speech Intelligibility using the Spike Acativity Mutual Information Index. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Alan Baade, Puyuan Peng, David Harwath |
MAE-AST: Masked Autoencoding Audio Spectrogram Transformer. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sebastian Peter Bayerl, Gabriel Roccabruna, Shammur Absar Chowdhury, Tommaso Ciulli, Morena Danieli, Korbinian Riedhammer, Giuseppe Riccardi |
What can Speech and Language Tell us About the Working Alliance in Psychotherapy. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dacheng Yin, Chuanxin Tang, Yanqing Liu, Xiaoqiang Wang, Zhiyuan Zhao, Yucheng Zhao, Zhiwei Xiong, Sheng Zhao, Chong Luo |
RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ilja Baumann, Dominik Wagner, Sebastian P. Bayerl, Tobias Bocklet |
Nonwords Pronunciation Classification in Language Development Tests for Preschool Children. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Si Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang, Tan Lee |
Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Geoffrey T. Frost, Grant Theron, Thomas Niesler |
TB or not TB? Acoustic cough analysis for tuberculosis classification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jen-Tzung Chien, Yu-Han Huang |
Bayesian Transformer Using Disentangled Mask Attention. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tatsuya Komatsu, Yusuke Fujita, Jaesong Lee, Lukas Lee, Shinji Watanabe 0001, Yusuke Kida |
Better Intermediates Improve CTC Inference. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yixuan Zhou 0002, Changhe Song, Jingbei Li, Zhiyong Wu 0001, Yanyao Bian, Dan Su 0002, Helen Meng |
Enhancing Word-Level Semantic Representation via Dependency Structure for Expressive Text-to-Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hengshun Zhou, Jun Du, Gongzhen Zou, Zhaoxu Nian, Chin-Hui Lee 0001, Sabato Marco Siniscalchi, Shinji Watanabe 0001, Odette Scharenborg, Jingdong Chen, Shifu Xiong, Jianqing Gao |
Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Peter Makarov, Syed Ammar Abbas, Mateusz Lajszczak, Arnaud Joly, Sri Karlapati, Alexis Moinet, Thomas Drugman, Penny Karanasou |
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Leon Liebig, Christoph Wagner, Alexander Mainka, Peter Birkholz |
An investigation of regression-based prediction of the femininity or masculinity in speech of transgender people. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mamady Nabé, Julien Diard, Jean-Luc Schwartz |
Isochronous is beautiful? Syllabic event detection in a neuro-inspired oscillatory model is facilitated by isochrony in speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jiachen Lian, Alan W. Black, Louis Goldstein, Gopala Krishna Anumanchipalli |
Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jeewoo Yoon, Jinyoung Han, Erik P. Bucy, Jungseock Joo |
Predicting Emotional Intensity in Political Debates via Non-verbal Signals. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhao Zhang, Ju Zhang 0001, Jianguo Wei, Kiyoshi Honda, Tatsuya Kitamura |
Vocal-Tract Area Functions with Articulatory Reality for Tract Opening. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yashish M. Siriwardena, Ganesh Sivaraman, Carol Y. Espy-Wilson |
Acoustic-to-articulatory Speech Inversion with Multi-task Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jack Deadman, Jon Barker |
Modelling Turn-taking in Multispeaker Parties for Realistic Data Simulation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Giang Le, Chilin Shih, Yan Tang |
A Laryngographic Study on the Voice Quality of Northern Vietnamese Tones under the Lombard Effect. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tiantian Feng, Shrikanth Narayanan |
Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Wei-Cheng Tseng, Wei-Tsung Kao, Hung-yi Lee |
DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training and Distribution of Opinion Scores. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zehai Tu, Ning Ma 0002, Jon Barker |
Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhaoci Liu, Ning-Qian Wu, Yajie Zhang, Zhenhua Ling |
Integrating Discrete Word-Level Style Variations into Non-Autoregressive Acoustic Models for Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jee-weon Jung, You Jin Kim, Hee-Soo Heo, Bong-Jin Lee, Youngki Kwon, Joon Son Chung |
Pushing the limits of raw waveform speaker recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Andreas Liesenfeld, Mark Dingemanse |
Bottom-up discovery of structure and variation in response tokens ('backchannels') across diverse languages. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | WooSeok Shin, Hyun Joon Park, Jin Sob Kim, Byung Hoon Lee, Sung Won Han 0003 |
Multi-View Attention Transfer for Efficient Speech Enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhijie Shen, Wu Guo |
An Improved Deliberation Network with Text Pre-training for Code-Switching Automatic Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Adrien Pupier, Maximin Coavoux, Benjamin Lecouteux, Jérôme Goulian |
End-to-End Dependency Parsing of Spoken French. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ye Jia, Yifan Ding, Ankur Bapna, Colin Cherry, Yu Zhang 0033, Alexis Conneau, Nobu Morioka |
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nina Benway, Jonathan L. Preston, Elaine Hitchcock, Asif Salekin, Harshit Sharma, Tara McAllister Byun |
PERCEPT-R: An Open-Access American English Child/Clinical Speech Corpus Specialized for the Audio Classification of /ɹ/. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Qian Wang, Chen Wang, Jiajun Zhang |
Investigating Parameter Sharing in Multilingual Speech Translation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Huahuan Zheng, Keyu An, Zhijian Ou, Chen Huang, Ke Ding, Guanglu Wan |
An Empirical Study of Language Model Integration for Transducer based Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Derek Tam, Surafel Melaku Lakew, Yogesh Virkar, Prashant Mathur, Marcello Federico |
Isochrony-Aware Neural Machine Translation for Automatic Dubbing. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhuohuang Zhang, Donald S. Williamson, Yi Shen 0008 |
Investigation on the Band Importance of Phase-aware Speech Enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dong-Hyun Kim, Jae-Hong Lee, Ji-Hwan Mo, Joon-Hyuk Chang |
W2V2-Light: A Lightweight Version of Wav2vec 2.0 for Automatic Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhenke Gao, Man-Wai Mak, Weiwei Lin 0002 |
UNet-DenseNet for Robust Far-Field Speaker Verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Samuel Bellows, Timothy W. Leishman |
Effect of Head Orientation on Speech Directivity. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kyuhong Shim, Wonyong Sung |
Similarity and Content-based Phonetic Self Attention for Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Takaaki Saeki, Shinnosuke Takamichi, Tomohiko Nakamura, Naoko Tanji, Hiroshi Saruwatari |
SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel Modeling. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Bishal Lamichhane, Nidal Moukaddam, Ankit B. Patel, Ashutosh Sabharwal |
Dyadic Interaction Assessment from Free-living Audio for Depression Severity Assessment. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuheng Wei, Junzhao Du, Hui Liu 0006, Qian Wang |
CTFALite: Lightweight Channel-specific Temporal and Frequency Attention Mechanism for Enhancing the Speaker Embedding Extractor. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chih-Chiang Chang, Hung-yi Lee |
Exploring Continuous Integrate-and-Fire for Adaptive Simultaneous Speech Translation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jen-Hung Huang, Chung-Hsien Wu |
Memory-Efficient Multi-Step Speech Enhancement with Neural ODE. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jaeuk Lee, Joon-Hyuk Chang |
Advanced Speaker Embedding with Predictive Variance of Gaussian Distribution for Speaker Adaptation in TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jing Su, Longxiang Zhang, Hamid Reza Hassanzadeh, Thomas Schaaf |
Extract and Abstract with BART for Clinical Notes from Doctor-Patient Conversations. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jon Barker, Michael Akeroyd, Trevor J. Cox, John F. Culling, Jennifer Firth, Simone Graetzer, Holly Griffiths, Lara Harris, Graham Naylor, Zuzanna Podwinska, Eszter Porter, Rhoddy Viveros Muñoz |
The 1st Clarity Prediction Challenge: A machine learning challenge for hearing aid intelligibility prediction. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Arijit Mukherjee, Shubham Bansal, Sandeepkumar Satpal, Rupesh K. Mehta |
Text aware Emotional Text-to-speech with BERT. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ali Aroudi, Stefan Uhlich, Marc Ferras Font |
TRUNet: Transformer-Recurrent-U Network for Multi-channel Reverberant Sound Source Separation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhuo Li, Runqiu Xiao, Hangting Chen, Zhenduo Zhao, Zihan Zhang, Wenchao Wang |
The HCCL System for the NIST SRE21. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Woo Hyun Kang, Md. Jahangir Alam, Abderrahim Fathan |
MIM-DG: Mutual information minimization-based domain generalization for speaker verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kwanghee Choi, Hyung-Min Park |
Distilling a Pretrained Language Model to a Multilingual ASR Model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Krishna Subramani, Jean-Marc Valin, Umut Isik, Paris Smaragdis, Arvindh Krishnaswamy |
End-to-end LPCNet: A Neural Vocoder With Fully-Differentiable LPC Estimation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Lester Phillip Violeta, Wen-Chin Huang, Tomoki Toda |
Investigating Self-supervised Pretraining Frameworks for Pathological Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Christoph Draxler, Julian Pomp |
OCTRA - An Innovative Approach to Orthographic Transcription. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | Kirandevraj R, Vinod Kumar Kurmi, Vinay P. Namboodiri, C. V. Jawahar |
Generalized Keyword Spotting using ASR embeddings. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Marcel de Korte, Jaebok Kim, Aki Kunikoshi, Adaeze Adigwe, Esther Klabbers |
Data-augmented cross-lingual synthesis in a teacher-student framework. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chi-Chang Lee, Cheng-Hung Hu, Yu-Chen Lin, Chu-Song Chen, Hsin-Min Wang, Yu Tsao 0001 |
NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mariia Lesnichaia, Veranika Mikhailava, Natalia Bogach, Yurii Lezhenin, John Blake 0002, Evgeny Pyshkin |
Classification of Accented English Using CNN Model Trained on Amplitude Mel-Spectrograms. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Bruce Xiao Wang, Vincent Hughes |
Reducing uncertainty at the score-to-LR stage in likelihood ratio-based forensic voice comparison using automatic speaker recognition systems. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ge Zhu, Juan Pablo Cáceres, Justin Salamon |
Filler Word Detection and Classification: A Dataset and Benchmark. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Artem Ploujnikov, Mirco Ravanelli |
SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yasuhito Ohsugi, Itsumi Saito, Kyosuke Nishida, Sen Yoshida |
Japanese ASR-Robust Pre-trained Language Model with Pseudo-Error Sentences Generated by Grapheme-Phoneme Conversion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jana Roßbach, Rainer Huber, Saskia Röttges, Christopher F. Hauth, Thomas Biberger, Thomas Brand, Bernd T. Meyer, Jan Rennies |
Speech Intelligibility Prediction for Hearing-Impaired Listeners with the LEAP Model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zohreh Mostaani, Mathew Magimai-Doss |
On Breathing Pattern Information in Synthetic Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ding Zhao, Zhan Zhang, Bin Yu, Yuehai Wang |
Improve Speech Enhancement using Perception-High-Related Time-Frequency Loss. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yihan Wu, Xu Tan 0003, Bohan Li 0003, Lei He 0005, Sheng Zhao, Ruihua Song, Tao Qin 0001, Tie-Yan Liu |
AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|