Publications at "Interspeech" (http://dblp.L3S.de/Venues/Interspeech)

URL (DBLP): http://dblp.uni-trier.de/db/conf/interspeech

Publication years (Num. hits)
2000 (923) 2001 (671) 2002 (679) 2003 (799) 2004 (776) 2005 (870) 2006 (660) 2007 (752) 2008 (764) 2009 (766) 2010 (782) 2011 (852) 2012 (678) 2013 (789) 2014 (641) 2015 (793) 2016 (819) 2017 (845) 2018 (792) 2019 (971) 2020 (1036) 2021 (1000) 2022 (1124)
Publication types (Num. hits)
inproceedings(18759) proceedings(23)
Venues (Conferences, Journals, ...)
Interspeech(18782)
Results
Found 18782 publication records. Showing 18782 according to the selection in the facets
A. Arunkumar, Vrunda Nileshkumar Sukhadia, Srinivasan Umesh: Investigation of Ensemble features of Self-Supervised Pretrained Models for Automatic Speech Recognition. INTERSPEECH 2022
Chenfeng Miao, Kun Zou, Ziyang Zhuang, Tao Wei, Jun Ma 0018, Shaojun Wang, Jing Xiao 0006: Towards Efficiently Learning Monotonic Alignments for Attention-based End-to-End Speech Recognition. INTERSPEECH 2022
Golan Pundak, Tsendsuren Munkhdalai, Khe Chai Sim: On-the-fly ASR Corrections with Audio Exemplars. INTERSPEECH 2022
Hannah Muckenhirn, Aleksandr Safin, Hakan Erdogan, Felix de Chaumont Quitry, Marco Tagliasacchi, Scott Wisdom, John R. Hershey: CycleGAN-based Unpaired Speech Dereverberation. INTERSPEECH 2022
Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana: A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech. INTERSPEECH 2022
Yixuan Zhang 0005, Heming Wang, DeLiang Wang: Densely-connected Convolutional Recurrent Network for Fundamental Frequency Estimation in Noisy Speech. INTERSPEECH 2022
Bronya Roni Chernyak, Talia Ben Simon, Yael Segal, Jeremy Steffman, Eleanor Chodroff, Jennifer Cole 0001, Joseph Keshet: DeepFry: Identifying Vocal Fry Using Deep Neural Networks. INTERSPEECH 2022
Shengyuan Xu, Wenxiao Zhao, Jing Guo: RefineGAN: Universally Generating Waveform Better than Ground Truth with Highly Accurate Pitch and Intensity Responses. INTERSPEECH 2022
Ilya Sklyar, Anna Piunova, Christian Osendorfer: Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech. INTERSPEECH 2022
Mutian He 0001, Jingzhou Yang, Lei He 0005, Frank K. Soong: Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge. INTERSPEECH 2022
Tobias Weise, Philipp Klumpp, Andreas K. Maier, Elmar Nöth, Björn Heismann, Maria Schuster, Seung Hee Yang: Disentangled Latent Speech Representation for Automatic Pathological Intelligibility Assessment. INTERSPEECH 2022
Sanli Tian, Keqi Deng, Zehan Li, Lingxuan Ye, Gaofeng Cheng, Ta Li, Yonghong Yan 0002: Knowledge Distillation For CTC-based Speech Recognition Via Consistent Acoustic Representation Learning. INTERSPEECH 2022
Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà: SHAS: Approaching optimal Segmentation for End-to-End Speech Translation. INTERSPEECH 2022
Chenfeng Miao, Ting Chen, Minchuan Chen, Jun Ma 0018, Shaojun Wang, Jing Xiao 0006: A compact transformer-based GAN vocoder. INTERSPEECH 2022
Haoyue Zhan, Xinyuan Yu, Haitong Zhang, Yang Zhang, Yue Lin 0002: Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech. INTERSPEECH 2022
Zuoyu Tian, Xiao Dong, Feier Gao, Haining Wang, Chien-Jer Charles Lin: Mandarin Tone Sandhi Realization: Evidence from Large Speech Corpora. INTERSPEECH 2022
Xue Jiang, Xiulian Peng, Huaying Xue, Yuan Zhang, Yan Lu 0001: Cross-Scale Vector Quantization for Scalable Neural Speech Coding. INTERSPEECH 2022
Changhwan Kim, Se-Yun Um, Hyungchan Yoon, Hong-Goo Kang: FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS. INTERSPEECH 2022
Peng Zhang, Peng Hu, Xueliang Zhang: Norm-constrained Score-level Ensemble for Spoofing Aware Speaker Verification. INTERSPEECH 2022
Min-Kyung Kim 0002, Joon-Hyuk Chang: Adversarial and Sequential Training for Cross-lingual Prosody Transfer TTS. INTERSPEECH 2022
Kaiqi Fu, Shaojun Gao, Xiaohai Tian, Wei Li 0012, Zejun Ma: Using Fluency Representation Learned from Sequential Raw Features for Improving Non-native Fluency Scoring. INTERSPEECH 2022
Georgios Karakasidis, Tamás Grósz, Mikko Kurimo: Comparison and Analysis of New Curriculum Criteria for End-to-End ASR. INTERSPEECH 2022
Rahil Parikh, Gaspar Rochette, Carol Y. Espy-Wilson, Shihab A. Shamma: An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models. INTERSPEECH 2022
Hira Dhamyal, Bhiksha Raj, Rita Singh: Positional Encoding for Capturing Modality Specific Cadence for Emotion Detection. INTERSPEECH 2022
Seungu Han, Junhyeok Lee: NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates. INTERSPEECH 2022
Zongyang Du, Berrak Sisman, Kun Zhou 0003, Haizhou Li 0001: Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion. INTERSPEECH 2022
Vera Bernhard, Sandra Schwab, Jean-Philippe Goldman: Acoustic Stress Detection in Isolated English Words for Computer-Assisted Pronunciation Training. INTERSPEECH 2022
Mohammad MohammadAmini, Driss Matrouf, Jean-François Bonastre, Sandipana Dowerah, Romain Serizel, Denis Jouvet: Barlow Twins self-supervised learning for robust speaker recognition. INTERSPEECH 2022
Matthew Perez, Mimansa Jaiswal, Minxue Niu, Cristina Gorrostieta, Matthew Roddy, Kye Taylor, Reza Lotfian, John Kane, Emily Mower Provost: Mind the gap: On the value of silence representations to lexical-based speech emotion recognition. INTERSPEECH 2022
Zhihan Wang, Feng Hou, Yuanhang Qiu, Zhizhong Ma, Satwinder Singh, Ruili Wang: CyclicAugment: Speech Data Random Augmentation with Cosine Annealing Scheduler for Auotmatic Speech Recognition. INTERSPEECH 2022
Eleftherios Kapelonis, Efthymios Georgiou, Alexandros Potamianos: A Multi-Task BERT Model for Schema-Guided Dialogue State Tracking. INTERSPEECH 2022
Janine Rugayan, Torbjørn Svendsen, Giampiero Salvi: Semantically Meaningful Metrics for Norwegian ASR Systems. INTERSPEECH 2022
Tasnima Sadekova, Vladimir Gogoryan, Ivan Vovk, Vadim Popov, Mikhail A. Kudinov, Jiansheng Wei: A Unified System for Voice Cloning and Voice Conversion through Diffusion Probabilistic Modeling. INTERSPEECH 2022
Tong Ye, Shijing Si, Jianzong Wang, Ning Cheng 0001, Jing Xiao 0006: Uncertainty Calibration for Deep Audio Classifiers. INTERSPEECH 2022
Junyi Peng, Rongzhi Gu, Ladislav Mosner, Oldrich Plchot, Lukás Burget, Jan Cernocký: Learnable Sparse Filterbank for Speaker Verification. INTERSPEECH 2022
Haoran Yin, Meng Ge, Yanjie Fu, Gaoyan Zhang, Longbiao Wang, Lei Zhang, Lin Qiu, Jianwu Dang 0001: MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources. INTERSPEECH 2022
Siqi Zheng, Hongbin Suo, Qian Chen: PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification. INTERSPEECH 2022
Emiru Tsunoo, Yosuke Kashiwagi, Chaitanya Prasad Narisetty, Shinji Watanabe 0001: Residual Language Model for End-to-end Speech Recognition. INTERSPEECH 2022
Paula Andrea Pérez-Toro, Philipp Klumpp, Abner Hernandez, Tomas Arias, Patricia Lillo, Andrea Slachevsky, Adolfo Martín García, Maria Schuster, Andreas K. Maier, Elmar Nöth, Juan Rafael Orozco-Arroyave: Alzheimer's Detection from English to Spanish Using Acoustic and Linguistic Embeddings. INTERSPEECH 2022
Yifei Xin, Dongchao Yang, Yuexian Zou: Audio Pyramid Transformer with Domain Adaption for Weakly Supervised Sound Event Detection and Audio Classification. INTERSPEECH 2022
Fan Qian, Hongwei Song, Jiqing Han 0001: Word-wise Sparse Attention for Multimodal Sentiment Analysis. INTERSPEECH 2022
Anuroop Sriram, Michael Auli, Alexei Baevski: Wav2Vec-Aug: Improved self-supervised training with limited data. INTERSPEECH 2022
Youxiang Zhu, Xiaohui Liang, John A. Batsis, Robert M. Roth: Domain-aware Intermediate Pretraining for Dementia Detection with Limited Data. INTERSPEECH 2022
Xueshuai Zhang, Jiakun Shen, Jun Zhou 0024, Pengyuan Zhang, Yonghong Yan 0002, Zhihua Huang, Yanfen Tang, Yu Wang, Fujie Zhang, Shaoxing Zhang, Aijun Sun: Robust Cough Feature Extraction and Classification Method for COVID-19 Cough Detection Based on Vocalization Characteristics. INTERSPEECH 2022
Gasper Begus, Alan Zhou: Modeling speech recognition and synthesis simultaneously: Encoding and decoding lexical and sublexical semantic information into speech with no direct access to speech data. INTERSPEECH 2022
Jiatong Shi, Shuai Guo, Tao Qian, Tomoki Hayashi, Yuning Wu, Fangzheng Xu, Xuankai Chang, Huazhe Li, Peter Wu, Shinji Watanabe 0001, Qin Jin: Muskits: an End-to-end Music Processing Toolkit for Singing Voice Synthesis. INTERSPEECH 2022
Mao-Kui He, Jun Du, Chin-Hui Lee 0001: End-to-End Audio-Visual Neural Speaker Diarization. INTERSPEECH 2022
Ju-ho Kim, Jungwoo Heo, Hye-jin Shim, Ha-Jin Yu: Extended U-Net for Speaker Verification in Noisy Environments. INTERSPEECH 2022
Aaqib Saeed: Binary Early-Exit Network for Adaptive Inference on Low-Resource Devices. INTERSPEECH 2022
Mu Yang, Kevin Hirschi, Stephen Daniel Looney, Okim Kang, John H. L. Hansen: Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment. INTERSPEECH 2022
Yuanbo Hou, Zhaoyi Liu, Bo Kang, Yun Wang, Dick Botteldooren: CT-SAT: Contextual Transformer for Sequential Audio Tagging. INTERSPEECH 2022
Sondes Abderrazek, Corinne Fredouille, Alain Ghio, Muriel Lalain, Christine Meunier, Virginie Woisard: Validation of the Neuro-Concept Detector framework for the characterization of speech disorders: A comparative study including Dysarthria and Dysphonia. INTERSPEECH 2022
Vineet Garg, Ognjen Rudovic, Pranay Dighe, Ahmed Hussen Abdelaziz, Erik Marchi, Saurabh Adya, Chandra Dhir, Ahmed H. Tewfik: Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models. INTERSPEECH 2022
Daniel Fernau, Stefan Hillmann, Nils Feldhus, Tim Polzehl: Towards Automated Dialog Personalization using MBTI Personality Indicators. INTERSPEECH 2022
Xingming Wang, Xiaoyi Qin, Yikang Wang, Yunfei Xu, Ming Li 0026: The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification Challenge. INTERSPEECH 2022
Francesco Nespoli, Daniel Barreda, Patrick A. Naylor: Relative Acoustic Features for Distance Estimation in Smart-Homes. INTERSPEECH 2022
Kohei Saijo, Tetsuji Ogawa: Unsupervised Training of Sequential Neural Beamformer Using Coarsely-separated and Non-separated Signals. INTERSPEECH 2022
Zhaoyan Zhang, Jason Zhang, Jody Kreiman: Effects of laryngeal manipulations on voice gender perception. INTERSPEECH 2022
Yi-Chang Chen, Yu-Chuan Steven, Yen-Cheng Chang, Yi-Ren Yeh: g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin. INTERSPEECH 2022
Shaohuan Zhou, Shun Lei, Weiya You, Deyi Tuo, Yuren You, Zhiyong Wu 0001, Shiyin Kang, Helen Meng: Towards Improving the Expressiveness of Singing Voice Synthesis with BERT Derived Semantic Information. INTERSPEECH 2022
Jovan M. Dalhouse, Katunobu Itou: Cross-Lingual Transfer Learning Approach to Phoneme Error Detection via Latent Phonetic Representation. INTERSPEECH 2022
Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Qibing Bai, Yu Zhang 0006: Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation. INTERSPEECH 2022
Amir Shirian, Krishna Somandepalli, Victor Sanchez, Tanaya Guha: Visually-aware Acoustic Event Detection using Heterogeneous Graphs. INTERSPEECH 2022
Rui Wang 0073, Qibing Bai, Junyi Ao, Long Zhou, Zhixiang Xiong, Zhihua Wei, Yu Zhang 0006, Tom Ko, Haizhou Li 0001: LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT. INTERSPEECH 2022
Ronglai Zuo, Brian Mak: Local Context-aware Self-attention for Continuous Sign Language Recognition. INTERSPEECH 2022
Vinicius Ribeiro, Yves Laprie: Autoencoder-Based Tongue Shape Estimation During Continuous Speech. INTERSPEECH 2022
Wen-Chin Huang, Erica Cooper, Yu Tsao 0001, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi: The VoiceMOS Challenge 2022. INTERSPEECH 2022
Séverine Guillaume, Guillaume Wisniewski, Benjamin Galliot, Minh Chau Nguyen, Maxime Fily, Guillaume Jacques, Alexis Michaud: Plugging a neural phoneme recognizer into a simple language model: a workflow for low-resource setting. INTERSPEECH 2022
Hokuto Munakata, Ryu Takeda, Kazunori Komatani: Training Data Generation with DOA-based Selecting and Remixing for Unsupervised Training of Deep Separation Models. INTERSPEECH 2022
Kai-Wei Chang, Wei-Cheng Tseng, Shang-Wen Li 0001, Hung-yi Lee: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks. INTERSPEECH 2022
Tuan Vu Ho, Maori Kobayashi, Masato Akagi: Speak Like a Professional: Increasing Speech Intelligibility by Mimicking Professional Announcer Voice with Voice Conversion. INTERSPEECH 2022
Woo Hyun Kang, Md. Jahangir Alam, Abderrahim Fathan: End-to-end framework for spoof-aware speaker verification. INTERSPEECH 2022
Daiki Yoshioka, Yusuke Yasuda, Noriyuki Matsunaga, Yamato Ohtani, Tomoki Toda: Spoken-Text-Style Transfer with Conditional Variational Autoencoder and Content Word Storage. INTERSPEECH 2022
Ruchao Fan, Abeer Alwan: DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASR. INTERSPEECH 2022
Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani, Shefali Garg, Rajiv Mathews, Khe Chai Sim, Kilol Gupta, Mingqing Chen, Lara McConnaughey: UserLibri: A Dataset for ASR Personalization Using Only Text. INTERSPEECH 2022
Yuxiang Zhang, Zhuo Li, Wenchao Wang, Pengyuan Zhang: SASV Based on Pre-trained ASV System and Integrated Scoring Module. INTERSPEECH 2022
Mark Gibson, Marcel Schlechtweg, Beatriz Blecua Falgueras, Judit Ayala Alcalde: Language-specific interactions of vowel discrimination in noise. INTERSPEECH 2022
Amrit Romana, Minxue Niu, Matthew Perez, Angela Roberts, Emily Mower Provost: Enabling Off-the-Shelf Disfluency Detection and Categorization for Pathological Speech. INTERSPEECH 2022
Gary Wang, Andrew Rosenberg, Bhuvana Ramabhadran, Fadi Biadsy, Jesse Emond, Yinghui Huang, Pedro J. Moreno 0001: Non-Parallel Voice Conversion for ASR Augmentation. INTERSPEECH 2022
Ryan M. Corey, Manan Mittal, Kanad Sarkar, Andrew C. Singer: Cooperative Speech Separation With a Microphone Array and Asynchronous Wearable Devices. INTERSPEECH 2022
Léane Salais, Pablo Arias 0003, Clément Le Moine, Victor Rosi, Yann Teytaut, Nicolas Obin, Axel Roebel: Production Strategies of Vocal Attitudes. INTERSPEECH 2022
Shaojin Ding, Rajeev Rikhye, Qiao Liang 0001, Yanzhang He, Quan Wang, Arun Narayanan, Tom O'Malley, Ian McGraw: Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition. INTERSPEECH 2022
Yuansheng Guan, Guochen Yu, Andong Li, Chengshi Zheng, Jie Wang: TMGAN-PLC: Audio Packet Loss Concealment using Temporal Memory Generative Adversarial Network. INTERSPEECH 2022
Xinmeng Xu, Yang Wang, Jie Jia, Binbin Chen 0006, Jianjun Hao: GLD-Net: Improving Monaural Speech Enhancement by Learning Global and Local Dependency Features with GLD Block. INTERSPEECH 2022
Hemant Yadav, Akshat Gupta, Sai Krishna Rallabandi, Alan W. Black, Rajiv Ratn Shah: Intent classification using pre-trained language agnostic embeddings for low resource languages. INTERSPEECH 2022
Kavan Fatehi, Mercedes Torres Torres, Ayse Küçükyilmaz: ScoutWav: Two-Step Fine-Tuning on Self-Supervised Automatic Speech Recognition for Low-Resource Environments. INTERSPEECH 2022
K. V. Vijay Girish, Srikanth Konjeti, Jithendra Vepa: Interpretabilty of Speech Emotion Recognition modelled using Self-Supervised Speech and Text Pre-Trained Embeddings. INTERSPEECH 2022
Hyunjae Cho, Wonbin Jung, Junhyeok Lee, Sang Hoon Woo: SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech. INTERSPEECH 2022
Christin Kirchhübel, Georgina Brown: Spoofed speech from the perspective of a forensic phonetician. INTERSPEECH 2022
Feifei Xiong, Pengyu Wang, Zhongfu Ye, Jinwei Feng: Joint Estimation of Direction-of-Arrival and Distance for Arrays with Directional Sensors based on Sparse Bayesian Learning. INTERSPEECH 2022
Shaojin Ding, Weiran Wang, Ding Zhao, Tara N. Sainath, Yanzhang He, Robert David, Rami Botros, Xin Wang 0016, Rina Panigrahy, Qiao Liang 0001, Dongseong Hwang, Ian McGraw, Rohit Prabhavalkar, Trevor Strohman: A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes. INTERSPEECH 2022
Linjun Cai, Yuhong Yang 0001, Xufeng Chen, Weiping Tu, Hongyang Chen: CS-CTCSCONV1D: Small footprint speaker verification with channel split time-channel-time separable 1-dimensional convolution. INTERSPEECH 2022
Yuhang He, Andrew Markham: SoundDoA: Learn Sound Source Direction of Arrival and Semantics from Sound Raw Waveforms. INTERSPEECH 2022
Anoop Kumar, Pankaj Kumar Sharma, Aravind Illa, Sriram Venkatapathy, Subhrangshu Nandi, Pritam Varma, Anurag Dwarakanath, Aram Galstyan: Learning Under Label Noise for Robust Spoken Language Understanding systems. INTERSPEECH 2022
Anwesha Roy, Varun Belagali, Prasanta Kumar Ghosh: Air tissue boundary segmentation using regional loss in real-time Magnetic Resonance Imaging video for speech production. INTERSPEECH 2022
Zihan Zhao, Yanfeng Wang, Yu Wang 0027: Multi-level Fusion of Wav2vec 2.0 and BERT for Multimodal Emotion Recognition. INTERSPEECH 2022
Kai Li 0018, Sheng Li 0010, Xugang Lu, Masato Akagi, Meng Liu, Lin Zhang, Chang Zeng, Longbiao Wang, Jianwu Dang 0001, Masashi Unoki: Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection. INTERSPEECH 2022
Yixuan Zhou 0002, Changhe Song, Xiang Li 0105, Luwen Zhang, Zhiyong Wu 0001, Yanyao Bian, Dan Su 0002, Helen Meng: Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis. INTERSPEECH 2022
Katrina Kechun Li, Julia Schwarz, Jasper Hong Sim, Yixin Zhang, Elizabeth Buchanan-Worster, Brechtje Post, Kirsty McDougall: Recording and timing vocal responses in online experimentation. INTERSPEECH 2022
Yang Liu 0262, Haoqin Sun, Wenbo Guan, Yuqi Xia, Zhen Zhao 0006: Discriminative Feature Representation Based on Cascaded Attention Network with Adversarial Joint Loss for Speech Emotion Recognition. INTERSPEECH 2022
Displaying results 1-100 of 18782 (100 per page).
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
Open data: released under the ODC-BY 1.0 license.