|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
No Growbag Graphs found.
|
|
|
Results
Found 18782 publication records. Showing 18782 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | Hae-Sung Jeon, Stephen Nichols |
Investigating Prosodic Variation in British English Varieties using ProPer. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Efthymios Tzinis, Gordon Wichern, Aswin Shanmugam Subramanian, Paris Smaragdis, Jonathan Le Roux |
Heterogeneous Target Speech Separation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Pengwei Wang 0005, Yinpei Su, Xiaohuan Zhou, Xin Ye, Liangchen Wei, Ming Liu, Yuan You, Feijun Jiang |
Speech2Slot: A Limited Generation Framework with Boundary Detection for Slot Filling from Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nicolas M. Müller, Pavel Czempin, Franziska Dieckmann, Adam Froghyar, Konstantin Böttinger |
Does Audio Deepfake Detection Generalize? |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol |
KSC2: An Industrial-Scale Open-Source Kazakh Speech Corpus. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ryandhimas Edo Zezario, Fei Chen 0011, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao 0001 |
MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ariadna Sánchez, Alessio Falai, Ziyao Zhang, Orazio Angelini, Kayoko Yanagisawa |
Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS). |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shijun Wang, Hamed Hemati, Jón Guðnason, Damian Borth |
Generative Data Augmentation Guided by Triplet Loss for Speech Emotion Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mun-Hak Lee, Joon-Hyuk Chang, Sang-Eon Lee, Ju-Seok Seong, Chanhee Park, Haeyoung Kwon |
Regularizing Transformer-based Acoustic Models by Penalizing Attention Weights. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chin-Yueh Chien, Kuan-Yu Chen |
A BERT-based Language Modeling Framework. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Konrad Zielinski, Marek Grzelec, Martin Hagmüller |
Humanizing bionic voice: interactive demonstration of aesthetic design and control factors influencing the devices assembly and waveshape engineering. |
INTERSPEECH |
2022 |
DBLP BibTeX RDF |
|
1 | Simon Welker, Julius Richter, Timo Gerkmann |
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Andong Li, Guochen Yu, Chengshi Zheng, Xiaodong Li 0002 |
TaylorBeamformer: Learning All-Neural Beamformer for Multi-Channel Speech Enhancement from Taylor's Approximation Theory. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mufan Sang, John H. L. Hansen |
Multi-Frequency Information Enhanced Channel Attention Module for Speaker Representation Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kunnar Kukk, Tanel Alumäe |
Improving Language Identification of Accented Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xiaoquan Ke, Man-Wai Mak, Helen M. Meng |
Automatic Selection of Discriminative Features for Dementia Detection in Cantonese-Speaking People. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yangyang Ou, Peng Zhang 0002, Jing Zhang, Hui Gao, Xing Ma |
Incorporating Dual-Aware with Hierarchical Interactive Memory Networks for Task-Oriented Dialogue. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhanheng Yang, Sining Sun, Jin Li, Xiaoming Zhang, Xiong Wang, Long Ma, Lei Xie 0001 |
CaTT-KWS: A Multi-stage Customized Keyword Spotting Framework based on Cascaded Transducer-Transformer. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Amber Afshan, Abeer Alwan |
Attention-based conditioning methods using variable frame rate for style-robust speaker verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ramit Sawhney, Megh Thakkar, Vishwa Shah, Puneet Mathur, Vasu Sharma, Dinesh Manocha |
PISA: PoIncaré Saliency-Aware Interpolative Augmentation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Martin Flechl, Shou-Chun Yin, Junho Park, Peter Skala |
End-to-end speech recognition modeling from de-identified data. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Oralie Cattan, Sahar Ghannay, Christophe Servan, Sophie Rosset |
Benchmarking Transformers-based models on French Spoken Language Understanding tasks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Bing Han, Zhengyang Chen, Yanmin Qian |
Self-Supervised Speaker Verification Using Dynamic Loss-Gate and Label Correction. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Running Zhao, Jiangtao Yu, Tingle Li, Hang Zhao, Edith C. H. Ngai |
Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda |
Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Pengqi Li, Lantian Li, Askar Hamdulla, Dong Wang 0013 |
Reliable Visualization for Deep Speaker Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Katharine Patterson, Kevin W. Wilson, Scott Wisdom, John R. Hershey |
Distance-Based Sound Separation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuanbo Hou, Dick Botteldooren |
Event-related data conditioning for acoustic event classification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mai Hoang Dao, Thinh Hung Truong, Dat Quoc Nguyen |
From Disfluency Detection to Intent Detection and Slot Filling. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sishi Liao, Phil Hoole, Conceição Cunha, Esther Kunay, Aletheia Cui, Lia Saki Bucar Shigemori, Felicitas Kleber, Dirk Voit, Jens Frahm, Jonathan Harrington |
Nasal Coda Loss in the Chengdu Dialect of Mandarin: Evidence from RT-MRI. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Alexander Johnson, Kevin Everson, Vijay Ravi, Anissa Gladney, Mari Ostendorf, Abeer Alwan |
Automatic Dialect Density Estimation for African American English. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kun Chen, Jun Wang 0077, Feng Deng, Xiaorui Wang |
iCNN-Transformer: An improved CNN-Transformer with Channel-spatial Attention and Keyword Prediction for Automated Audio Captioning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Haohe Liu, Xubo Liu, Qiuqiang Kong, Qiao Tian, Yan Zhao 0010, DeLiang Wang, Chuanzeng Huang, Yuxuan Wang 0002 |
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nadee Seneviratne, Carol Y. Espy-Wilson |
Multimodal Depression Severity Score Prediction Using Articulatory Coordination Features and Hierarchical Attention Based Text Embeddings. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kazuma Iwamoto, Tsubasa Ochiai, Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki, Shigeru Katagiri |
How bad are artifacts?: Analyzing the impact of speech enhancement errors on ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Junhao Xu, Shoukang Hu, Xunying Liu, Helen Meng |
Towards Green ASR: Lossless 4-bit Quantization of a Hybrid TDNN System on the 300-hr Swithboard Corpus. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Adria Mallol-Ragolta, Helena Cuesta, Emilia Gómez, Björn W. Schuller |
Multi-Type Outer Product-Based Fusion of Respiratory Sounds for Detecting COVID-19. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Debasish Ray Mohapatra, Mario Fleischer, Victor Zappi, Peter Birkholz, Sidney S. Fels |
Three-dimensional finite-difference time-domain acoustic analysis of simplified vocal tract shapes. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mao Saeki, Kotoka Miyagi, Shinya Fujie, Shungo Suzuki, Tetsuji Ogawa, Tetsunori Kobayashi, Yoichi Matsuyama |
Confusion Detection for Adaptive Conversational Strategies of An Oral Proficiency Assessment Interview Agent. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Gerasimos Chatzoudis, Manos Plitsis, Spyridoula Stamouli, Athanasia-Lida Dimou, Nassos Katsamanis, Vassilis Katsouros |
Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Qijie Shao, Jinghao Yan, Jian Kang 0006, Pengcheng Guo, Xian Shi, Pengfei Hu 0004, Lei Xie 0001 |
Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Takanori Ashihara, Takafumi Moriya, Kohei Matsuura, Tomohiro Tanaka |
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Fan Yu, Zhihao Du, Shiliang Zhang, Yuxiao Lin, Lei Xie 0001 |
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Weiran Wang, Ke Hu, Tara N. Sainath |
Streaming Align-Refine for Non-autoregressive Deliberation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chenggang Zhang, Jinjiang Liu, Xueliang Zhang 0001 |
LCSM: A Lightweight Complex Spectral Mapping Framework for Stereophonic Acoustic Echo Cancellation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Bowen Shi, Wei-Ning Hsu, Abdelrahman Mohamed |
Robust Self-Supervised Audio-Visual Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mohan Li, Rama Sanand Doddipatla, Catalin Zorila |
Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuma Koizumi, Shigeki Karita, Arun Narayanan, Sankaran Panchapagesan, Michiel Bacchiani |
SNRi Target Training for Joint Speech Enhancement and Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Quentin Meeus, Marie-Francine Moens, Hugo Van hamme |
Multitask Learning for Low Resource Spoken Language Understanding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Andrea Alicehajic, Silke Hamann |
The discrimination of [zi]-[dʑi] by Japanese listeners and the prospective phonologization of /zi/. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Gregory Ciccarelli, Jarred Barber, Arun Nair, Israel Cohen, Tao Zhang |
Challenges and Opportunities in Multi-device Speech Processing. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuta Ide, Susumu Saito, Teppei Nakano, Tetsuji Ogawa |
Can Humans Correct Errors From System? Investigating Error Tendencies in Speaker Identification Using Crowdsourcing. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Arnon Turetzky, Tzvi Michelson, Yossi Adi, Shmuel Peleg |
Deep Audio Waveform Prior. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Miseul Kim, Zhenyu Piao, Se-Yun Um, Ran Lee, Jaemin Joh, Seungshin Lee, Hong-Goo Kang |
Light-Weight Speaker Verification with Global Context Information. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tomohiro Tanaka, Ryo Masumura, Hiroshi Sato, Mana Ihori, Kohei Matsuura, Takanori Ashihara, Takafumi Moriya |
Domain Adversarial Self-Supervised Speech Representation Learning for Improving Unknown Domain Downstream Tasks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Takaaki Saeki, Detai Xin, Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Hiroshi Saruwatari |
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Debarpan Bhattacharya, Debottam Dutta, Neeraj Kumar Sharma 0001, Srikanth Raj Chetupalli, Pravin Mote, Sriram Ganapathy, Chandrakiran C, Sahiti Nori, Suhail K. K, Sadhana Gonuguntla, Murali Alagesan |
Analyzing the impact of SARS-CoV-2 variants on respiratory sound signals. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Raymond Chung, Brian Mak |
Synthesizing Near Native-accented Speech for a Non-native Speaker by Imitating the Pronunciation and Prosody of a Native Speaker. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jin Woo Lee, Eungbeom Kim, Junghyun Koo, Kyogu Lee |
Representation Selective Self-distillation and wav2vec 2.0 Feature Exploration for Spoof-aware Speaker Verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Perry Lam, Huayun Zhang, Nancy F. Chen, Berrak Sisman |
EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Takuya Kunihara, Chuanbo Zhu 0001, Nobuaki Minematsu, Noriko Nakanishi |
Gradual Improvements Observed in Learners' Perception and Production of L2 Sounds Through Continuing Shadowing Practices on a Daily Basis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Aleksandr Laptev, Somshubra Majumdar, Boris Ginsburg |
CTC Variations Through New WFST Topologies. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhiyuan Zhao, Chuanxin Tang, Chengdong Yao, Chong Luo |
An Anchor-Free Detector for Continuous Speech Keyword Spotting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Guan-Ting Lin, Shang-Wen Li 0001, Hung-yi Lee |
Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Valentin Pelloin, Franck Dary, Nicolas Hervé, Benoît Favre, Nathalie Camelin, Antoine Laurent, Laurent Besacier |
ASR-Generated Text for Language Model Pre-training Applied to Speech Tasks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Constantijn Kaland |
Bending the string: intonation contour length as a correlate of macro-rhythm. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zixia Fan, Jing Shao, Weigong Pan, Min Xu, Lan Wang |
The effect of backward noise on lexical tone discrimination in Mandarin-speaking amusics. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yukun Liu, Ta Li, Pengyuan Zhang, Yonghong Yan 0002 |
NAS-SCAE: Searching Compact Attention-based Encoders For End-to-end Automatic Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tarun Gupta, Duc-Tuan Truong, Tran The Anh, Eng Siong Chng |
Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mirek Novak, Pavlos Papadopoulos |
RNN-T lattice enhancement by grafting of pruned paths. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yookyung Shin, Younggun Lee, Suhee Jo, Yeongtae Hwang, Taesu Kim |
Text-driven Emotional Style Control and Cross-speaker Style Transfer in Neural TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hao Shi, Longbiao Wang, Sheng Li 0010, Jianwu Dang 0001, Tatsuya Kawahara |
Monaural Speech Enhancement Based on Spectrogram Decomposition for Convolutional Neural Network-sensitive Feature Extraction. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Erik Ekstedt, Gabriel Skantze |
Voice Activity Projection: Self-supervised Learning of Turn-taking Events. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Pierre Champion, Anthony Larcher, Denis Jouvet |
Are disentangled representations all you need to build speaker anonymization systems? |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sung-Lin Yeh, Hao Tang 0002 |
Autoregressive Co-Training for Learning Discrete Speech Representation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Or Tal, Moshe Mandel, Felix Kreuk, Yossi Adi |
A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhuangqi Chen, Pingjian Zhang |
Lightweight Full-band and Sub-band Fusion Network for Real Time Speech Enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Fadi Biadsy, Youzheng Chen, Xia Zhang, Oleg Rybakov, Andrew Rosenberg, Pedro J. Moreno 0001 |
A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Vipula Dissanayake, Sachith Seneviratne, Hussel Suriyaarachchi, Elliott Wen, Suranga Nanayakkara |
Self-supervised Representation Fusion for Speech and Wearable Based Emotion Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Naoyuki Kanda, Jian Wu 0027, Yu Wu 0012, Xiong Xiao, Zhong Meng, Xiaofei Wang 0009, Yashesh Gaur, Zhuo Chen 0006, Jinyu Li 0001, Takuya Yoshioka |
Streaming Multi-Talker ASR with Token-Level Serialized Output Training. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yu-Lin Huang, Bo-Hao Su, Y.-W. Peter Hong, Chi-Chun Lee |
An Attention-Based Method for Guiding Attribute-Aligned Speech Representation Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Daxin Tan, Guangyan Zhang, Tan Lee |
Environment Aware Text-to-Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza |
Dataset Pruning for Resource-constrained Spoofed Audio Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Junyi Ao, Ziqiang Zhang, Long Zhou, Shujie Liu 0001, Haizhou Li 0001, Tom Ko, Lirong Dai 0001, Jinyu Li 0001, Yao Qian, Furu Wei |
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xiaodong Cui, George Saon, Tohru Nagano, Masayuki Suzuki, Takashi Fukuda, Brian Kingsbury, Gakuto Kurata |
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Junteng Jia, Jay Mahadeokar, Weiyi Zheng, Yuan Shangguan, Ozlem Kalinli, Frank Seide |
Federated Domain Adaptation for ASR with Full Self-Supervision. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Waris Quamer, Anurag Das, John Levis, Evgeny Chukharev-Hudilainen, Ricardo Gutierrez-Osuna |
Zero-Shot Foreign Accent Conversion without a Native Reference. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuki Saito, Yuto Nishimura, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari |
STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Amir Ivry, Israel Cohen, Baruch Berdugo |
Objective Metrics to Evaluate Residual-Echo Suppression During Double-Talk in the Stereophonic Case. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jash Rathod, Nauman Dawalatabad, Shatrughan Singh, Dhananjaya Gowda |
Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chao Wang, Zhonghao Li, Benlai Tang, Xiang Yin 0006, Yuan Wan, Yibiao Yu, Zejun Ma |
Towards high-fidelity singing voice conversion with acoustic reference and contrastive predictive coding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Saket Dingliwal, Ashish Shenoy, Sravan Bodapati, Ankur Gandhe, Ravi Teja Gadde, Katrin Kirchhoff |
Domain Prompts: Towards memory and compute efficient domain adaptation of ASR systems. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Fareeha S. Rana, Daniel Pape, Elisabet Service |
The effect of increasing acoustic and linguistic complexity on auditory processing: an EEG study. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Rachid Ridouane, Philipp Buech |
Complex sounds and cross-language influence: The case of ejectives in Omani Mehri. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jin Li, Rongfeng Su, Xurong Xie, Lan Wang, Nan Yan |
A Multi-level Acoustic Feature Extraction Framework for Transformer Based End-to-End Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yi Meng, Xiang Li 0105, Zhiyong Wu 0001, Tingtian Li, Zixun Sun, Xinyu Xiao, Chi Sun, Hui Zhan, Helen Meng |
CALM: Constrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Huang-Cheng Chou, Chi-Chun Lee, Carlos Busso |
Exploiting Co-occurrence Frequency of Emotions in Perceptual Evaluations To Train A Speech Emotion Classifier. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Einari Vaaras, Manu Airaksinen, Okko Räsänen |
Analysis of Self-Supervised Learning and Dimensionality Reduction Methods in Clustering-Based Active Learning for Speech Emotion Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Andrew Hard, Kurt Partridge, Neng Chen, Sean Augenstein, Aishanee Shah, Hyun Jin Park, Alex Park 0001, Sara Ng, Jessica Nguyen, Ignacio López-Moreno, Rajiv Mathews, Françoise Beaufays |
Production federated keyword spotting via distillation, filtering, and joint federated-centralized training. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Katariina Martikainen, Jussi Karlgren, Khiet Truong |
Exploring audio-based stylistic variation in podcasts. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
Displaying result #401 - #500 of 18782 (100 per page; Change: ) Pages: [ <<][ 1][ 2][ 3][ 4][ 5][ 6][ 7][ 8][ 9][ 10][ 11][ 12][ 13][ 14][ >>] |
|