Results
Found 18782 publication records. Showing 18782 according to the selection in the facets.
Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | Bei Liu, Zhengyang Chen, Yanmin Qian |
Dual Path Embedding Learning for Speaker Verification with Triplet Attention. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Waseem Gharbieh, Jinmiao Huang, Qianhui Wan, Han Suk Shim, Hyun Chul Lee |
DyConvMixer: Dynamic Convolution Mixer Architecture for Open-Vocabulary Keyword Spotting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sunmook Choi, Il-Youp Kwak, Seungsang Oh |
Overlapped Frequency-Distributed Network: Frequency-Aware Voice Spoofing Countermeasure. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ambika Kirkland, Harm Lameris, Éva Székely, Joakim Gustafson |
Where's the uh, hesitation? The interplay between filled pause location, speech rate and fundamental frequency in perception of confidence. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Andreas Triantafyllopoulos, Johannes Wagner 0001, Hagen Wierstorf, Maximilian Schmitt, Uwe Reichel, Florian Eyben, Felix Burkhardt, Björn W. Schuller |
Probing speech emotion recognition transformers for linguistic knowledge. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yosi Shrem, Felix Kreuk, Joseph Keshet |
Formant Estimation and Tracking using Probabilistic Heat-Maps. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hexin Liu, Leibny Paola García-Perera, Andy W. H. Khong, Suzy J. Styles, Sanjeev Khudanpur |
PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jie Wang, Yuji Liu, Binling Wang, Yiming Zhi, Song Li, Shipeng Xia, Jiayang Zhang, Feng Tong, Lin Li 0032, Qingyang Hong |
Spatial-aware Speaker Diarization for Multi-channel Multi-party Meeting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Andrei Bîrladeanu, Helen Minnis, Alessandro Vinciarelli |
Automatic Detection of Reactive Attachment Disorder Through Turn-Taking Analysis in Clinical Child-Caregiver Sessions. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jian Luo, Jianzong Wang, Ning Cheng 0001, Edward Xiao, Xulong Zhang 0001, Jing Xiao 0006 |
Tiny-Sepformer: A Tiny Time-Domain Transformer Network For Speech Separation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Alexis Conneau, Ankur Bapna, Yu Zhang 0033, Min Ma, Patrick von Platen, Anton Lozhkov, Colin Cherry, Ye Jia, Clara Rivera, Mihir Kale, Daan van Esch, Vera Axelrod, Simran Khanuja, Jonathan H. Clark, Orhan Firat, Michael Auli, Sebastian Ruder, Jason Riesa, Melvin Johnson |
XTREME-S: Evaluating Cross-lingual Speech Representations. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhenglin Zhang, Lizhuang Yang, Xun Wang, Hai Li 0006 |
Automated Detection of Wilson's Disease Based on Improved Mel-frequency Cepstral Coefficients with Signal Decomposition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuhan Li, Ying Shen 0005, Dongqing Wang, Lin Zhang 0014 |
SiD-WaveFlow: A Low-Resource Vocoder Independent of Prior Knowledge. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jaesung Tae, Hyeongju Kim, Taesu Kim |
EdiTTS: Score-based Editing for Controllable Text-to-Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Miao Liu, Jing Wang 0037, Liang Xu, Jianqian Zhang, Shicong Li, Fei Xiang |
BIT-MI Deep Learning-based Model to Non-intrusive Speech Quality Assessment Challenge in Online Conferencing Applications. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Karl El Hajal, Milos Cernak, Pablo Mainar |
MOSRA: Joint Mean Opinion Score and Room Acoustics Speech Quality Assessment. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Johannah O'Mahony, Catherine Lai, Simon King 0001 |
Combining conversational speech with read speech to improve prosody in Text-to-Speech synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kaitao Song, Teng Wan, Bixia Wang, Huiqiang Jiang, Luna Qiu, Jiahang Xu, Liping Jiang, Qun Lou, Yuqing Yang 0001, Dongsheng Li 0002, Xudong Wang, Lili Qiu |
Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhiyun Fan, Zhenlin Liang, Linhao Dong, Yi Liu, Shiyu Zhou, Meng Cai, Jun Zhang 0066, Zejun Ma, Bo Xu |
Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Takafumi Moriya, Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Ryo Masumura |
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Daiki Takeuchi, Yasunori Ohishi, Daisuke Niizumi, Noboru Harada, Kunio Kashino |
Introducing Auxiliary Text Query-modifier to Content-based Audio Retrieval. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Phani Sankar Nidadavolu, Na Xu, Nick Jutila, Ravi Teja Gadde, Aswarth Abhilash Dara, Joseph Savold, Sapan Patel, Aaron Hoff, Veerdhawal Pande, Kevin Crews, Ankur Gandhe, Ariya Rastrow, Roland Maas |
RefTextLAS: Reference Text Biased Listen, Attend, and Spell Model For Accurate Reading Evaluation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hung-Shin Lee, Pin-Tuan Huang, Yao-Fei Cheng, Hsin-Min Wang |
Chain-based Discriminative Autoencoders for Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Leying Zhang, Zhengyang Chen, Yanmin Qian |
Enroll-Aware Attentive Statistics Pooling for Target Speaker Verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jungwoo Heo, Ju-Ho Kim, Hyun-seo Shin |
Two Methods for Spoofing-Aware Speaker Verification: Multi-Layer Perceptron Score Fusion Model and Integrated Embedding Projector. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chang Liu, Zhen-Hua Ling, Ling-Hui Chen |
Pronunciation Dictionary-Free Multilingual Speech Synthesis by Combining Unsupervised and Supervised Phonetic Representations. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nian Shao, Erfan Loweimi, Xiaofei Li |
RCT: Random consistency training for semi-supervised sound event detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ying Hu, Xiujuan Zhu, Yunlong Li, Hao Huang, Liang He 0003 |
A Multi-grained based Attention Network for Semi-supervised Sound Event Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jovan Eranovic, Daniel Pape, Magda Stroinska, Elisabet Service, Marijana Matkovski |
Effects of Noise on Speech Perception and Spoken Word Comprehension. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ivan Shchekotov, Pavel K. Andreev, Oleg Ivanov, Aibek Alanov, Dmitry P. Vetrov |
FFC-SE: Fast Fourier Convolution for Speech Enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ye Bai, Jie Li, Wenjing Han, Hao Ni, Kaituo Xu, Zhuo Zhang, Cheng Yi, Xiaorui Wang |
Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jinmiao Huang, Waseem Gharbieh, Qianhui Wan, Han Suk Shim, Hyun Chul Lee |
QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Luke Prananta, Bence Mark Halpern, Siyuan Feng 0001, Odette Scharenborg |
The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xiaofei Wang 0009, Dongmei Wang, Naoyuki Kanda, Sefik Emre Eskimez, Takuya Yoshioka |
Leveraging Real Conversational Data for Multi-Channel Continuous Speech Separation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Michael Kuhlmann, Fritz Seebauer, Janek Ebbers, Petra Wagner, Reinhold Haeb-Umbach |
Investigation into Target Speaking Rate Adaptation for Voice Conversion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yifan Chen, Yifan Guo, Qingxuan Li, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan 0002 |
Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yu Suzuki, Tsuneo Kato, Akihiro Tamura |
Automatic Prosody Evaluation of L2 English Read Speech in Reference to Accent Dictionary with Transformer Encoder. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ji Sub Um, Yeunju Choi, Hoi Rin Kim |
ACNN-VC: Utilizing Adaptive Convolution Neural Network for One-Shot Voice Conversion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chenpeng Du, Yiwei Guo, Xie Chen 0001, Kai Yu 0004 |
VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zeyuan Wei, Li Hao, Xueliang Zhang |
Model Compression by Iterative Pruning with Knowledge Distillation and Its Application to Speech Enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yael Segal, Kasia Hitczenko, Matthew Goldrick 0001, Adam Buchwald, Angela Roberts, Joseph Keshet |
DDKtor: Automatic Diadochokinetic Speech Analysis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jigang Ren, Qirong Mao |
DCTCN: Deep Complex Temporal Convolutional Network for Long Time Speech Enhancement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shun Lei, Yixuan Zhou 0002, Liyang Chen, Jiankun Hu, Zhiyong Wu 0001, Shiyin Kang, Helen Meng |
Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yi Lei, Shan Yang, Jian Cong, Lei Xie 0001, Dan Su 0002 |
Glow-WaveGAN 2: High-quality Zero-shot Text-to-speech Synthesis and Any-to-any Voice Conversion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Weiqiao Zheng, Ping Yang, Rongfeng Lai, Kongyang Zhu, Tao Zhang, Junpeng Zhang, Hongcheng Fu |
Exploring Multi-task Learning Based Gender Recognition and Age Estimation for Class-imbalanced Data. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Fan-Lin Wang, Hung-Shin Lee, Yu Tsao 0001, Hsin-Min Wang |
Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Bo Li 0028, Tara N. Sainath, Ruoming Pang, Shuo-Yiin Chang, Qiumin Xu, Trevor Strohman, Vince Chen, Qiao Liang 0001, Heguang Liu, Yanzhang He, Parisa Haghani, Sameer Bidichandani |
A Language Agnostic Multilingual Streaming On-Device ASR System. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yicheng Du, Aditya Arie Nugraha, Kouhei Sekiguchi, Yoshiaki Bando, Mathieu Fontaine 0002, Kazuyoshi Yoshii |
Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Long Mai, Julie Carson-Berndsen |
Unsupervised domain adaptation for speech recognition with unsupervised error correction. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ryandhimas Edo Zezario, Szu-Wei Fu, Fei Chen 0011, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao 0001 |
MTI-Net: A Multi-Target Speech Intelligibility Prediction Model. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ye-Qian Du, Jie Zhang 0042, Qiu-Shi Zhu, Lirong Dai 0001, Ming-Hui Wu, Xin Fang, Zhou-Wang Yang |
A Complementary Joint Training Approach Using Unpaired Speech and Text. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mathilde Hutin, Martine Adda-Decker, Lori Lamel, Ioana Vasilescu |
When Phonetics Meets Morphology: Intervocalic Voicing Within and Across Words in Romance Languages. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Binbin Zhang, Di Wu 0061, Zhendong Peng, Xingchen Song, Zhuoyuan Yao, Hang Lv 0001, Lei Xie 0001, Chao Yang 0031, Fuping Pan, Jianwei Niu 0002 |
WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jean-Marie Lemercier, Joachim Thiemann, Raphael Koning, Timo Gerkmann |
Neural Network-augmented Kalman Filtering for Robust Online Speech Dereverberation in Noisy Reverberant Environments. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hang-Rui Hu, Yan Song 0001, Li-Rong Dai 0001, Ian McLoughlin 0001, Lin Liu 0017 |
Class-Aware Distribution Alignment based Unsupervised Domain Adaptation for Speaker Verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Navin Raj Prabhu, Guillaume Carbajal, Nale Lehmann-Willenbrock, Timo Gerkmann |
End-To-End Label Uncertainty Modeling for Speech-based Arousal Recognition Using Bayesian Neural Networks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Rongmei Lin, Yonghui Xiao, Tien-Ju Yang, Ding Zhao, Li Xiong 0001, Giovanni Motta, Françoise Beaufays |
Federated Pruning: Improving Neural Network Efficiency with Federated Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Takuya Kunihara, Chuanbo Zhu 0001, Daisuke Saito, Nobuaki Minematsu, Noriko Nakanishi |
Detection of Learners' Listening Breakdown with Oral Dictation and Its Use to Model Listening Skill Improvement Exclusively Through Shadowing. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tanvina Patel, Odette Scharenborg |
Using cross-model learnings for the Gram Vaani ASR Challenge 2022. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Verdiana De Fino, Lionel Fontan, Julien Pinquier, Isabelle Ferrané, Sylvain Detey |
Prediction of L2 speech proficiency based on multi-level linguistic features. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sreyan Ghosh, Samden Lepcha, S. Sakshi, Rajiv Ratn Shah, Srinivasan Umesh |
DeToxy: A Large-Scale Multimodal Dataset for Toxicity Classification in Spoken Utterances. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Md. Iftekhar Tanveer, Diego Casabuena, Jussi Karlgren, Rosie Jones |
Unsupervised Speaker Diarization that is Agnostic to Language, Overlap-Aware, and Tuning Free. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Joel Rixen, Matthias Renz |
QDPN - Quasi-dual-path Network for single-channel Speech Separation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ashutosh Chaubey, Sparsh Sinha, Susmita Ghose |
Improved Relation Networks for End-to-End Speaker Verification and Identification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yi-Kai Zhang, Da-Wei Zhou 0001, Han-Jia Ye, De-Chuan Zhan |
Audio-Visual Generalized Few-Shot Learning with Prototype-Based Co-Adaptation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kamil Deja, Ariadna Sánchez, Julian Roth, Marius Cotescu |
Automatic Evaluation of Speaker Similarity. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sangjun Park, Kihyun Choo, Joohyung Lee, Anton V. Porov, Konstantin Osipov, June Sig Sung |
Bunched LPCNet2: Efficient Neural Vocoders Covering Devices from Cloud to Edge. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Takayuki Arai, Miho Yamada, Megumi Okusawa |
Syllable sequence of /a/+/ta/ can be heard as /atta/ in Japanese with visual or tactile cues. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Diego Aguirre, Nigel G. Ward, Jonathan E. Avila, Heike Lehnert-LeHouillier |
Comparison of Models for Detecting Off-Putting Speaking Styles. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ondrej Klejch, Electra Wallington, Peter Bell 0001 |
Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hirokazu Kameoka, Takuhiro Kaneko, Shogo Seki, Kou Tanaka |
CAUSE: Crossmodal Action Unit Sequence Estimation from Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Théo Mariotte, Anthony Larcher, Silvio Montrésor, Jean-Hugh Thomas |
Microphone Array Channel Combination Algorithms for Overlapped Speech Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hangting Chen, Yi Yang 0057, Feng Dang, Pengyuan Zhang |
Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Catarina Botelho, Tanja Schultz, Alberto Abad, Isabel Trancoso |
Challenges of using longitudinal and cross-domain corpora on studies of pathological speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jiahong Huang, Wen Xu, Yule Li, Junshi Liu, Dongpeng Ma, Wei Xiang |
FlowCPCVC: A Contrastive Predictive Coding Supervised Flow Framework for Any-to-Any Voice Conversion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ivan Vovk, Tasnima Sadekova, Vladimir Gogoryan, Vadim Popov, Mikhail A. Kudinov, Jiansheng Wei |
Fast Grad-TTS: Towards Efficient Diffusion-Based Speech Generation on CPU. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Parvaneh Janbakhshi, Ina Kodrasi |
Adversarial-Free Speaker Identity-Invariant Representation Learning for Automatic Dysarthric Speech Classification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yukun Peng, Zhenhua Ling |
Decoupled Pronunciation and Prosody Modeling in Meta-Learning-based Multilingual Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhehuai Chen, Yu Zhang 0033, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro J. Moreno 0001, Ankur Bapna, Heiga Zen |
MAESTRO: Matched Speech Text Representations through Modality Matching. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shuo Ren, Shujie Liu 0001, Yu Wu 0012, Long Zhou, Furu Wei |
Speech Pre-training with Acoustic Piece. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Joon-Hyuk Chang, Won-Gook Choi |
Convolutional Recurrent Neural Network with Auxiliary Stream for Robust Variable-Length Acoustic Scene Classification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yifan Sun, Qinlong Huang, Xihong Wu |
Unsupervised Inference of Physiologically Meaningful Articulatory Trajectories with VocalTractLab. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Seunghan Yang, Debasmit Das, Janghoon Cho, Hyoungwoo Park, Sungrack Yun |
Domain Agnostic Few-shot Learning for Speaker Verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Vijay Ravi, Jinhan Wang, Jonathan Flint, Abeer Alwan |
A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chun-Yu Chen, Yun-Shao Lin, Chi-Chun Lee |
Emotion-Shift Aware CRF for Decoding Emotion Sequence in Conversation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Salvatore Fara, Stefano Goria, Emilia Molimpakis, Nicholas Cummins |
Speech and the n-Back task as a lens into depression. How combining both may allow us to isolate different core symptoms of depression. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Wenjing Liu, Chuan Xie |
MOS Prediction Network for Non-intrusive Speech Quality Assessment in Online Conferencing. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Joanna Hong, Minsu Kim, Daehun Yoo, Yong Man Ro |
Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuntao Li, Hanchu Zhang, Yutian Li, Sirui Wang, Wei Wu 0014, Yan Zhang |
Pay More Attention to History: A Context Modeling Strategy for Conversational Text-to-SQL. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Han Zhu, Jindong Wang 0001, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan 0002 |
Decoupled Federated Learning for ASR with Non-IID Data. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Feifei Xiong, Weiguang Chen, Pengyu Wang, Xiaofei Li, Jinwei Feng |
Spectro-Temporal SubNet for Real-Time Monaural Speech Denoising and Dereverberation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nicolás Schmidt, Jordi Pons, Marius Miron |
PodcastMix: A dataset for separating music and speech in podcasts. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Wen-Chin Huang, Dejan Markovic, Alexander Richard, Israel Dejene Gebru, Anjali Menon |
End-to-End Binaural Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Eesung Kim, Jae-Jin Jeon, Hyeji Seo, Hoon Kim |
Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ho-Hsiang Wu, Magdalena Fuentes, Prem Seetharaman, Juan Pablo Bello |
How to Listen? Rethinking Visual Sound Localization. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Christoph Böddeker, Tobias Cord-Landwehr, Thilo von Neumann, Reinhold Haeb-Umbach |
An Initialization Scheme for Meeting Separation with Spatial Mixture Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Rahil Parikh, Nadee Seneviratne, Ganesh Sivaraman, Shihab A. Shamma, Carol Y. Espy-Wilson |
Acoustic To Articulatory Speech Inversion Using Multi-Resolution Spectro-Temporal Representations Of Speech Signals. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Bogdan Ludusan, Marin Schröer, Petra Wagner |
Investigating phonetic convergence of laughter in conversation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Haohe Liu, Woosung Choi, Xubo Liu, Qiuqiang Kong, Qiao Tian, DeLiang Wang |
Neural Vocoder is All You Need for Speech Super-resolution. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | David Feinberg |
VoiceLab: Software for Fully Reproducible Automated Voice Analysis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
Displaying results #501 - #600 of 18782 (100 per page).