|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 1342 occurrences of 771 keywords
|
|
|
Results
Found 13615 publication records. Showing 13615 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
15 | Raymond Chung, Brian Mak |
Synthesizing Near Native-accented Speech for a Non-native Speaker by Imitating the Pronunciation and Prosody of a Native Speaker. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Zhiyun Fan, Zhenlin Liang, Linhao Dong, Yi Liu, Shiyu Zhou, Meng Cai, Jun Zhang 0066, Zejun Ma, Bo Xu |
Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Kenta Udagawa, Yuki Saito, Hiroshi Saruwatari |
Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Weiqing Wang, Ming Li 0026, Qingjian Lin |
Online Target Speaker Voice Activity Detection for Speaker Diarization. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Zifeng Zhao, Rongzhi Gu, Dongchao Yang, Jinchuan Tian, Yuexian Zou |
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Jaeuk Lee, Joon-Hyuk Chang |
Advanced Speaker Embedding with Predictive Variance of Gaussian Distribution for Speaker Adaptation in TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Xiaoyi Qin, Na Li 0012, Chao Weng, Dan Su 0002, Ming Li 0026 |
Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Srikanth Raj Chetupalli, Sriram Ganapathy |
Speaker conditioned acoustic modeling for multi-speaker conversational ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Dengfeng Ke, Liangjie Huang, Wenhan Yao, Ruixin Hu, Xueyin Zu, Yanlu Xie, Jinsong Zhang 0001 |
Voicifier-LN: An Novel Approach to Elevate the Speaker Similarity for General Zero-shot Multi-Speaker TTS. |
AIPR |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Lu Yi, Man-Wai Mak |
Disentangled Speaker Embedding for Robust Speaker Verification. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Chunlei Zhang, Jiatong Shi, Chao Weng, Meng Yu 0003, Dong Yu 0001 |
Towards end-to-end Speaker Diarization with Generalized Neural Speaker Clustering. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Shutong Niu, Jun Du, Lei Sun 0010, Chin-Hui Lee 0001 |
Improving Separation-Based Speaker Diarization Via Iterative Model Refinement And Speaker Embedding Based Post-Processing. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Wei Xia, Han Lu, Quan Wang, Anshuman Tripathi, Yiling Huang, Ignacio López-Moreno, Hasim Sak |
Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Youngki Kwon, Hee-Soo Heo, Jee-Weon Jung, You Jin Kim, Bong-Jin Lee, Joon Son Chung |
Multi-Scale Speaker Embedding-Based Graph Attention Networks For Speaker Diarisation. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Naoyuki Kanda, Xiong Xiao, Yashesh Gaur, Xiaofei Wang 0009, Zhong Meng, Zhuo Chen 0006, Takuya Yoshioka |
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers Using End-to-End Speaker-Attributed ASR. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Botao Zhao 0001, Xulong Zhang 0001, Jianzong Wang, Ning Cheng 0001, Jing Xiao 0006 |
nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-Shot Multi-speaker text-to-speech. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Disong Wang, Songxiang Liu, Xixin Wu, Hui Lu, Lifa Sun, Xunying Liu, Helen Meng |
Speaker Identity Preservation in Dysarthric Speech Reconstruction by Adversarial Speaker Adaptation. |
ICASSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Manaka Takamizawa, Satoru Tsuge, Yasuo Horiuchi, Shingo Kuroiwa |
Same Speaker Identification with Deep Learning and Application to Text-Dependent Speaker Verification. |
KES-HCIS |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Abudukelimu Wuerkaixi, Kunda Yan, You Zhang 0001, Zhiyao Duan, Changshui Zhang |
DyViSE: Dynamic Vision-Guided Speaker Embedding for Audio-Visual Speaker Diarization. |
MMSP |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Zhiqing Chen, Yifan Pan, Haoran Zhang, Yuesheng Zhu |
Wav2sv: End-to-end Speaker Embeddings Learning from Raw Waveforms based on Metric Learning for Speaker Verification. |
ITCC |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Juan Zuluaga-Gomez, Seyyed Saeed Sarfjoo, Amrutha Prasad, Iuliia Nigmatulina, Petr Motlícek, Karel Ondrej, Oliver Ohneiser, Hartmut Helmke |
Bertraffic: Bert-Based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Alessandro Arezzo, Stefano Berretti |
SPEAKER VGG CCT: Cross-Corpus Speech Emotion Recognition with Speaker Embedding and Vision Transformers. |
MMAsia |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Ragini Sinha, Marvin Tammen, Christian Rollwage, Simon Doclo |
Speaker-Conditioning Single-Channel Target Speaker Extraction using Conformer-based Architectures. |
IWAENC |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Roman Shrestha, Cornelius Glackin, Julie A. Wall, Nigel Cannings, Marvin Rajwadi, Satya Kada, James Laird, Thea Laird, Chris Woodruff |
Speaker Recognition using Multiple X-Vector Speaker Representations with Two-Stage Clustering and Outlier Detection Refinement. |
DASC/PiCom/CBDCom/CyberSciTech |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Rémi Uro, David Doukhan, Albert Rilliard, Laetitia Larcher, Anissa-Claire Adgharouamane, Marie Tahon, Antoine Laurent |
A Semi-Automatic Approach to Create Large Gender- and Age-Balanced Speaker Corpora: Usefulness of Speaker Diarization & Identification. |
LREC |
2022 |
DBLP BibTeX RDF |
|
15 | Ahmad Aloradi, Wolfgang Mack, Mohamed Elminshawi, Emanuël A. P. Habets |
Speaker Verification in Multi-Speaker Environments Using Temporal Feature Fusion. |
EUSIPCO |
2022 |
DBLP BibTeX RDF |
|
15 | Seyed Omid Sadjadi, Craig S. Greenberg, Elliot Singer, Lisa P. Mason, Douglas A. Reynolds |
The 2021 NIST Speaker Recognition Evaluation. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Junyi Peng, Chunlei Zhang, Jan Honza Cernocký, Dong Yu 0001 |
Progressive Contrastive Learning for Self-Supervised Text-Independent Speaker Verification. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Yosef Solewicz, Noa Cohen, Johan Rohdin, Srikanth R. Madikeri, Jan Honza Cercnocký |
Speaker Recognition on Mono-Channel Telephony Recordings. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Joonas Kalda, Tanel Alumäe |
Collar-Aware Training for Streaming Speaker Change Detection in Broadcast Speech. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Jintao Kang, Aijun Li, Jingyang Li |
Formant Dynamics of Chinese Compound Vowels with Implications for Forensic Speaker Identification. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Yijun Gong, Xiao-Lei Zhang |
DP-Means: An Efficient Bayesian Nonparametric Model for Speaker Diarization. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan |
Domain Generalized Speaker Embedding Learning via Mutual Information Minimization. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Jahangir Alam, Woo Hyun Kang, Abderrahim Fathan |
Hybrid Neural Network-Based Deep Embedding Extractors for Text-Independent Speaker Verification. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Lantian Li, Di Wang, Wenqiang Du, Dong Wang 0013 |
C-P Map: A Novel Evaluation Toolkit for Speaker Verification. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Alexey Sholokhov, Xuechen Liu, Md. Sahidullah, Tomi Kinnunen |
Baselines and Protocols for Household Speaker Recognition. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | YingWei Tan, XueFeng Ding 0001 |
The Volkswagen-Mobvoi System for CN-Celeb Speaker Recognition Challenge 2022. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Chenguang Hu, Qingran Zhan, Miao Liu, Xiang Xie |
BIT Submission for the Conversational Speaker Diarization Challenge. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Fuchuan Tong, Siqi Zheng, Haodong Zhou, Xingjia Xie, Qingyang Hong, Lin Li 0032 |
Deep Representation Decomposition for Rate-Invariant Speaker Verification. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Sarala Padi, Seyed Omid Sadjadi, Dinesh Manocha, Ram D. Sriram |
Multimodal Emotion Recognition Using Transfer Learning from Speaker Recognition and BERT-Based Models. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Galina Lavrentyeva, Sergey Novoselov, Vladimir Volokhov, Anastasia Avdeeva, Aleksei Gusev, Alisa Vinogradova, Igor Korsunov, Alexander Kozlov, Timur Pekhovsky, Andrey Shulipa, Evgeny Smirnov, Vasiliy Galyuk |
STC Speaker Recognition System for the NIST SRE 2021. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Hemlata Tak, Massimiliano Todisco, Xin Wang 0037, Jee-weon Jung, Junichi Yamagishi, Nicholas W. D. Evans |
Automatic Speaker Verification Spoofing and Deepfake Detection Using Wav2vec 2.0 and Data Augmentation. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Zuoer Chen, Liang He |
A Quick and Effective Speaker Diarization System. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Jason Pelecanos, Quan Wang, Yiling Huang, Ignacio López-Moreno |
Parameter-Free Attentive Scoring for Speaker Verification. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | You Zhang 0001, Ge Zhu, Zhiyao Duan |
A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Yanxiong Li, Wucheng Wang, Hao Chen, Wenchang Cao, Wei Li, Qianhua He |
Few-Shot Speaker Identification Using Depthwise Separable Convolutional Network with Channel Attention. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Diego Castán, Md. Hafizur Rahman, Sarah Bakst, Chris Cobo-Kroenke, Mitchell McLaren, Martin Graciarena, Aaron Lawson |
Speaker-Targeted Synthetic Speech Detection. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Xuechen Liu, Md. Sahidullah, Tomi Kinnunen |
Spoofing-Aware Speaker Verification with Unsupervised Domain Adaptation. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Seyed Omid Sadjadi, Craig S. Greenberg, Elliot Singer, Lisa P. Mason, Douglas A. Reynolds |
The NIST CTS Speaker Recognition Challenge. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Xiaoxiao Miao, Xin Wang 0037, Erica Cooper, Junichi Yamagishi, Natalia A. Tomashenko |
Language-Independent Speaker Anonymization Approach Using Self-Supervised Pre-Trained Models. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Anna Silnova, Themos Stafylakis, Ladislav Mosner, Oldrich Plchot, Johan Rohdin, Pavel Matejka, Lukás Burget, Ondrej Glembek, Niko Brummer |
Analyzing Speaker Verification Embedding Extractors and Back-Ends Under Language and Channel Mismatch. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Haibin Wu, Jiawen Kang, Lingwei Meng, Yang Zhang 0025, Xixin Wu, Zhiyong Wu 0001, Hung-yi Lee, Helen Meng |
Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Sandip Ghimire, Tomi Kinnunen, Rosa González Hautamäki |
Gamified Speaker Comparison by Listening. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Jialin Zhang, Qinghua Ren, You-cai Qin, Zi-Kai Wan, Qirong Mao |
Cross-Scene Speaker Verification Based on Dynamic Convolution for the CNSRC 2022 Challenge. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Yucong Zhang, Qingjian Lin, Weiqing Wang, Lin Yang, Xuyang Wang, Junjie Wang, Ming Li 0026 |
Low-Latency Online Speaker Diarization with Graph-Based Label Generation. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Jahangir Alam, Radek Benes, Marian Beszédes, Lukás Burget, Mohamed Dahmane, Abderrahim Fathan, Hamed Ghodrati, Ondrej Glembek, Woo Hyun Kang, Pavel Matejka, Ladislav Mosner, Oldrich Plchot, Johan Rohdin, Anna Silnova, Themos Stafylakis |
Development of ABC Systems for the 2021 Edition of NIST Speaker Recognition Evaluation. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Jesús Villalba 0001, Bengt J. Borgstrom, Saurabh Kataria, Magdalena Rybicka, Carlos D. Castillo, Jaejin Cho, L. Paola García-Perera, Pedro A. Torres-Carrasquillo, Najim Dehak |
Advances in Cross-Lingual and Cross-Source Audio-Visual Speaker Recognition: The JHU-MIT System for NIST SRE21. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Sarah Bakst, Chris Cobo-Kroenke, Aaron Lawson, Mitchell McLaren, Allen R. Stauffer |
Time-Varying Score Reliability Prediction in Speaker Identification. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Jesús Villalba 0001, Bengt J. Borgstrom, Saurabh Kataria, Jaejin Cho, Pedro A. Torres-Carrasquillo, Najim Dehak |
Advances in Speaker Recognition for Multilingual Conversational Telephone Speech: The JHU-MIT System for NIST SRE20 CTS Challenge. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Nikita Kuzmin, Igor Fedorov, Alexey Sholokhov |
Magnitude-Aware Probabilistic Speaker Embeddings. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Xinmei Su, Qingran Zhan, Chenguang Hu, Xiang Xie |
Combination of Multiple Embeddings for Speaker Retrieval. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans |
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Sandro Cumani, Salvatore Sarni |
Impostor Score Statistics as Quality Measures for the Calibration of Speaker Verification Systems. |
Odyssey |
2022 |
DBLP DOI BibTeX RDF |
|
15 | Sung Hwan Mun, Min Hyun Han, Dongjune Lee, Jihwan Kim, Nam Soo Kim |
Bootstrap Equilibrium and Probabilistic Speaker Representation Learning for Self-Supervised Speaker Verification. |
IEEE Access |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Fahimeh Bahmaninezhad, Chunlei Zhang, John H. L. Hansen |
An investigation of domain adaptation in speaker embedding space for speaker recognition. |
Speech Commun. |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari |
Deep Gaussian process based multi-speaker speech synthesis with latent speaker representation. |
Speech Commun. |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Mohammad Azharuddin Laskar, Chuya China Bhanja, Rabul Hussain Laskar |
Speaker-Phrase-Specific Adaptation of PLDA Model for Improved Performance in Text-Dependent Speaker Verification. |
Circuits Syst. Signal Process. |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Li Zhang, Huan Zhao, Qinling Meng, Yanli Chen, Min Liu, Lei Xie |
Beijing ZKJ-NPU Speaker Verification System for VoxCeleb Speaker Recognition Challenge 2021. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Wei Xia, Han Lu, Quan Wang, Anshuman Tripathi, Ignacio López-Moreno, Hasim Sak |
Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Danwei Cai, Ming Li 0026 |
The DKU-DukeECE System for the Self-Supervision Speaker Verification Task of the 2021 VoxCeleb Speaker Recognition Challenge. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Chiang-Jen Peng, Yun-Ju Chan, Cheng Yu, Syu-Siang Wang, Yu Tsao 0001, Tai-Shih Chi |
Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Naoyuki Kanda, Xiong Xiao, Yashesh Gaur, Xiaofei Wang 0009, Zhong Meng, Zhuo Chen 0006, Takuya Yoshioka |
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Zhuo Li, Ce Fang, Runqiu Xiao, Zhigao Chen, Wenchao Wang, Yonghong Yan 0002 |
The HCCL Speaker Verification System for Far-Field Speaker Verification Challenge. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Shangeth Rajaa, Van Tung Pham, Chng Eng Siong |
Learning Speaker Representation with Semi-supervised Learning approach for Speaker Profiling. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Keke Wang, Xudong Mao, Hao Wu, Chen Ding, Chuxiang Shang, Rui Xia, Yuxuan Wang |
The ByteDance Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2021. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Pengfei Wu, Junjie Pan, Chenchang Xu, Junhui Zhang, Lin Wu, Xiang Yin 0006, Zejun Ma |
Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Guangzhi Sun, D. Liu, Chao Zhang 0031, Philip C. Woodland |
Content-Aware Speaker Embeddings for Speaker Diarisation. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Qicong Xie, Tao Li, Xinsheng Wang, Zhichao Wang, Lei Xie 0001, Guoqiao Yu, Guanglu Wan |
Multi-speaker Multi-style Text-to-speech Synthesis With Single-speaker Single-style Training Data Scenarios. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Pengcheng Guo, Xuankai Chang, Shinji Watanabe 0001, Lei Xie 0001 |
Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Zhenning Tan, Yuguang Yang 0004, Eunjung Han, Andreas Stolcke |
Improving Speaker Identification for Shared Devices by Adapting Embeddings to Speaker Subsets. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Beáta Lorincz, Adriana Stan, Mircea Giurgiu |
An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Chung-Ming Chien, Jheng-Hao Lin, Chien-yu Huang, Po-Chun Hsu, Hung-yi Lee |
Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Sung Hwan Mun, Min Hyun Han, Dongjune Lee, Jihwan Kim, Nam Soo Kim |
Bootstrap Equilibrium and Probabilistic Speaker Representation Learning for Self-supervised Speaker Verification. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Midia Yousefi, John H. L. Hanse |
Speaker conditioning of acoustic models using affine transformation for multi-speaker speech recognition. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Youngki Kwon, Hee-Soo Heo, Jee-weon Jung, You Jin Kim, Bong-Jin Lee, Joon Son Chung |
Multi-scale speaker embedding-based graph attention networks for speaker diarisation. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Youngki Kwon, Jee-weon Jung, Hee-Soo Heo, You Jin Kim, Bong-Jin Lee, Joon Son Chung |
Adapting Speaker Embeddings for Speaker Diarisation. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
15 | Hongwei Luo, Yijie Shen, Feng Lin 0004, Guoai Xu |
Spoofing Speaker Verification System by Adversarial Examples Leveraging the Generalized Speaker Difference. |
Secur. Commun. Networks |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Leda Sari, Mark Hasegawa-Johnson, Samuel Thomas 0001 |
Auxiliary Networks for Joint Speaker Adaptation and Speaker Change Detection. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Peidong Wang, Zhuo Chen 0006, DeLiang Wang, Jinyu Li 0001, Yifan Gong 0001 |
Speaker Separation Using Speaker Inventories and Estimated Speech. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari |
Perceptual-Similarity-Aware Deep Speaker Representation Learning for Multi-Speaker Generative Modeling. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Chengfang Luo, Xin Guo, Aiwen Deng, Wei Xu, Junhong Zhao, Wenxiong Kang |
Learning Discriminative Speaker Embedding by Improving Aggregation Strategy and Loss Function for Speaker Verification. |
IJCB |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Lingjun Zhao, Man-Wai Mak |
Channel Interdependence Enhanced Speaker Embeddings for Far-Field Speaker Verification. |
ISCSLP |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Youngki Kwon, Jee-weon Jung, Hee-Soo Heo, You Jin Kim, Bong-Jin Lee, Joon Son Chung |
Adapting Speaker Embeddings for Speaker Diarisation. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Saurabh Kataria, Jesús Villalba 0001, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak |
Deep Feature CycleGANs: Speaker Identity Preserving Non-Parallel Microphone-Telephone Domain Adaptation for Speaker Verification. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Maokui He, Desh Raj, Zili Huang, Jun Du, Zhuo Chen 0006, Shinji Watanabe 0001 |
Target-Speaker Voice Activity Detection with Improved i-Vector Estimation for Unknown Number of Speaker. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Xiaoyi Qin, Chao Wang, Yong Ma, Min Liu, Shilei Zhang, Ming Li 0026 |
Our Learned Lessons from Cross-Lingual Speaker Verification: The CRMI-DKU System Description for the Short-Duration Speaker Verification Challenge 2021. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Kenichi Fujita, Atsushi Ando, Yusuke Ijima |
Phoneme Duration Modeling Using Speech Rhythm-Based Speaker Embeddings for Multi-Speaker Speech Synthesis. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Wupeng Wang, Chenglin Xu, Meng Ge, Haizhou Li 0001 |
Neural Speaker Extraction with Speaker-Speech Cross-Attention Network. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Chau Luu, Peter Bell 0001, Steve Renals |
Leveraging Speaker Attribute Information Using Multi Task Learning for Speaker Verification and Diarization. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
15 | Pengcheng Guo, Xuankai Chang, Shinji Watanabe 0001, Lei Xie 0001 |
Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
|
|