Results
Found 13615 publication records. Showing 13615 according to the selection in the facets.
Hits | Authors | Title | Venue | Year | Link | Author keywords
20 | Shigeo Morishima, Satoshi Nakamura 0001 | Multi-Modal Translation System and Its Evaluation. | ICMI | 2002 | DBLP DOI BibTeX RDF |
20 | Hung-Ju Huang, Chun-Nan Hsu | Recognizing 100 Speakers Using Homologous Naive Bayes. | PRICAI | 2002 | DBLP DOI BibTeX RDF |
20 | Harriet J. Nock, Giridharan Iyengar, Chalapathy Neti | Assessing face and speech consistency for monologue detection in video. | ACM Multimedia | 2002 | DBLP DOI BibTeX RDF |
20 | Kimmo Koskenniemi | Is natural language an inconvenience or an opportunity for IR?. | SIGIR | 2002 | DBLP DOI BibTeX RDF |
20 | Roland Hausser | Spatio-temporal Indexing in Database Semantics. | CICLing | 2001 | DBLP DOI BibTeX RDF |
20 | Shin Ogata, Kazumasa Murai, Satoshi Nakamura 0001, Shigeo Morishima | Model-Based Lip Synchronization With Automatically Translated Systhetic Voice Toward A Multi-Modal Translation System. | ICME | 2001 | DBLP DOI BibTeX RDF |
20 | Shigeo Morishima, Shin Ogata, Satoshi Nakamura 0001 | Trends of Learning Technology Standard. | ICME | 2001 | DBLP DOI BibTeX RDF |
20 | Jingyuan Zhang, Min Zhang | Software Solution to Completely Wireless Presentation. | ICPP Workshops | 2001 | DBLP DOI BibTeX RDF |
20 | Liwei He, Elizabeth Sanocki, Anoop Gupta, Jonathan Grudin | Comparing presentation summaries: slides vs. reading vs. listening. | CHI | 2000 | DBLP DOI BibTeX RDF | multimedia, video summarization, video browsing, digital video library, video skim, video abstraction
20 | Lorien Y. Pratt, Kathleen D. Cebulka, Peter Clitherow | Residual Speech Signal Compression: An Experiment in the Practical Application of Neural Network Technology. | IEA/AIE (Vol. 2) | 1990 | DBLP DOI BibTeX RDF |
15 | Kenichi Fujita, Atsushi Ando, Yusuke Ijima | Speech Rhythm-Based Speaker Embeddings Extraction from Phonemes and Phoneme Duration for Multi-Speaker Speech Synthesis. | CoRR | 2024 | DBLP DOI BibTeX RDF |
15 | Yejin Jeon, Yunsu Kim 0001, Gary Geunbae Lee | Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker Representations. | CoRR | 2024 | DBLP DOI BibTeX RDF |
15 | Can Cui, Imran A. Sheikh, Mostafa Sadeghi, Emmanuel Vincent 0001 | Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications. | CoRR | 2024 | DBLP DOI BibTeX RDF |
15 | Kenichi Fujita, Atsushi Ando, Yusuke Ijima | Speech Rhythm-Based Speaker Embeddings Extraction from Phonemes and Phoneme Duration for Multi-Speaker Speech Synthesis. | IEICE Trans. Inf. Syst. | 2024 | DBLP DOI BibTeX RDF |
15 | Maros Jakubec, Roman Jarina, Eva Lieskovska, Peter Kasak | Deep speaker embeddings for Speaker Verification: Review and experimental comparison. | Eng. Appl. Artif. Intell. | 2024 | DBLP DOI BibTeX RDF |
15 | Yejin Jeon, Yunsu Kim 0001, Gary Geunbae Lee | Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker Representations. | AAAI | 2024 | DBLP DOI BibTeX RDF |
15 | Illia Zaiets, Vitalii Brydinskyi, Dmytro Sabodashko, Yuriy Khoma, Khrystyna Ruda | Integrated System for Speaker Diarization and Intruder Detection using Speaker Embeddings. | CPITS | 2024 | DBLP BibTeX RDF |
15 | Nirupam Shome, Banala Saritha, Richik Kashyap, Rabul Hussain Laskar | A robust DNN model for text-independent speaker identification using non-speaker embeddings in diverse data conditions. | Neural Comput. Appl. | 2023 | DBLP DOI BibTeX RDF |
15 | Pengbin Fu, Yuchen Ma, Huirong Yang | Speaker diarization with variants of self-attention and joint speaker embedding extractor. | J. Intell. Fuzzy Syst. | 2023 | DBLP DOI BibTeX RDF |
15 | Hyungchan Yoon, Changhwan Kim, Seyun Um, Hyun-Wook Yoon, Hong-Goo Kang | SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems. | IEEE Signal Process. Lett. | 2023 | DBLP DOI BibTeX RDF |
15 | P. S. Subhashini Pedalanka, Manchikalapudi Satya Sai Ram, Duggirala Sreenivasa Rao | Cross B-HUB Based RNN with Random Aural-Feature Extraction for Enhanced Speaker Extraction and Speaker Recognition. | Wirel. Pers. Commun. | 2023 | DBLP DOI BibTeX RDF |
15 | Zezhong Jin, Youzhi Tu, Man-Wai Mak | Phonetic-aware speaker embedding for far-field speaker verification. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Ashutosh Chaubey, Sparsh Sinha, Susmita Ghose | Speaker-specific Thresholding for Robust Imposter Identification in Unseen Speaker Recognition. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Kai Liu, Xucheng Wan, Ziqing Du, Huan Zhou 0008 | Improving Target Speaker Extraction with Sparse LDA-transformed Speaker Embeddings. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Kai Liu, Ziqing Du, Xucheng Wan, Huan Zhou 0008 | X-SepFormer: End-to-end Speaker Extraction Network with Explicit Optimization on Speaker Confusion. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Jun Chen 0024, Wei Rao, Zilin Wang, Jiuxin Lin, Yukai Ju, Shulin He, Yannan Wang, Zhiyong Wu 0001 | MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Can Cui, Sheikh Imran Ahamad, Mostafa Sadeghi, Emmanuel Vincent 0001 | End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Ze Li, Yuke Lin, Xiaoyi Qin, Ning Jiang, Guoqing Zhao, Ming Li | The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 2023. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Guangke Chen, Yedi Zhang, Fu Song | SLMIA-SR: Speaker-Level Membership Inference Attacks against Speaker Recognition Systems. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Duc-Tuan Truong, Ruijie Tao, Jia Qi Yip, Kong Aik Lee, Eng Siong Chng | Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Rohit Paturi, Sundararajan Srinivasan, Xiang Li | Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Gaobin Yang, Maokui He, Shutong Niu, Ruoyu Wang 0029, Yanyan Yue, Shuangqing Qian, Shilong Wu, Jun Du, Chin-Hui Lee 0001 | Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Luyao Cheng, Siqi Zheng, Qinglin Zhang, Hui Wang, Yafeng Chen, Qian Chen 0003 | Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Jiuxin Lin, Peng Wang, Heinrich Dinkel, Jun Chen 0024, Zhiyong Wu 0001, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang | Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Sung Hwan Mun, Min Hyun Han, Canyeong Moon, Nam Soo Kim | EEND-DEMUX: End-to-End Neural Speaker Diarization via Demultiplexed Speaker Embeddings. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Qiongqiong Wang, Kong Aik Lee, Tianchi Liu 0004 | Incorporating Uncertainty from Speaker Embedding Estimation to Speaker Verification. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Weiqing Wang, Ming Li | End-to-end Online Speaker Diarization with Target Speaker Tracking. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Jenthe Thienpondt, Nilesh Madhu, Kris Demuynck | Margin-Mixup: A Method for Robust Speaker Verification in Multi-Speaker Audio. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Zhengyang Chen, Bing Han, Shuai Wang 0016, Yanmin Qian | Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Luca Serafini, Samuele Cornell, Giovanni Morrone, Enrico Zovato, Alessio Brutti, Stefano Squartini | An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Vishwanath Pratap Singh, Md. Sahidullah, Tomi Kinnunen | Speaker Verification Across Ages: Investigating Deep Speaker Embedding Sensitivity to Age Mismatch in Enrollment and Test Speech. | CoRR | 2023 | DBLP DOI BibTeX RDF |
15 | Luca Serafini, Samuele Cornell, Giovanni Morrone, Enrico Zovato, Alessio Brutti, Stefano Squartini | An experimental review of speaker diarization methods with application to two-speaker conversational telephone speech recordings. | Comput. Speech Lang. | 2023 | DBLP DOI BibTeX RDF |
15 | Chae-Woon Bang, Chanjun Chun | Effective Zero-Shot Multi-Speaker Text-to-Speech Technique Using Information Perturbation and a Speaker Encoder. | Sensors | 2023 | DBLP DOI BibTeX RDF |
15 | Qian-Bei Hong, Chung-Hsien Wu, Hsin-Min Wang | Generalization Ability Improvement of Speaker Representation and Anti-Interference for Speaker Verification. | IEEE ACM Trans. Audio Speech Lang. Process. | 2023 | DBLP DOI BibTeX RDF |
15 | Mao-Kui He, Jun Du, Qing-Feng Liu, Chin-Hui Lee 0001 | ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding. | IEEE ACM Trans. Audio Speech Lang. Process. | 2023 | DBLP DOI BibTeX RDF |
15 | Yingke Zhu, Brian Mak | Bayesian Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification. | IEEE ACM Trans. Audio Speech Lang. Process. | 2023 | DBLP DOI BibTeX RDF |
15 | Tomoka Wakamatsu, Sayaka Shiota, Hitoshi Kiya | Vocal Tract Length Perturbation-based Pseudo-Speaker Augmentation for Speaker Embedding Learning. | APSIPA ASC | 2023 | DBLP DOI BibTeX RDF |
15 | Shalini Saini, Nitesh Saxena | Speaker Anonymity and Voice Conversion Vulnerability: A Speaker Recognition Analysis. | CNS | 2023 | DBLP DOI BibTeX RDF |
15 | Can Cui, Imran A. Sheikh, Mostafa Sadeghi, Emmanuel Vincent 0001 | End-to-End Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis. | ASRU | 2023 | DBLP DOI BibTeX RDF |
15 | Kai Liu, Ziqing Du, Xucheng Wan, Huan Zhou 0008 | X-SEPFORMER: End-To-End Speaker Extraction Network with Explicit Optimization on Speaker Confusion. | ICASSP | 2023 | DBLP DOI BibTeX RDF |
15 | Tao Liu, Zhengyang Chen, Yanmin Qian, Kai Yu 0004 | Multi-Speaker End-to-End Multi-Modal Speaker Diarization System for the MISP 2022 Challenge. | ICASSP | 2023 | DBLP DOI BibTeX RDF |
15 | You Jin Kim, Hee-Soo Heo, Jee-Weon Jung, Youngki Kwon, Bong-Jin Lee, Joon Son Chung | Advancing the Dimensionality Reduction of Speaker Embeddings for Speaker Diarisation: Disentangling Noise and Informing Speech Activity. | ICASSP | 2023 | DBLP DOI BibTeX RDF |
15 | Paul-Gauthier Noé, Xiaoxiao Miao, Xin Wang 0037, Junichi Yamagishi, Jean-François Bonastre, Driss Matrouf | Hiding Speaker's Sex in Speech Using Zero-Evidence Speaker Representation in an Analysis/Synthesis Pipeline. | ICASSP | 2023 | DBLP DOI BibTeX RDF |
15 | Jenthe Thienpondt, Nilesh Madhu, Kris Demuynck | Margin-Mixup: A Method for Robust Speaker Verification In Multi-Speaker Audio. | ICASSP | 2023 | DBLP DOI BibTeX RDF |
15 | Qiongqiong Wang, Kong Aik Lee, Tianchi Liu 0004 | Incorporating Uncertainty from Speaker Embedding Estimation to Speaker Verification. | ICASSP | 2023 | DBLP DOI BibTeX RDF |
15 | Xiaoyu Liu, Xu Li, Joan Serrà | Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation. | ICASSP | 2023 | DBLP DOI BibTeX RDF |
15 | Xianbo Xu, Diqun Yan, Li Dong 0006 | Adaptive-SpEx: Local and Global Perceptual Modeling with Speaker Adaptation for Target Speaker Extraction. | SMC | 2023 | DBLP DOI BibTeX RDF |
15 | Luyao Cheng, Siqi Zheng, Qinglin Zhang, Hui Wang, Yafeng Chen, Qian Chen 0003 | Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization. | ACL (Findings) | 2023 | DBLP DOI BibTeX RDF |
15 | Kurniawati Azizah, Wisnu Jatmiko | Transfer Learning, Style Control, and Speaker Reconstruction Loss for Zero-Shot Multilingual Multi-Speaker Text-to-Speech on Low-Resource Languages. | IEEE Access | 2022 | DBLP DOI BibTeX RDF |
15 | Chinchu Thomas, Dinesh Babu Jayagopi | Predicting Presentation Skill of a Speaker Using Automatic Speaker and Audience Measurement. | IEEE Trans. Learn. Technol. | 2022 | DBLP DOI BibTeX RDF |
15 | Md. Shah Fahad, Ashish Ranjan 0003, Akshay Deepak, Gayadhar Pradhan | Speaker Adversarial Neural Network (SANN) for Speaker-independent Speech Emotion Recognition. | Circuits Syst. Signal Process. | 2022 | DBLP DOI BibTeX RDF |
15 | Byoung Jin Choi, Myeonghun Jeong, Joun Yeop Lee, Nam Soo Kim | SNAC: Speaker-Normalized Affine Coupling Layer in Flow-Based Architecture for Zero-Shot Multi-Speaker Text-to-Speech. | IEEE Signal Process. Lett. | 2022 | DBLP DOI BibTeX RDF |
15 | Muhammad Muneeb | Method to integrate speaker identification, speech recognition, and information retrieval algorithms for speaker-based information retrieval. | Int. J. Knowl. Eng. Data Min. | 2022 | DBLP DOI BibTeX RDF |
15 | Vijay M. Sardar, Manisha L. Jadhav, Saurabh H. Deshmukh | Timbre features with MEDIAN values for compensating intra-speaker variability in speaker identification of whispering sound. | Int. J. Speech Technol. | 2022 | DBLP DOI BibTeX RDF |
15 | Bowen Pang, Huan Zhao, Gaosheng Zhang, Xiaoyue Yang, Yang Sun, Li Zhang 0084, Qing Wang 0039, Lei Xie 0001 | TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Zhiyun Fan, Zhenlin Liang, Linhao Dong, Yi Liu, Shiyu Zhou, Meng Cai, Jun Zhang 0066, Zejun Ma, Bo Xu | Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Kenta Udagawa, Yuki Saito, Hiroshi Saruwatari | Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Weiqing Wang, Qingjian Lin, Ming Li 0026 | Online Target Speaker Voice Activity Detection for Speaker Diarization. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Zhe Li, Man-Wai Mak | Speaker Representation Learning via Contrastive Loss with Maximal Speaker Separability. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Xiaoyi Qin, Na Li 0012, Chao Weng, Dan Su 0002, Ming Li 0026 | Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Chang Zeng, Xiaoxiao Miao, Xin Wang 0037, Erica Cooper, Junichi Yamagishi | Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Chunyu Qiang, Peng Yang, Hao Che, Xiaorui Wang, Zhongyuan Wang 0006 | Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Byoung Jin Choi, Myeonghun Jeong, Minchan Kim, Sung Hwan Mun, Nam Soo Kim | Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Disong Wang, Songxiang Liu, Xixin Wu, Hui Lu, Lifa Sun, Xunying Liu, Helen Meng | Speaker Identity Preservation in Dysarthric Speech Reconstruction by Adversarial Speaker Adaptation. | CoRR | 2022 | DBLP BibTeX RDF |
15 | Xiaoyu Liu, Xu Li, Joan Serrà | Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Seong-Hu Kim, Hyeonuk Nam, Yong-Hwa Park | Decomposed Temporal Dynamic CNN: Efficient Time-Adaptive Network for Text-Independent Speaker Verification Explained with Speaker Activation Map. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Shulin He, Wei Rao, Kanghao Zhang, Yukai Ju, Yang Yang, Xueliang Zhang 0001, Yannan Wang, Shidong Shang | Local-global speaker representation for target speaker extraction. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Paul-Gauthier Noé, Xiaoxiao Miao, Xin Wang 0037, Junichi Yamagishi, Jean-François Bonastre, Driss Matrouf | Hiding speaker's sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Yu Zheng 0020, Jinghan Peng, Yihao Chen, Yajun Zhang, Jialong Wang, Min Liu, Minqiang Xu | The SpeakIn Speaker Verification System for Far-Field Speaker Verification Challenge 2022. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Zifeng Zhao, Rongzhi Gu, Dongchao Yang, Jinchuan Tian, Yuexian Zou | Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Ahmad Aloradi, Wolfgang Mack, Mohamed Elminshawi, Emanuël A. P. Habets | Speaker Verification in Multi-Speaker Environments Using Temporal Feature Fusion. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Byoung Jin Choi, Myeonghun Jeong, Joun Yeop Lee, Nam Soo Kim | SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Mohamed Chetouani, Marcos Faúndez-Zanuy, Bruno Gas, Jean-Luc Zarader | A New Nonlinear speaker parameterization algorithm for speaker identification. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Yixuan Zhou 0002, Changhe Song, Xiang Li 0105, Luwen Zhang, Zhiyong Wu 0001, Yanyao Bian, Dan Su 0002, Helen Meng | Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Ruohua Zhou, Yuxuan Du, Chenlei Hu | The BUCEA Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2022. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Botao Zhao 0001, Xulong Zhang 0001, Jianzong Wang, Ning Cheng 0001, Jing Xiao 0006 | nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-shot Multi-speaker Text-to-Speech. | CoRR | 2022 | DBLP BibTeX RDF |
15 | Naoyuki Kanda, Jian Wu 0027, Yu Wu 0012, Xiong Xiao, Zhong Meng, Xiaofei Wang 0009, Yashesh Gaur, Zhuo Chen 0006, Jinyu Li 0001, Takuya Yoshioka | Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Alessandro Arezzo, Stefano Berretti | SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transformers. | CoRR | 2022 | DBLP DOI BibTeX RDF |
15 | Narla John Metilda Sagaya Mary, Srinivasan Umesh, Sandesh Varadaraju Katta | S-Vectors and TESA: Speaker Embeddings and a Speaker Authenticator Based on Transformer Encoder. | IEEE ACM Trans. Audio Speech Lang. Process. | 2022 | DBLP DOI BibTeX RDF |
15 | Weiqing Wang, Qingjian Lin, Danwei Cai, Ming Li 0026 | Similarity Measurement of Segment-Level Speaker Embeddings in Speaker Diarization. | IEEE ACM Trans. Audio Speech Lang. Process. | 2022 | DBLP DOI BibTeX RDF |
15 | Tao Liu, Xu Xiang, Zhengyang Chen, Bing Han, Kai Yu 0004, Yanmin Qian | The X-Lance Speaker Diarization System for the Conversational Short-phrase Speaker Diarization Challenge 2022. | ISCSLP | 2022 | DBLP DOI BibTeX RDF |
15 | Bowen Pang, Huan Zhao, Gaosheng Zhang, Xiaoyue Yang, Yang Sun, Li Zhang 0084, Qing Wang 0039, Lei Xie 0001 | TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge. | ISCSLP | 2022 | DBLP DOI BibTeX RDF |
15 | Qicong Xie, Tao Li, Xinsheng Wang, Zhichao Wang, Lei Xie 0001, Guoqiao Yu, Guanglu Wan | Multi-speaker Multi-style Text-to-speech Synthesis with Single-speaker Single-style Training Data Scenarios. | ISCSLP | 2022 | DBLP DOI BibTeX RDF |
15 | Chunyu Qiang, Peng Yang, Hao Che, Xiaorui Wang, Zhongyuan Wang 0006 | Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis. | ISCSLP | 2022 | DBLP DOI BibTeX RDF |
15 | Yixuan Zhou 0002, Changhe Song, Xiang Li 0105, Luwen Zhang, Zhiyong Wu 0001, Yanyao Bian, Dan Su 0002, Helen Meng | Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis. | INTERSPEECH | 2022 | DBLP DOI BibTeX RDF |
15 | Chu-Xiao Zuo, Jia-Yi Leng, Wu-Jun Li | Speaker-Specific Utterance Ensemble based Transfer Attack on Speaker Identification. | INTERSPEECH | 2022 | DBLP DOI BibTeX RDF |
15 | Avamarie Brueggeman, John H. L. Hansen | Speaker Trait Enhancement for Cochlear Implant Users: A Case Study for Speaker Emotion Perception. | INTERSPEECH | 2022 | DBLP DOI BibTeX RDF |
15 | Nicolas Audibert, Cécile Fougeron | Intra-speaker phonetic variation in read speech: comparison with inter-speaker variability in a controlled population. | INTERSPEECH | 2022 | DBLP DOI BibTeX RDF |
15 | Naoyuki Kanda, Jian Wu 0027, Yu Wu 0012, Xiong Xiao, Zhong Meng, Xiaofei Wang 0009, Yashesh Gaur, Zhuo Chen 0006, Jinyu Li 0001, Takuya Yoshioka | Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings. | INTERSPEECH | 2022 | DBLP DOI BibTeX RDF |
15 | Bin Gu | Deep speaker embedding with frame-constrained training strategy for speaker verification. | INTERSPEECH | 2022 | DBLP DOI BibTeX RDF |