The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Searching for phrase Audio-visual (changed automatically) with no syntactic query expansion in all metadata.

Publication years (Num. hits)
1974-1993 (15) 1994-1996 (21) 1997 (55) 1998 (29) 1999 (34) 2000 (42) 2001 (73) 2002 (68) 2003 (103) 2004 (113) 2005 (100) 2006 (101) 2007 (144) 2008 (145) 2009 (118) 2010 (84) 2011 (103) 2012 (95) 2013 (102) 2014 (101) 2015 (85) 2016 (90) 2017 (99) 2018 (139) 2019 (160) 2020 (198) 2021 (269) 2022 (292) 2023 (418) 2024 (78)
Publication types (Num. hits)
article(1159) book(1) incollection(17) inproceedings(2241) phdthesis(43) proceedings(13)
Venues (Conferences, Journals, ...)
CoRR(567) ICASSP(158) Interspeech(152) AVSP(136) HAVE(79) ACM Multimedia(75) ICME(69) ICMI(57) AVEC@ACM Multimedia(45) AVEC@MM(41) CVPR(40) IEEE Trans. Multim.(39) EUSIPCO(35) MMSP(32) IEEE Access(28) AAAI(24) More (+10 of total 800)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 800 occurrences of 559 keywords

Results
Found 3474 publication records. Showing 3474 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
11Leandro A. Passos, João Paulo Papa, Javier Del Ser, Amir Hussain 0001, Ahsan Adeel Multimodal audio-visual information fusion using canonical-correlated Graph Neural Network for energy-efficient speech enhancement. Search on Bibsonomy Inf. Fusion The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11 Retracted: Investigating the interactive audio-visual course mode for college English using virtual reality and artificial intelligence. Search on Bibsonomy IET Softw. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Di Guo, Huaping Liu 0001, Fuchun Sun 0001 Audio-visual language instruction understanding for robotic sorting. Search on Bibsonomy Robotics Auton. Syst. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Qiya Song, Bin Sun 0001, Shutao Li Multimodal Sparse Transformer Network for Audio-Visual Speech Recognition. Search on Bibsonomy IEEE Trans. Neural Networks Learn. Syst. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yuqin Cao, Xiongkuo Min, Wei Sun 0029, Guangtao Zhai Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment. Search on Bibsonomy IEEE Trans. Image Process. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yuqin Cao, Xiongkuo Min, Wei Sun 0029, Guangtao Zhai Subjective and Objective Audio-Visual Quality Assessment for User Generated Content. Search on Bibsonomy IEEE Trans. Image Process. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Guinan Li, Jiajun Deng, Mengzhe Geng, Zengrui Jin, Tianzi Wang, Shujie Hu, Mingyu Cui, Helen Meng, Xunying Liu Audio-Visual End-to-End Multi-Channel Speech Separation, Dereverberation and Recognition. Search on Bibsonomy IEEE ACM Trans. Audio Speech Lang. Process. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Xinyuan Qian, Zhengdong Wang, Jiadong Wang, Guohui Guan, Haizhou Li 0001 Audio-Visual Cross-Attention Network for Robotic Speaker Tracking. Search on Bibsonomy IEEE ACM Trans. Audio Speech Lang. Process. The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Shentong Mo, Weiguo Pian, Yapeng Tian Class-Incremental Grouping Network for Continual Audio-Visual Learning. Search on Bibsonomy ICCV The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yuxin Mao, Jing Zhang, Mochu Xiang, Yiran Zhong, Yuchao Dai Multimodal Variational Auto-encoder based Audio-Visual Segmentation. Search on Bibsonomy ICCV The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Muhammad Adi Nugroho, Sangmin Woo, Sumin Lee, Changick Kim Audio-Visual Glance Network for Efficient Video Recognition. Search on Bibsonomy ICCV The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Jinyu Chen, Wenguan Wang, Si Liu 0001, Hongsheng Li, Yi Yang 0001 Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation. Search on Bibsonomy ICCV The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Xiaobao Guo, Nithish Muthuchamy Selvaraj, Zitong Yu, Adams Wai-Kin Kong, Bingquan Shen, Alex C. Kot Audio-Visual Deception Detection: DOLOS Dataset and Parameter-Efficient Crossmodal Learning. Search on Bibsonomy ICCV The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Kranthi Kumar Rachavarapu, A. N. Rajagopalan 0001 Boosting Positive Segments for Weakly-Supervised Audio-Visual Video Parsing. Search on Bibsonomy ICCV The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Weiguo Pian, Shentong Mo, Yunhui Guo, Yapeng Tian Audio-Visual Class-Incremental Learning. Search on Bibsonomy ICCV The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Zhe Niu, Brian Mak On the Audio-visual Synchronization for Lip-to-Speech Synthesis. Search on Bibsonomy ICCV The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Jie Hong, Zeeshan Hayder, Junlin Han, Pengfei Fang, Mehrtash Harandi, Lars Petersson Hyperbolic Audio-visual Zero-shot Learning. Search on Bibsonomy ICCV The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yang Liu 0084, Ying Tan, Haoyuan Lan Self-Supervised Contrastive Learning for Audio-Visual Action Recognition. Search on Bibsonomy ICIP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yuqin Cao, Xiongkuo Min, Wei Sun 0029, Xiao-Ping (Steven) Zhang, Guangtao Zhai Audio-Visual Quality Assessment for User Generated Content: Database and Method. Search on Bibsonomy ICIP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Pavel Korshunov, Haolin Chen, Philip N. Garner, Sébastien Marcel Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes. Search on Bibsonomy IJCB The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Syrine Haddad, Olfa Dâassi, Safya Belghith Emotion Recognition from Audio-Visual Information based on Convolutional Neural Network. Search on Bibsonomy ICCAD The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Moinak Bhattacharya, Prateek Prasanna Audio-visual feature fusion for improved thoracic disease classification. Search on Bibsonomy Medical Imaging: Computer-Aided Diagnosis The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Shan Liu, Bohan Wu, Shu Ma, Zhen Yang Advanced Audio-Visual Multimodal Warnings for Drivers: Effect of Specificity and Lead Time on Effectiveness. Search on Bibsonomy HCI (8) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Kazuki Seto, Yumi Asahi Sound Logo to Increase TV Advertising Effectiveness Based on Audio-Visual Features. Search on Bibsonomy HCI (5) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Chang Wang, Jun Du, Hang Chen, Ruoyu Wang 0029, Chao-Han Huck Yang, Jiangjiang Zhao, Yuling Ren, Qinglong Li, Chin-Hui Lee 0001 Enhancing Privacy Preservation with Quantum Computing for Word-Level Audio-Visual Speech Recognition. Search on Bibsonomy APSIPA ASC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yu-Ching Chung, Ji-Yan Han, Bo-Sin Wang, Wei-Zhong Zheng, Kung-Yao Shen, Ying-Hui Lai An Audio-Visual Speech Enhancement System Based on 3D Image Features: An Application in Hearing Aids. Search on Bibsonomy APSIPA ASC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Haodong Zhou, Tao Li, Jie Wang, Lin Li, Qingyang Hong CASA-Net: Cross-attention and Self-attention for End-to-End Audio-visual Speaker Diarization. Search on Bibsonomy APSIPA ASC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yoto Fujita, Yoshiaki Bando, Keisuke Imoto, Masaki Onishi, Kazuyoshi Yoshii DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection. Search on Bibsonomy APSIPA ASC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Vinaya Sree Katamneni, Ajita Rattani MIS-AVoiDD: Modality Invariant and Specific Representation for Audio-Visual Deepfake Detection. Search on Bibsonomy ICMLA The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Amandine Brunetto, Sascha Hornauer, Stella X. Yu, Fabien Moutarde The Audio-Visual BatVision Dataset for Research on Sight and Sound. Search on Bibsonomy IROS The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Haru Kondoh, Asako Kanezaki Multi-Goal Audio-Visual Navigation Using Sound Direction Map. Search on Bibsonomy IROS The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yusaku Nakajima, Masashi Hamaya, Kazutoshi Tanaka, Takafumi Hawai, Felix von Drigalski, Yasuo Takeichi, Yoshitaka Ushiku, Kanta Ono Robotic Powder Grinding with Audio-Visual Feedback for Laboratory Automation in Materials Science. Search on Bibsonomy IROS The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yizhuo Yang, Shenghai Yuan, Muqing Cao, Jianfei Yang, Lihua Xie AV-PedAware: Self-Supervised Audio-Visual Fusion for Dynamic Pedestrian Awareness. Search on Bibsonomy IROS The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Zexu Pan, Gordon Wichern, Yoshiki Masuyama, François G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux Scenario-Aware Audio-Visual TF-Gridnet for Target Speech Extraction. Search on Bibsonomy ASRU The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Jiachen Lian, Alexei Baevski, Wei-Ning Hsu, Michael Auli Av-Data2Vec: Self-Supervised Learning of Audio-Visual Speech Representations with Contextualized Target Representations. Search on Bibsonomy ASRU The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Cheng-I Jeff Lai, Freda Shi, Puyuan Peng, Yoon Kim, Kevin Gimpel, Shiyu Chang, Yung-Sung Chuang, Saurabhchand Bhati, David D. Cox, David Harwath, Yang Zhang 0001, Karen Livescu, James R. Glass Audio-Visual Neural Syntax Acquisition. Search on Bibsonomy ASRU The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Changan Chen, Wei Sun, David Harwath, Kristen Grauman Learning Audio-Visual Dereverberation. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yuqian Kuang, Xiaopeng Fan Collaborative Audio-Visual Event Localization Based on Sequential Decision and Cross-Modal Consistency. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Zirun Zhu, Hemin Yang, Min Tang, Ziyi Yang, Sefik Emre Eskimez, Huaming Wang Real-Time Audio-Visual End-To-End Speech Enhancement. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Zhongweiyang Xu, Xulin Fan, Mark Hasegawa-Johnson Dual-Path Cross-Modal Attention for Better Audio-Visual Speech Extraction. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Roshan Sharma, Weipeng He, Ju Lin, Egor Lakomkin, Yang Liu, Kaustubh Kalgaonkar Egocentric Audio-Visual Noise Suppression. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Jiahong Li, Chenda Li, Yifei Wu, Yanmin Qian Robust Audio-Visual ASR with Unified Cross-Modal Attention. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Rodrigo Mira, Buye Xu, Jacob Donley, Anurag Kumar 0003, Stavros Petridis, Vamsi Krishna Ithapu, Maja Pantic LA-VOCE: LOW-SNR Audio-Visual Speech Enhancement Using Neural Vocoders. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11R. Gnana Praveen, Eric Granger, Patrick Cardinal Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Gaopeng Xu, Xianliang Wang, Sang Wang, Junfeng Yuan, Wei Guo, Wei Li, Jie Gao The NIO System for Audio-Visual Diarization and Recognition in MISP Challenge 2022. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Zhe Wang, Shilong Wu, Hang Chen, Mao-Kui He, Jun Du, Chin-Hui Lee 0001, Jingdong Chen, Shinji Watanabe 0001, Sabato Marco Siniscalchi, Odette Scharenborg, Diyuan Liu, Baocai Yin, Jia Pan, Jianqing Gao, Cong Liu 0006 The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And Recognition. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Cassia Valentini-Botinhao, Andrea Lorena Aldana Blanco, Ondrej Klejch, Peter Bell 0001 Efficient Intelligibility Evaluation Using Keyword Spotting: A Study on Audio-Visual Speech Enhancement. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Christian Marinoni, Riccardo F. Gramaccioni, Changan Chen, Aurelio Uncini, Danilo Comminiello Overview of the L3DAS23 Challenge on Audio-Visual Extended Reality. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Mandar Gogate, Kia Dashtipour, Amir Hussain 0001 Towards Pose-Invariant Audio-Visual Speech Enhancement in the Wild for Next-Generation Multi-Modal Hearing Aids. Search on Bibsonomy ICASSP Workshops The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Hongbo Chen, Dongchen Zhu, Guanghui Zhang, Wenjun Shi, Xiaolin Zhang, Jiamao Li CM-CS: Cross-Modal Common-Specific Feature Learning For Audio-Visual Video Parsing. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Hui Chen, Hanyi Zhang, Longbiao Wang, Kong Aik Lee, Meng Liu, Jianwu Dang 0001 Self-Supervised Audio-Visual Speaker Representation with Co-Meta Learning. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yifei Wu, Chenda Li, Yanmin Qian Light-Weight Visualvoice: Neural Network Quantization On Audio Visual Speech Separation. Search on Bibsonomy ICASSP Workshops The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Jing-Xuan Zhang, Genshun Wan, Zhen-Hua Ling, Jia Pan, Jianqing Gao, Cong Liu 0006 Self-Supervised Audio-Visual Speech Representations Learning by Multimodal Self-Distillation. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Haitao Xu, Liangfa Wei, Jie Zhang 0042, Jianming Yang, Yannan Wang, Tian Gao, Xin Fang, Li-Rong Dai 0001 A Multi-Scale Feature Aggregation Based Lightweight Network for Audio-Visual Speech Enhancement. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11I-Chun Chern, Kuo-Hsuan Hung, Yi-Ting Chen, Tassadaq Hussain, Mandar Gogate, Amir Hussain 0001, Yu Tsao 0001, Jen-Cheng Hou Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings. Search on Bibsonomy ICASSP Workshops The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Ming Cheng, Haoxu Wang, Ziteng Wang, Qiang Fu, Ming Li 0026 The WHU-Alibaba Audio-Visual Speaker Diarization System for the MISP 2022 Challenge. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Xiaoming Ren, Chao Li, Shenjian Wang, Biao Li Practice of the Conformer Enhanced Audio-Visual Hubert on Mandarin and English. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Haoxu Wang, Ming Cheng, Qiang Fu, Ming Li 0026 The DKU Post-Challenge Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge: Deep Analysis. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yangcheng Li, Zefang Yu, Suncheng Xiang, Ting Liu 0016, Yuzhuo Fu AV-TAD: Audio-Visual Temporal Action Detection With Transformer. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Timothée Dhaussy, Bassam Jabaian, Fabrice Lefèvre, Radu Horaud Audio-Visual Speaker Diarization in the Framework of Multi-User Human-Robot Interaction. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Ya Jiang, Hang Chen, Jun Du, Qing Wang 0008, Chin-Hui Lee 0001 Incorporating Lip Features into Audio-Visual Multi-Speaker DOA Estimation by Gated Fusion. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Ali Golmakani, Mostafa Sadeghi, Romain Serizel Audio-Visual Speech Enhancement with a Deep Kalman Filter Generative Model. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Chang-Sung Sung, Jun-Cheng Chen, Chu-Song Chen Hearing and Seeing Abnormality: Self-Supervised Audio-Visual Mutual Learning for Deepfake Detection. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Pingchuan Ma 0001, Alexandros Haliassos, Adriana Fernandez-Lopez, Honglie Chen, Stavros Petridis, Maja Pantic Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Ruize Xu, Ruoxuan Feng, Shi-Xiong Zhang, Di Hu 0001 MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Rajat Hebbar, Digbalay Bose, Krishna Somandepalli, Veena Vijai, Shrikanth Narayanan A Dataset for Audio-Visual Sound Event Detection in Movies. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Tao Li, Haodong Zhou, Jie Wang, Qingyang Hong, Lin Li The XMU System for Audio-Visual Diarization and Recognition in MISP Challenge 2022. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Pengcheng Guo, He Wang, Bingshen Mu, Ao Zhang, Peikun Chen The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Meng Liu, Kong Aik Lee, Longbiao Wang, Hanyi Zhang, Chang Zeng, Jianwu Dang 0001 Cross-Modal Audio-Visual Co-Learning for Text-Independent Speaker Verification. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Jiuxin Lin, Xinyu Cai, Heinrich Dinkel, Jun Chen 0024, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Zhiyong Wu 0001, Yujun Wang, Helen Meng Av-Sepformer: Cross-Attention Sepformer for Audio-Visual Target Speaker Extraction. Search on Bibsonomy ICASSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Prerna Singh, Ayush Tripathi, Lalan Kumar, Tapan Kumar Gandhi Brain Connectivity Features-based Age Group Classification using Temporal Asynchrony Audio-Visual Integration Task. Search on Bibsonomy EMBC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Hannes Oppermann, Antonia Thelen, Jens Haueisen Entrainment and resonance effects with a new mobile audio-visual stimulation device. Search on Bibsonomy EMBC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yagna Gudipalli, Gauri Deshpande, Sachin Patel, Björn W. Schuller Deep Modelling Strategies for Human Confidence Classification using Audio-visual Data. Search on Bibsonomy EMBC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Xiaojing Yu, Lan Zhang 0002, Xiang-Yang Li E-Talk: Accelerating Active Speaker Detection with Audio-Visual Fusion and Edge-Cloud Computing. Search on Bibsonomy SECON The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Sunan Li, Hailun Lian, Cheng Lu 0005, Yan Zhao, Chuangao Tang, Yuan Zong, Wenming Zheng Audio-Visual Group-based Emotion Recognition using Local and Global Feature Aggregation based Multi-Task Learning. Search on Bibsonomy ICMI The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Guangyao Li, Wenxuan Hou, Di Hu 0001 Progressive Spatio-temporal Perception for Audio-Visual Question Answering. Search on Bibsonomy ACM Multimedia The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Tianyu Liu, Peng Zhang 0005, Wei Huang 0013, Yufei Zha, Tao You, Yanning Zhang Induction Network: Audio-Visual Modality Gap-Bridging for Self-Supervised Sound Source Localization. Search on Bibsonomy ACM Multimedia The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Shiping Ge, Zhiwei Jiang, Yafeng Yin, Cong Wang, Zifeng Cheng, Qing Gu Learning Event-Specific Localization Preferences for Audio-Visual Event Localization. Search on Bibsonomy ACM Multimedia The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Chenyu Yang, Mengxi Chen, Yanfeng Wang, Yu Wang 0027 Uncertainty-Guided End-to-End Audio-Visual Speaker Diarization for Far-Field Recordings. Search on Bibsonomy ACM Multimedia The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Sung Jin Um, Dongjin Kim, Jung Uk Kim Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization. Search on Bibsonomy ACM Multimedia The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Chen Liu 0028, Peike Patrick Li, Xingqun Qi, Hu Zhang, Lincheng Li, Dadong Wang, Xin Yu 0002 Audio-Visual Segmentation by Exploring Cross-Modal Mutual Semantics. Search on Bibsonomy ACM Multimedia The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Chao Sun, Min Chen 0003, Jialiang Cheng, Han Liang, Chuanbo Zhu 0002, Jincai Chen SCLAV: Supervised Cross-modal Contrastive Learning for Audio-Visual Coding. Search on Bibsonomy ACM Multimedia The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Hongye Liu, Xianhai Xie, Yang Gao, Zhou Yu 0001 Parameter-Efficient Transfer Learning for Audio-Visual-Language Tasks. Search on Bibsonomy ACM Multimedia The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Haotian Wang, Yuxuan Xi, Hang Chen, Jun Du, Yan Song 0001, Qing Wang 0008, Hengshun Zhou, Chenxi Wang, Jiefeng Ma, Pengfei Hu 0006, Ya Jiang, Shi Cheng, Jie Zhang 0042, Yuzhe Weng Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023. Search on Bibsonomy ACM Multimedia The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Chenyang Lyu, Wenxi Li, Tianbo Ji, Longyue Wang, Liting Zhou, Cathal Gurrin, Linyi Yang, Yi Yu 0001, Yvette Graham, Jennifer Foster Graph-Based Video-Language Learning with Multi-Grained Audio-Visual Alignment. Search on Bibsonomy ACM Multimedia The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Jiayi Zhang, Weixin Li 0001 Multi-Modal and Multi-Scale Temporal Fusion Architecture Search for Audio-Visual Video Parsing. Search on Bibsonomy ACM Multimedia The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Wenrui Li, Xi-Le Zhao, Zhengyu Ma, Xingtao Wang, Xiaopeng Fan, Yonghong Tian 0001 Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning. Search on Bibsonomy ACM Multimedia The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Soichiro Komura, Katsuyoshi Maeyama, Akira Taniguchi, Tadahiro Taniguchi Lexical Acquisition from Audio-Visual Streams Using a Multimodal Recurrent State-Space Model. Search on Bibsonomy ICDL The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yumi Hughes, Kae Mukai, Katsumi Watanabe, Kazutoshi Kudo An on-line study about recognition of improvisation theatre using audio-visual information. Search on Bibsonomy CogSci The full citation details ... 2023 DBLP  BibTeX  RDF
11Huilin Tian, Jingke Meng, Yuhan Yao, Wei-Shi Zheng 0001 Unimodal-Multimodal Collaborative Enhancement for Audio-Visual Event Localization. Search on Bibsonomy PRCV (6) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Pritam Sarkar, Aaron Posen, Ali Etemad AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work. Search on Bibsonomy AAAI The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Pritam Sarkar, Ali Etemad Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity. Search on Bibsonomy AAAI The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Mingrui Lao, Nan Pu, Yu Liu 0012, Kai He, Erwin M. Bakker, Michael S. Lew COCA: COllaborative CAusal Regularization for Audio-Visual Question Answering. Search on Bibsonomy AAAI The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Chen Chen 0075, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, Eng Siong Chng Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning. Search on Bibsonomy AAAI The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Simon Jenni, Alexander Black 0001, John P. Collomosse Audio-Visual Contrastive Learning with Temporal Self-Supervision. Search on Bibsonomy AAAI The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Jingfei Xia, Mingchen Zhuge, Tiantian Geng, Shun Fan, Yuantai Wei, Zhenyu He 0001, Feng Zheng Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs. Search on Bibsonomy AAAI The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Peijun Bao, Wenhan Yang, Boon Poh Ng, Meng Hwa Er, Alex C. Kot Cross-Modal Label Contrastive Learning for Unsupervised Audio-Visual Event Localization. Search on Bibsonomy AAAI The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Mina Huh, Saelyne Yang, Yi-Hao Peng, Xiang 'Anthony' Chen, Young-Ho Kim, Amy Pavel AVscript: Accessible Video Editing with Audio-Visual Scripts. Search on Bibsonomy CHI The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Wenting Zhao 0003, Shigang Wang, Yan Zhao 0012, Jian Wei, Tianshu Li A Novel Intelligent Assessment Based on Audio-Visual Data for Chinese Zither Fingerings. Search on Bibsonomy ICIG (4) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Xiaoyu Wu, Jucheng Qiu, Qiurui Yue GLTCM: Global-Local Temporal and Cross-Modal Network for Audio-Visual Event Localization. Search on Bibsonomy ICIG (2) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
Displaying result #801 - #900 of 3474 (100 per page; Change: )
Pages: [<<][1][2][3][4][5][6][7][8][9][10][11][12][13][14][15][16][17][18][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license