The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Searching for phrase Audio-visual (changed automatically) with no syntactic query expansion in all metadata.

Publication years (Num. hits)
1974-1993 (15) 1994-1996 (21) 1997 (55) 1998 (29) 1999 (34) 2000 (42) 2001 (73) 2002 (68) 2003 (103) 2004 (113) 2005 (100) 2006 (101) 2007 (144) 2008 (145) 2009 (118) 2010 (84) 2011 (103) 2012 (95) 2013 (102) 2014 (101) 2015 (85) 2016 (90) 2017 (99) 2018 (139) 2019 (160) 2020 (198) 2021 (269) 2022 (292) 2023 (418) 2024 (78)
Publication types (Num. hits)
article(1159) book(1) incollection(17) inproceedings(2241) phdthesis(43) proceedings(13)
Venues (Conferences, Journals, ...)
CoRR(567) ICASSP(158) INTERSPEECH(152) AVSP(136) HAVE(79) ACM Multimedia(75) ICME(69) ICMI(57) AVEC@ACM Multimedia(45) AVEC@MM(41) CVPR(40) IEEE Trans. Multim.(39) EUSIPCO(35) MMSP(32) IEEE Access(28) AAAI(24) More (+10 of total 800)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 800 occurrences of 559 keywords

Results
Found 3474 publication records. Showing 3474 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
11Yuxin Zhu, Xilei Zhu, Huiyu Duan, Jie Li, Kaiwei Zhang, Yucheng Zhu, Li Chen, Xiongkuo Min, Guangtao Zhai Audio-Visual Saliency for Omnidirectional Videos. Search on Bibsonomy ICIG (5) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Hong-Liang Dai, Xinfeng Zhang 0003, Haiyang Yu 0002 An Attention-based Audio-visual Fusion Method for Short Video Classification. Search on Bibsonomy BDIOT The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yibo He, Kah Phooi Seng, Li-Minn Ang, Xingyu Zhao Cycle-Consistent Generative Adversarial Network Architectures for Audio Visual Speech Recognition. Search on Bibsonomy ICSPCC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Peng Zhang, Hui Zhao, Meijuan Li, Yida Chen, Jianqiang Zhang, Fuqiang Wang, Xiaoming Wu Audio-Visual Emotion Recognition Based on Multi-Scale Channel Attention and Global Interactive Fusion. Search on Bibsonomy SMC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Jinxin Wang, Chao Yang 0024, Zhongwen Guo, Xiaomei Li, Weigang Wang An End-to-End Mandarin Audio-Visual Speech Recognition Model with a Feature Enhancement Module. Search on Bibsonomy SMC The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yapeng Li, Yong Luo 0002, Bo Du 0001 Audio-Visual Generalized Zero-Shot Learning Based on Variational Information Bottleneck. Search on Bibsonomy ICME The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Jinxin Wang, Zhongwen Guo, Chao Yang 0024, Xiaomei Li, Ziyuan Cui Multi-Scale Hybrid Fusion Network for Mandarin Audio-Visual Speech Recognition. Search on Bibsonomy ICME The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Wenrui Li, Zhengyu Ma, Liang-Jian Deng, Hengyu Man, Xiaopeng Fan Modality-Fusion Spiking Transformer Network for Audio-Visual Zero-Shot Learning. Search on Bibsonomy ICME The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Otniel-Bogdan Mercea, Thomas Hummel 0001, A. Sophia Koepke, Zeynep Akata Text-to-Feature Diffusion for Audio-Visual Few-Shot Learning. Search on Bibsonomy DAGM The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yutong Jiang, Kaoru Hirota, Yaping Dai, Ye Ji, Shuai Shao Abnormal Emotion Recognition Based on Audio-Visual Modality Fusion. Search on Bibsonomy ICIRA (1) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Shentong Mo, Yapeng Tian Audio-Visual Grouping Network for Sound Localization from Mixtures. Search on Bibsonomy CVPR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Junwen Xiong, Ganglai Wang, Peng Zhang 0005, Wei Huang 0013, Yufei Zha, Guangtao Zhai CASP-Net: Rethinking Video Saliency Prediction from an Audio-Visual Consistency Perceptual Perspective. Search on Bibsonomy CVPR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Weixuan Sun, Jiayi Zhang, Jianyuan Wang, Zheyuan Liu 0002, Yiran Zhong, Tianpeng Feng, Yandong Guo, Yanhao Zhang, Nick Barnes Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning. Search on Bibsonomy CVPR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Chao Feng, Ziyang Chen, Andrew Owens Self-Supervised Video Forensics by Audio-Visual Anomaly Detection. Search on Bibsonomy CVPR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Jiaben Chen, Renrui Zhang, Dongze Lian, Jiaqi Yang, Ziyao Zeng, Jianbo Shi iQuery: Instruments as Queries for Audio-Visual Sound Separation. Search on Bibsonomy CVPR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Reuben Tan, Arijit Ray, Andrea Burns, Bryan A. Plummer, Justin Salamon, Oriol Nieto, Bryan Russell, Kate Saenko Language-Guided Audio-Visual Source Separation via Trimodal Consistency. Search on Bibsonomy CVPR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu Egocentric Audio-Visual Object Localization. Search on Bibsonomy CVPR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Wenru Zheng, Ryota Yoshihashi, Rei Kawakami, Ikuro Sato, Asako Kanezaki Multi Event Localization by Audio-Visual Fusion with Omnidirectional Camera and Microphone Array. Search on Bibsonomy CVPR Workshops The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Davide Cozzolino, Alessandro Pianese, Matthias Nießner, Luisa Verdoliva Audio-Visual Person-of-Interest DeepFake Detection. Search on Bibsonomy CVPR Workshops The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Aggelina Chatziagapi, Dimitris Samaras AVFace: Towards Detailed Audio-Visual 4D Face Reconstruction. Search on Bibsonomy CVPR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Junyu Gao 0002, Mengyuan Chen, Changsheng Xu Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio- Visual Event Perception. Search on Bibsonomy CVPR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yan-Bo Lin, Yi-Lin Sung, Jie Lei 0003, Mohit Bansal, Gedas Bertasius Vision Transformers are Parameter-Efficient Audio-Visual Learners. Search on Bibsonomy CVPR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Tiantian Geng, Teng Wang, Jinming Duan 0001, Runmin Cong, Feng Zheng Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline. Search on Bibsonomy CVPR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Juheon Hwang, Jiwoo Kang Audio-visual Neural Face Generation with Emotional Stimuli. Search on Bibsonomy IEEE Big Data The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yi-Lu Jiang, Wen-Chang Chang, Chih-Yi Chiu Pineapple Quality Classification in a Multimodal Audio-Visual Dataset. Search on Bibsonomy IEEE Big Data The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Xingjian Diao, Ming Cheng, Shitong Cheng AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual Masked Autoencoder. Search on Bibsonomy ICTAI The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Donghuo Zeng, Kazushi Ikeda Triplet Loss with Curriculum Learning for Audio-Visual Retrieval. Search on Bibsonomy ISM The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Antonio Rios-Navarro, Enrique Piñero-Fuentes, Salvador Canas-Moreno, Aqib Javed, Jim Harkin, Alejandro Linares-Barranco LIPSFUS: A neuromorphic dataset for audio-visual sensory fusion of lip reading. Search on Bibsonomy ISCAS The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Abhijeet Bishnu, Ankit Gupta 0008, Mandar Gogate, Kia Dashtipour, Tughrul Arslan, Ahsan Adeel, Amir Hussain 0001, Mathini Sellathurai, Tharmalingam Ratnarajah Live Demonstration: Cloud-based Audio-Visual Speech Enhancement in Multimodal Hearing-aids. Search on Bibsonomy ISCAS The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Zheng Zhang 0043, Zheng Ning, Chenliang Xu, Yapeng Tian, Toby Jia-Jun Li PEANUT: A Human-AI Collaborative Tool for Annotating Audio-Visual Data. Search on Bibsonomy UIST The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Leena Mathur, Ralph Adolphs, Maja J. Mataric Towards Intercultural Affect Recognition: Audio-Visual Affect Recognition in the Wild Across Six Cultures. Search on Bibsonomy FG The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Ryan Buyssens [in]florescence - a tangible audio-visual installation. Search on Bibsonomy SIGGRAPH Labs The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Josef Chaloupka, Karel Palecek Audio-Visual Broadcast Transcription System in the Era of Covid-19. Search on Bibsonomy TSP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Hang Zhang, Xin Li, Lidong Bing Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding. Search on Bibsonomy EMNLP (Demos) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yuanyuan Jiang, Jianqin Yin Target-Aware Spatio-Temporal Reasoning via Answering Questions in Dynamic Audio-Visual Scenarios. Search on Bibsonomy EMNLP (Findings) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Joanna Hong, Se Jin Park, Yong Man Ro Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model. Search on Bibsonomy EMNLP (Findings) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Craig Cieciura, Maxine Glancy, Philip J. B. Jackson Producing Personalised Object-Based Audio-Visual Experiences: an Ethnographic Study. Search on Bibsonomy IMX The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Gaurav Singh, Paul Ghanem, Taskin Padir Sporadic Audio-Visual Embodied Assistive Robot Navigation For Human Tracking. Search on Bibsonomy PETRA The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Haoyi Duan, Yan Xia, Mingze Zhou, Li Tang, Jieming Zhu, Zhou Zhao Cross-modal Prompts: Adapting Large Pre-trained Models for Audio-Visual Downstream Tasks. Search on Bibsonomy NeurIPS The full citation details ... 2023 DBLP  BibTeX  RDF
11Kazuki Shimada, Archontis Politis, Parthasaarathy Sudarsanam, Daniel Aleksander Krause 0001, Kengo Uchida, Sharath Adavanne, Aapo Hakala, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events. Search on Bibsonomy NeurIPS The full citation details ... 2023 DBLP  BibTeX  RDF
11Yung-Hsuan Lai, Yen-Chun Chen 0001, Frank Wang Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event Parser. Search on Bibsonomy NeurIPS The full citation details ... 2023 DBLP  BibTeX  RDF
11Yingying Fan, Yu Wu 0011, Bo Du, Yutian Lin Revisit Weakly-Supervised Audio-Visual Video Parsing from the Language Perspective. Search on Bibsonomy NeurIPS The full citation details ... 2023 DBLP  BibTeX  RDF
11Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis. Search on Bibsonomy NeurIPS The full citation details ... 2023 DBLP  BibTeX  RDF
11Yuxin Guo, Shijie Ma, Hu Su, Zhiqing Wang, Yuhao Zhao, Wei Zou, Siyang Sun, Yun Zheng Dual Mean-Teacher: An Unbiased Semi-Supervised Framework for Audio-Visual Source Localization. Search on Bibsonomy NeurIPS The full citation details ... 2023 DBLP  BibTeX  RDF
11Shentong Mo, Bhiksha Raj Weakly-Supervised Audio-Visual Segmentation. Search on Bibsonomy NeurIPS The full citation details ... 2023 DBLP  BibTeX  RDF
11Luchcha Lam, Minsoo Choi, Magzhan Mukanova, Klay Hauser, Fangzheng Zhao, Richard E. Mayer, Christos Mousas, Nicoletta Adamo-Villani Effects of Body Type and Voice Pitch on Perceived Audio-Visual Correspondence and Believability of Virtual Characters. Search on Bibsonomy SAP The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Qichen Zheng, Jie Hong, Moshiur Farazi A Generative Approach to Audio-Visual Generalized Zero-Shot Learning: Combining Contrastive and Discriminative Techniques. Search on Bibsonomy IJCNN The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Silong Liang, Chunxiao Li, Naying Cui, Minghui Sun, Hao Xue 3DSEAVNet: 3D-Squeeze-and-Excitation Networks for Audio-Visual Saliency Prediction. Search on Bibsonomy IJCNN The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Jinqiao Dou, Xi Chen 0025, Yuehai Wang Specialty may be better: A decoupling multi-modal fusion network for Audio-visual event localization. Search on Bibsonomy IJCNN The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Gnana Praveen Rajasekhar, Jahangir Alam Audio-Visual Speaker Verification via Joint Cross-Attention. Search on Bibsonomy SPECOM (2) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Salam Nandakishor, Debadatta Pati Improvement of Audio-Visual Keyword Spotting System Accuracy Using Excitation Source Feature. Search on Bibsonomy SPECOM (2) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Denis Ivanko, Elena Ryumina, Dmitry Ryumin, Alexandr Axyonov, Alexey M. Kashevnik, Alexey Karpov 0001 EMO-AVSR: Two-Level Approach for Audio-Visual Emotional Speech Recognition. Search on Bibsonomy SPECOM (1) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yidan Fan, Yongxin Yu, Wenhuan Lu, Yahong Han A Cross-modal and Redundancy-reduced Network for Weakly-Supervised Audio-Visual Violence Detection. Search on Bibsonomy MMAsia The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Darshan Singh S, Anchit Gupta, C. V. Jawahar, Makarand Tapaswi Unsupervised Audio-Visual Lecture Segmentation. Search on Bibsonomy WACV The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Tanvir Mahmud, Diana Marculescu AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio Visual Event Localization. Search on Bibsonomy WACV The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Madhav Agarwal, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar Audio-Visual Face Reenactment. Search on Bibsonomy WACV The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Arda Senocak, Junsik Kim 0001, Tae-Hyun Oh, Dingzeyu Li, In So Kweon Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video Understanding. Search on Bibsonomy WACV The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Maxime Burchi, Radu Timofte Audio-Visual Efficient Conformer for Robust Speech Recognition. Search on Bibsonomy WACV The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Jialiang Cheng, Chao Sun, Jincai Chen, Ping Lu 0006 Audio-visual mutual learning for Weakly Supervised Violence Detection. Search on Bibsonomy ICISE The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Jialing Zou, Jiahao Mei, Guangze Ye, Tianyu Huai, Qiwei Shen, Daoguo Dong EMID: An Emotional Aligned Dataset in Audio-Visual Modality. Search on Bibsonomy MCGE@MM The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Kei Suzuki, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai Audio-Visual Class Association Based on Two-stage Self-supervised Contrastive Learning towards Robust Scene Analysis. Search on Bibsonomy SII The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yuan Gong 0001, Andrew Rouditchenko, Alexander H. Liu, David Harwath, Leonid Karlinsky, Hilde Kuehne, James R. Glass Contrastive Audio-Visual Masked Autoencoder. Search on Bibsonomy ICLR The full citation details ... 2023 DBLP  BibTeX  RDF
11Haoyue Cheng, Zhaoyang Liu, Wayne Wu, Limin Wang 0002 Filter-Recovery Network for Multi-Speaker Audio-Visual Speech Separation. Search on Bibsonomy ICLR The full citation details ... 2023 DBLP  BibTeX  RDF
11Shentong Mo, Pedro Morgado 0001 A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition. Search on Bibsonomy ICML The full citation details ... 2023 DBLP  BibTeX  RDF
11Yuchen Hu, Ruizhe Li 0001, Chen Chen 0075, Heqing Zou, Qiushi Zhu, Eng Siong Chng Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition. Search on Bibsonomy IJCAI The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Xilei Zhu, Huiyu Duan, Yuqin Cao, Yuxin Zhu, Yucheng Zhu, Jing Liu, Li Chen, Xiongkuo Min, Guangtao Zhai Perceptual Quality Assessment of Omnidirectional Audio-Visual Signals. Search on Bibsonomy CICAI (2) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Hanyuan Wang, Majid Mirmehdi, Dima Damen, Toby Perrett Centre Stage: Centricity-based Audio-Visual Temporal Action Detection. Search on Bibsonomy BMVC Workshop The full citation details ... 2023 DBLP  BibTeX  RDF
11Yating Xu, Conghui Hu, Gim Hee Lee Motion and Context-Aware Audio-Visual Conditioned Video Prediction. Search on Bibsonomy BMVC The full citation details ... 2023 DBLP  BibTeX  RDF
11Feixiang Wang, Shuang Yang, Shiguang Shan, Xilin Chen 0001 Dual Attention for Audio-Visual Speech Enhancement with Facial Cues. Search on Bibsonomy BMVC The full citation details ... 2023 DBLP  BibTeX  RDF
11Jiarui Yu, Haoran Li, Yanbin Hao, Jinmeng Wu, Tong Xu 0001, Shuo Wang 0008, Xiangnan He 0001 How Can Contrastive Pre-training Benefit Audio-Visual Segmentation? A Study from Supervised and Zero-shot Perspectives. Search on Bibsonomy BMVC The full citation details ... 2023 DBLP  BibTeX  RDF
11Tomoya Yoshinaga, Keitaro Tanaka, Shigeo Morishima Audio-Visual Speech Enhancement with Selective Off-Screen Speech Extraction. Search on Bibsonomy EUSIPCO The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Özkan Çayli, Xubo Liu, Volkan Kiliç, Wenwu Wang 0001 Knowledge Distillation for Efficient Audio-Visual Video Captioning. Search on Bibsonomy EUSIPCO The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Kenichi Ito, Juro Hosoi, Yuki Ban, Takayuki Kikuchi, Kyosuke Nakagawa, Hanako Kitagawa, Chizuru Murakami, Yosuke Imai, Shin'ichi Warisawa Wind comfort and emotion can be changed by the cross-modal presentation of audio-visual stimuli of indoor and outdoor environments. Search on Bibsonomy VR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Dan Si, Qing Ye, Jindi Lv, Yuhao Zhou, Jiancheng Lv 0001 Violence-MFAS: Audio-Visual Violence Detection Using Multimodal Fusion Architecture Search. Search on Bibsonomy ICONIP (14) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yuchen Hu, Chen Chen 0075, Ruizhe Li 0001, Heqing Zou, Eng Siong Chng MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition. Search on Bibsonomy ACL (1) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren 0006, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation. Search on Bibsonomy ACL (1) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Wang Lin, Tao Jin, Wenwen Pan, Linjun Li, Xize Cheng, Ye Wang, Zhou Zhao TAVT: Towards Transferable Audio-Visual Text Generation. Search on Bibsonomy ACL (1) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Yuchen Hu, Ruizhe Li 0001, Chen Chen 0075, Chengwei Qin, Qiu-Shi Zhu, Eng Siong Chng Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition. Search on Bibsonomy ACL (1) The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
11Juan F. Montesinos Towards efficient audio-visual source separation and synthesis Search on Bibsonomy 2023   RDF
11Shota Abe, Shuichi Sakamoto, Zhengile Cui, Yôiti Suzuki Determination of optimal levels of whole-body vibration using audio-visual information of multimodal content. Search on Bibsonomy J. Inf. Hiding Multim. Signal Process. The full citation details ... 2022 DBLP  BibTeX  RDF
11Yasar Dasdemir, Rüstem Özakar Affective states classification performance of audio-visual stimuli from EEG signals with multiple-instance learning. Search on Bibsonomy Turkish J. Electr. Eng. Comput. Sci. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Maria Pawelec Deepfakes and Democracy (Theory): How Synthetic Audio-Visual Media for Disinformation and Hate Speech Threaten Core Democratic Functions. Search on Bibsonomy Digit. Soc. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Jianning Wu, Zhuqing Jiang, Qingchao Chen, Shiping Wen 0001, Aidong Men, Haiying Wang 0005 Toward a perceptive pretraining framework for Audio-Visual Video Parsing. Search on Bibsonomy Inf. Sci. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Hacene Terbouche, Liam Schoneveld, Oisin Benson, Alice Othmani Comparing Learning Methodologies for Self-Supervised Audio-Visual Representation Learning. Search on Bibsonomy IEEE Access The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Tomoya Sato, Yusuke Sugano, Yoichi Sato Self-Supervised Learning for Audio-Visual Relationships of Videos With Stereo Sounds. Search on Bibsonomy IEEE Access The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Pratibha Kumari 0001, Mukesh Saini An Adaptive Framework for Anomaly Detection in Time-Series Audio-Visual Data. Search on Bibsonomy IEEE Access The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Honghui Xu, Zhipeng Cai 0001, Daniel Takabi, Wei Li 0059 Audio-Visual Autoencoding for Privacy-Preserving Video Streaming. Search on Bibsonomy IEEE Internet Things J. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Moonisa Ahsan, Fabio Marton, Ruggero Pintus, Enrico Gobbetti Audio-visual annotation graphs for guiding lens-based scene exploration. Search on Bibsonomy Comput. Graph. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Saswati Debnath, Pinki Roy Audio-visual speech recognition based on machine learning approach. Search on Bibsonomy Int. J. Adv. Intell. Paradigms The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Jannik Zürn, Wolfram Burgard Self-Supervised Moving Vehicle Detection From Audio-Visual Cues. Search on Bibsonomy IEEE Robotics Autom. Lett. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Xinyuan Qian, Qiquan Zhang, Guohui Guan, Wei Xue Deep Audio-Visual Beamforming for Speaker Localization. Search on Bibsonomy IEEE Signal Process. Lett. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Gonzalo D. Sad, Lucas D. Terissi, Juan Carlos Gómez Complementary models for audio-visual speech classification. Search on Bibsonomy Int. J. Speech Technol. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11 Detecting adversarial attacks on audio-visual speech recognition using deep learning method. Search on Bibsonomy Int. J. Speech Technol. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Aishan Liu, Huiyuan Xie, Xianglong Liu 0001, Zixin Yin, Shunchang Liu Revisiting audio visual scene-aware dialog. Search on Bibsonomy Neurocomputing The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Zhen Liang, Xihao Zhang, Rushuang Zhou, Li Zhang 0041, Linling Li, Gan Huang, Zhiguo Zhang 0001 Cross-individual affective detection using EEG signals with audio-visual embedding. Search on Bibsonomy Neurocomputing The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Xinyuan Qian, Alessio Brutti, Oswald Lanz, Maurizio Omologo, Andrea Cavallaro Audio-Visual Tracking of Concurrent Speakers. Search on Bibsonomy IEEE Trans. Multim. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Aihua Zheng, Menglan Hu, Bo Jiang 0002, Yan Huang, Yan Yan 0002, Bin Luo 0001 Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching. Search on Bibsonomy IEEE Trans. Multim. The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Efthymios Tzinis, Scott Wisdom, Tal Remez, John R. Hershey AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Ziwei Ji, Yan Xu 0012, I-Tsun Cheng, Samuel Cahyawijaya, Rita Frieske, Etsuko Ishii, Min Zeng, Andrea Madotto, Pascale Fung VScript: Controllable Script Generation with Audio-Visual Presentation. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
11Wenliang Dai, Samuel Cahyawijaya, Tiezheng Yu, Elham J. Barezi, Peng Xu 0008, Cheuk Tung Shadow Yiu, Rita Frieske, Holy Lovenia, Genta Indra Winata, Qifeng Chen, Xiaojuan Ma, Bertram E. Shi, Pascale Fung CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  BibTeX  RDF
Displaying result #901 - #1000 of 3474 (100 per page; Change: )
Pages: [<<][1][2][3][4][5][6][7][8][9][10][11][12][13][14][15][16][17][18][19][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license