The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Searching for phrase Audio-visual (changed automatically) with no syntactic query expansion in all metadata.

Publication years (Num. hits)
1974-1993 (15) 1994-1996 (21) 1997 (55) 1998 (29) 1999 (34) 2000 (42) 2001 (73) 2002 (68) 2003 (103) 2004 (113) 2005 (100) 2006 (101) 2007 (144) 2008 (145) 2009 (118) 2010 (84) 2011 (103) 2012 (95) 2013 (102) 2014 (101) 2015 (85) 2016 (90) 2017 (99) 2018 (139) 2019 (160) 2020 (198) 2021 (269) 2022 (292) 2023 (418) 2024 (78)
Publication types (Num. hits)
article(1159) book(1) incollection(17) inproceedings(2241) phdthesis(43) proceedings(13)
Venues (Conferences, Journals, ...)
CoRR(567) ICASSP(158) INTERSPEECH(152) AVSP(136) HAVE(79) ACM Multimedia(75) ICME(69) ICMI(57) AVEC@ACM Multimedia(45) AVEC@MM(41) CVPR(40) IEEE Trans. Multim.(39) EUSIPCO(35) MMSP(32) IEEE Access(28) AAAI(24) More (+10 of total 800)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 800 occurrences of 559 keywords

Results
Found 3474 publication records. Showing 3474 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
28George Drettakis Audiovisual 3d rendering as a tool for multimodal interfaces. Search on Bibsonomy ICMI The full citation details ... 2008 DBLP  DOI  BibTeX  RDF computer graphics, 3d audio
28Lei Xie 0001, Zhi-Qiang Liu Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modelling. Search on Bibsonomy IEEE Trans. Multim. The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
28Boris Reuderink, Mannes Poel, Khiet P. Truong, Ronald Poppe, Maja Pantic Decision-Level Fusion for Audio-Visual Laughter Detection. Search on Bibsonomy MLMI The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
28Niall A. Fox, Brian A. O'Mullane, Richard B. Reilly VALID: A New Practical Audio-Visual Database, and Comparative Results. Search on Bibsonomy AVBPA The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
28Léon J. M. Rothkrantz, Jacek C. Wojdel, Pascal Wiggers Fusing Data Streams in Continuous Audio-Visual Speech Recognition. Search on Bibsonomy TSD The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
28Guillaume Lathoud, Jean-Marc Odobez, Daniel Gatica-Perez AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking. Search on Bibsonomy MLMI The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
28Sofia Tsekeridou, Stelios Krinidis, Ioannis Pitas Scene Change Detection Based on Audio-Visual Analysis and Interaction. Search on Bibsonomy Theoretical Foundations of Computer Vision The full citation details ... 2000 DBLP  DOI  BibTeX  RDF
28Mary Mikhail, Giovanni Palumbo, Jinane Mohammad, Mohamed El-Helaly, Aishy Amer An Online System for Synchronized Processing of Video and Audio Signals. Search on Bibsonomy CCECE The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
27Tapio Lokki, Matti Gröhn Navigation with Auditory Cues in a Virtual Environment. Search on Bibsonomy IEEE Multim. The full citation details ... 2005 DBLP  DOI  BibTeX  RDF virtual environments, 3D sound, audio-visual, Auditory navigation
26Hari Krishna Maganti, Daniel Gatica-Perez Speaker localization for microphone array-based ASR: the effects of accuracy on overlapping speech. Search on Bibsonomy ICMI The full citation details ... 2006 DBLP  DOI  BibTeX  RDF audio-visual speaker tracking, microphone array ASR
26Vedad Hulusic, Kurt Debattista, Vibhor Aggarwal, Alan Chalmers Exploiting Audio-Visual Cross-Modal Interaction to Reduce Computational Requirements in Interactive Environments. Search on Bibsonomy VS-GAMES The full citation details ... 2010 DBLP  DOI  BibTeX  RDF sound effects, frame rate perception, perception, psychophysics, cross-modal interaction, audio-visual
26Walid Karam, Chafic Mokbel, Hanna Greige, Gérard Chollet Audio-Visual Identity Verification and Robustness to Imposture. Search on Bibsonomy ICB The full citation details ... 2009 DBLP  DOI  BibTeX  RDF audio-visual forgery, talking-face imposture, biometric verification robustness, Identity verification, face animation, voice conversion
26Danqi Chen 0001, Dongmei Jiang, Ilse Ravyse, Hichem Sahli Audio-Visual Emotion Recognition Based on a DBN Model with Constrained Asynchrony. Search on Bibsonomy ICIG The full citation details ... 2009 DBLP  DOI  BibTeX  RDF audio visual multi-stream, asynchronous DBN model
26Jie Luo, Barbara Caputo, Alon Zweig, Jörg-Hendrik Bach, Jörn Anemüller Object Category Detection Using Audio-Visual Cues. Search on Bibsonomy ICVS The full citation details ... 2008 DBLP  DOI  BibTeX  RDF Multimodal Recognition, Audio-visual Fusion, Object Categorization
26Girija Chetty, Michael Wagner 0004 Audio Visual Speaker Verification Based on Hybrid Fusion of Cross Modal Features. Search on Bibsonomy PReMI The full citation details ... 2007 DBLP  DOI  BibTeX  RDF speaker identity verification, liveness checking, cross modal correlations, Audio-visual
26Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divakaran, Thomas S. Huang Highlights extraction from sports video based on an audio-visual marker detection framework. Search on Bibsonomy ICME The full citation details ... 2005 DBLP  DOI  BibTeX  RDF audio-visual marker, visual object detection algorithm, semantic object, audio classification algorithm, sports highlights extraction, finer-resolution highlight segment, color information, grouping phase, soccer, golf video, motion information, baseball
26Vladimir Pavlovic 0001, G. A. Berry, Thomas S. Huang Integration of Audio/Visual Information for Use in Human-Computer Intelligent Interaction. Search on Bibsonomy ICIP (1) The full citation details ... 1997 DBLP  DOI  BibTeX  RDF audio/visual information integration, human-computer intelligent interaction, human-computer communication, auditory features, automatic gesture recognition, user interfaces, virtual environments, automatic speech recognition, visual features, computer interfaces, human communication
26Alexander Haubold, Promiti Dutta, John R. Kender Evaluation of video browser features and user interaction with VAST MM. Search on Bibsonomy ACM Multimedia The full citation details ... 2008 DBLP  DOI  BibTeX  RDF presentation video, speaker index, structure in videos, text augmentation, transcript analysis, evaluation, measures, user studies, automatic speech recognition, streaming video, speaker segmentation, video library, visual segmentation
26Simone Cifani, Andrew Abel, Amir Hussain 0001, Stefano Squartini, Francesco Piazza An Investigation into Audiovisual Speech Correlation in Reverberant Noisy Environments. Search on Bibsonomy COST 2102 Conference (Prague) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
26Shih-Fu Chang, Dan Ellis, Wei Jiang 0001, Keansub Lee, Akira Yanagawa, Alexander C. Loui, Jiebo Luo Large-scale multimodal semantic concept detection for consumer video. Search on Bibsonomy Multimedia Information Retrieval The full citation details ... 2007 DBLP  DOI  BibTeX  RDF consumer video indexing, video classification, multimedia ontology, semantic classification
26Zohar Barzelay, Yoav Y. Schechner Harmony in Motion. Search on Bibsonomy CVPR The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
26Lei Xie 0001, Helen Meng, Zhi-Qiang Liu A Cantonese Speech-Driven Talking Face Using Translingual Audio-to-Visual Conversion. Search on Bibsonomy ISCSLP The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
26Sama'a Al Hashimi, Gordon Davies Vocal telekinesis: physical control of inanimate objects with minimal paralinguistic voice input. Search on Bibsonomy ACM Multimedia The full citation details ... 2006 DBLP  DOI  BibTeX  RDF paralanguage, vocal input, vocal telekinesis, voice-physical
26Koji Iwano, Taro Miyazaki, Sadaoki Furui Multimodal Speaker Verification Using Ear Image Features Extracted by PCA and ICA. Search on Bibsonomy AVBPA The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
26Jong-Seok Lee, Touradj Ebrahimi Two-Level Bimodal Association for Audio-Visual Speech Recognition. Search on Bibsonomy ACIVS The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
26Keni Bernardin, Rainer Stiefelhagen, Alex Waibel Probabilistic integration of sparse audio-visual cues for identity tracking. Search on Bibsonomy ACM Multimedia The full citation details ... 2008 DBLP  DOI  BibTeX  RDF modality fusion, sensor fusion, smart environments, human perception
26Vasil Khalidov, Florence Forbes, Miles E. Hansard, Elise Arnaud, Radu Horaud Audio-Visual Clustering for 3D Speaker Localization. Search on Bibsonomy MLMI The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
26Jan Kratt, Florian Metze, Rainer Stiefelhagen, Alex Waibel Large Vocabulary Audio-Visual Speech Recognition Using the Janus Speech Recognition Toolkit. Search on Bibsonomy DAGM-Symposium The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
26Zeeshan Rasheed, Mubarak Shah Movie Genre Classification By Exploiting Audio-Visual Features Of Previews. Search on Bibsonomy ICPR (2) The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
26Shahrokh Ghaemmaghami Audio Segmentation and Classification based on a Selective Analysis Scheme. Search on Bibsonomy MMM The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
24Einat Kidron, Yoav Y. Schechner, Michael Elad Cross-Modal Localization via Sparsity. Search on Bibsonomy IEEE Trans. Signal Process. The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
24Einat Kidron, Yoav Y. Schechner, Michael Elad Pixels that Sound. Search on Bibsonomy CVPR (1) The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
24Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano Realizing Audio-Visually Triggered ELIZA-Like Non-verbal Behaviors. Search on Bibsonomy PRICAI The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
24Serdar Yildirim, Shrikanth S. Narayanan Automatic Detection of Disfluency Boundaries in Spontaneous Speech of Children Using Audio-Visual Information. Search on Bibsonomy IEEE Trans. Speech Audio Process. The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
24Juergen Luettin, Stéphane Dupont Continuous Audio-Visual Speech Recognition. Search on Bibsonomy ECCV (2) The full citation details ... 1998 DBLP  DOI  BibTeX  RDF
24Emily Mower, Sungbok Lee, Maja J. Mataric, Shrikanth S. Narayanan Human perception of synthetic character emotions in the presence of conflicting and congruent vocal and facial expressions. Search on Bibsonomy ICASSP The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
24Niall A. Fox, Ralph Gross, Jeffrey F. Cohn, Richard B. Reilly Robust Biometric Person Identification Using Automatic Classifier Fusion of Speech, Mouth, and Face Experts. Search on Bibsonomy IEEE Trans. Multim. The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
24Zhiyong Wu 0001, Lianhong Cai, Helen M. Meng Multi-level Fusion of Audio and Visual Features for Speaker Identification. Search on Bibsonomy ICB The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
24Myung-Won Kim, Joung Woo Ryu, Eun Ju Kim Speech Recognition with Multi-modal Features Based on Neural Networks. Search on Bibsonomy ICONIP (2) The full citation details ... 2006 DBLP  DOI  BibTeX  RDF neural network, speech recognition, sequential pattern, post-processing, contextual information
24Xingquan Zhu 0001, Xindong Wu 0001, Ahmed K. Elmagarmid, Zhe Feng 0001, Lide Wu Video Data Mining: Semantic Indexing and Event Detection from the Association Perspective. Search on Bibsonomy IEEE Trans. Knowl. Data Eng. The full citation details ... 2005 DBLP  DOI  BibTeX  RDF knowledge-based systems, multimedia systems, database management, Video mining
24Gianluca Monaci, Òscar Divorra Escoda, Pierre Vandergheynst Analysis of multimodal signals using redundant representations. Search on Bibsonomy ICIP (3) The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
24Shankar T. Shivappa, Bhaskar D. Rao, Mohan M. Trivedi Multimodal information fusion using the iterative decoding algorithm and its application to audio-visual speech recognition. Search on Bibsonomy ICASSP The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
24Tero Jokela, Jaakko Lehikoinen, Hannu Korhonen Mobile multimedia presentation editor: enabling creation of audio-visual stories on mobile devices. Search on Bibsonomy CHI The full citation details ... 2008 DBLP  DOI  BibTeX  RDF user interfaces, mobile devices, interaction design, authoring, storytelling, smil, multimedia presentations, editor, content creation, mms, multimedia messages
24Andrew Abel, Amir Hussain 0001 Multi-modal Speech Processing Methods: An Overview and Future Research Directions Using a MATLAB Based Audio-Visual Toolbox. Search on Bibsonomy COST 2102 School (Vietri) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
24Zhihong Zeng, Jilin Tu, Ming Liu 0009, Thomas S. Huang, Brian Pianfetti, Dan Roth, Stephen E. Levinson Audio-Visual Affect Recognition. Search on Bibsonomy IEEE Trans. Multim. The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
24Zhihong Zeng, Yuxiao Hu 0001, Glenn I. Roisman, Zhen Wen, Yun Fu 0001, Thomas S. Huang Audio-Visual Spontaneous Emotion Recognition. Search on Bibsonomy Artifical Intelligence for Human Computing The full citation details ... 2007 DBLP  DOI  BibTeX  RDF Affective computing, emotion recognition, Multimodal Human-Computer Interaction, affect recognition
24Adam O'Donovan, Ramani Duraiswami, Jan Neumann Microphone Arrays as Generalized Cameras for Integrated Audio Visual Processing. Search on Bibsonomy CVPR The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
24Zhihong Zeng, Yuxiao Hu 0001, Yun Fu 0001, Thomas S. Huang, Glenn I. Roisman, Zhen Wen Audio-visual emotion recognition in adult attachment interview. Search on Bibsonomy ICMI The full citation details ... 2006 DBLP  DOI  BibTeX  RDF affective computing, emotion recognition, multimodal human-computer interaction, affect recognition
24Harriet J. Nock, Giridharan Iyengar, Chalapathy Neti Speaker Localisation Using Audio-Visual Synchrony: An Empirical Study. Search on Bibsonomy CIVR The full citation details ... 2003 DBLP  DOI  BibTeX  RDF
24Milind R. Naphade, Ashutosh Garg, Thomas S. Huang Duration Dependent Input Output Markov Models For Audio-Visual Event Detection. Search on Bibsonomy ICME The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
24Ashutosh Garg, Vladimir Pavlovic 0001, James M. Rehg Audio-Visual Speaker Detection Using Dynamic Bayesian Networks. Search on Bibsonomy FG The full citation details ... 2000 DBLP  DOI  BibTeX  RDF dynamic Bayesian networks, multimodal HCI, speaker detection
23Ashish Verma, L. Venkata Subramaniam, Nitendra Rajput, Chalapathy Neti, Tanveer A. Faruquie Animating expressive faces across languages. Search on Bibsonomy IEEE Trans. Multim. The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
23Surya Nepal, Uma Srinivasan 0001, Graham J. Reynolds Semantic Based Retrieval Model for Digital Audio and Video. Search on Bibsonomy ICME The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
23Petar S. Aleksic, Aggelos K. Katsaggelos Speech-to-video synthesis using MPEG-4 compliant visual features. Search on Bibsonomy IEEE Trans. Circuits Syst. Video Technol. The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
22Mihai Gurban, Jean-Philippe Thiran, Thomas Drugman, Thierry Dutoit Dynamic modality weighting for multi-stream hmms inaudio-visual speech recognition. Search on Bibsonomy ICMI The full citation details ... 2008 DBLP  DOI  BibTeX  RDF multi-stream hmm, stream reliability, multimodal fusion, audio-visual speech recognition
22Annie On Ni Wan, Hiroki Nishino, Pamela Pietro Tre marie. Search on Bibsonomy ACM Multimedia The full citation details ... 2006 DBLP  DOI  BibTeX  RDF RF-ID, audio-visual improvisation, dance performance, open sound control, bluetooth
22Yoshinao Takemae, Takehiko Ohno, Ikuo Yoda, Shinji Ozawa Estimating human interruptibility in the home for remote communication. Search on Bibsonomy CHI Extended Abstracts The full citation details ... 2006 DBLP  DOI  BibTeX  RDF audio-visual tracking, online remote communication, awareness, presence, interruptibility, home
22Kongwah Wan, Xin Yan 0001, Changsheng Xu Automatic mobile sports highlights. Search on Bibsonomy ICME The full citation details ... 2005 DBLP  DOI  BibTeX  RDF a priori decision scheme, automatic mobile sports highlight, sports video highlight, live game, mobile videophone, GPRS network, audio-visual feature, circular buffer, real-time system, 3G network, real-time analysis, mobile advertising
22John E. Redford, Keith S. Ruttle, Timothy M. Dobson Video over ATM: experience from the Cambridge Interactive TV Trial. Search on Bibsonomy ICIP The full citation details ... 1995 DBLP  DOI  BibTeX  RDF cable television, Cambridge Interactive TV Trial, interactive TV industry, business ideas, MPEG audio/visual streams, CiTVIC, CTSN, asynchronous transfer mode, ATM, decoding, decoding, interactive television, service provision, telecommunication standards, digital television, interactive video, television standards, technology infrastructure
22Dimitris I. Rigas, Dave Memery Utilising Audio-Visual Stimuli in Interactive Information Systems: A Two Domain Investigation on Auditory Metaphors. Search on Bibsonomy ITCC The full citation details ... 2002 DBLP  DOI  BibTeX  RDF Interactive Information Systems, Stock Control Systems, E-Mail Tool, Auditory Design, User Interface, Multimedia, Software Design, Speech
22Robert Kaucic, Barney Dalton, Andrew Blake 0001 Real-Time Lip Tracking for Audio-Visual Speech Recognition Applications. Search on Bibsonomy ECCV (2) The full citation details ... 1996 DBLP  DOI  BibTeX  RDF
21Matteo Bregonzio, Murtaza Taj, Andrea Cavallaro Multi-Modal Particle Filtering Tracking using Appearance, Motion and Audio Likelihoods. Search on Bibsonomy ICIP (5) The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
21Kyung-Ae Cha, Kyungdeok Kim MPEG-4 Scene Description Optimization for Interactive Terrestrial DMB Content. Search on Bibsonomy ICESS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF T-DMB, Scene description Optimization, MPEG-4 System, Interactive Content, BIFS
21Ming Liu 0009, Hao Tang 0001, Huazhong Ning, Thomas S. Huang Person Identification Based on Multichannel and Multimodality Fusion. Search on Bibsonomy CLEAR The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
21Seungmin Rho, SooCheol Lee, Eenjun Hwang, YangKyoo Lee XCRAB: A Content and Annotation-Based Multimedia Indexing and Retrieval System. Search on Bibsonomy ICCSA (4) The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
21Adriano de Andrade Bresolin, Diamantino Rui da Silva Freitas, Adrião Duarte Dória Neto, Pablo Javier Alsina European and American Audio-Visual Speech Recognition, Using SVM in Portuguese Language. Search on Bibsonomy DCC The full citation details ... 2008 DBLP  DOI  BibTeX  RDF Image Pattern Recognition, Neural Networks, Speech Recognition
21Zhihong Zeng, Yuxiao Hu 0001, Ming Liu 0009, Yun Fu 0001, Thomas S. Huang Training combination strategy of multi-stream fused hidden Markov model for audio-visual affect recognition. Search on Bibsonomy ACM Multimedia The full citation details ... 2006 DBLP  DOI  BibTeX  RDF affective computing, emotion recognition, multimodal human-computer interaction, affect recognition
21Iain McCowan, Maganto Hari Krishna, Daniel Gatica-Perez, Darren Moore, Sileye O. Ba Speech Acquisition in Meetings with an Audio-Visual Sensor Array. Search on Bibsonomy ICME The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
21Mike Leggett Losers and finders: indexing audio-visual digital media. Search on Bibsonomy Creativity & Cognition The full citation details ... 2005 DBLP  DOI  BibTeX  RDF interactive, taxonomy, index, digital media
21Hari Kalva, Alexandros Eleftheriadis Algorithms for multiplex scheduling of object-based audio-visual presentations. Search on Bibsonomy IEEE Trans. Circuits Syst. Video Technol. The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
21Hari Sundaram, Shih-Fu Chang Determining computable scenes in films and their structures using audio-visual memory models. Search on Bibsonomy ACM Multimedia The full citation details ... 2000 DBLP  DOI  BibTeX  RDF computable scenes, periodic analysis transform, shot-level structure, memory models, films, scene detection
21Karren Yang, Dejan Markovic, Steven Krenn, Vasu Agrawal, Alexander Richard Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
21Zi-qiang Zhang, Jie Zhang 0042, Jian-Shu Zhang, Ming-Hui Wu, Xin Fang, Li-Rong Dai 0001 Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  BibTeX  RDF
21Joanna Hong, Minsu Kim, Daehun Yoo, Yong Man Ro Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition. Search on Bibsonomy CoRR The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
21Ziqiang Zhang, Jie Zhang 0042, Jian-Shu Zhang, Ming-Hui Wu, Xin Fang, Lirong Dai 0001 Learning Contextually Fused Audio-Visual Representations For Audio-Visual Speech Recognition. Search on Bibsonomy ICIP The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
21Joanna Hong, Minsu Kim, Daehun Yoo, Yong Man Ro Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
21Ankit P. Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning. Search on Bibsonomy ICASSP The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
21Karren Yang, Dejan Markovic, Steven Krenn, Vasu Agrawal, Alexander Richard Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis. Search on Bibsonomy CVPR The full citation details ... 2022 DBLP  DOI  BibTeX  RDF
21Ankit P. Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning. Search on Bibsonomy CoRR The full citation details ... 2021 DBLP  BibTeX  RDF
21Josefine Hölling, Maria Svahn, Sandra Pauletto Audio-Visual Interactive Art: Investigating the effect of gaze-controlled audio on visual attention and short term memory. Search on Bibsonomy Audio Mostly Conference The full citation details ... 2021 DBLP  DOI  BibTeX  RDF
21Lucas D. Terissi, Gonzalo D. Sad, Juan Carlos Gómez Robust front-end for audio, visual and audio-visual speech classification. Search on Bibsonomy Int. J. Speech Technol. The full citation details ... 2018 DBLP  DOI  BibTeX  RDF
21Jiyoung Lee, Sunok Kim, Seungryong Kim, Kwanghoon Sohn Audio-Visual Attention Networks for Emotion Recognition. Search on Bibsonomy AVSU@MM The full citation details ... 2018 DBLP  DOI  BibTeX  RDF
21Zafi Sherhan Syed, Kirill A. Sidorov, A. David Marshall Automated Screening for Bipolar Disorder from Audio/Visual Modalities. Search on Bibsonomy AVEC@MM The full citation details ... 2018 DBLP  DOI  BibTeX  RDF
21Luca Remaggi, Hansung Kim, Philip J. B. Jackson, Adrian Hilton 0001 An Audio-Visual Method for Room Boundary Estimation and Material Recognition. Search on Bibsonomy AVSU@MM The full citation details ... 2018 DBLP  DOI  BibTeX  RDF
21Rongfeng Su, Lan Wang, Xunying Liu Multimodal learning using 3D audio-visual data for audio-visual speech recognition. Search on Bibsonomy IALP The full citation details ... 2017 DBLP  DOI  BibTeX  RDF
21Petros Koutras, Athanasia Zlatintsi, Elias Iosif, Athanasios Katsamanis, Petros Maragos, Alexandros Potamianos Predicting audio-visual salient events based on visual, audio and text modalities for movie summarization. Search on Bibsonomy ICIP The full citation details ... 2015 DBLP  DOI  BibTeX  RDF
21Humberto Pérez Espinosa, Hugo Jair Escalante, Luis Villaseñor Pineda, Manuel Montes-y-Gómez, David Pinto Avendaño, Verónica Reyes-Meza Fusing Affective Dimensions and Audio-Visual Features from Segmented Video for Depression Recognition: INAOE-BUAP's Participation at AVEC'14 Challenge. Search on Bibsonomy AVEC@MM The full citation details ... 2014 DBLP  DOI  BibTeX  RDF
21Ahmed Hussen Abdelaziz, Steffen Zeiler, Dorothea Kolossa Using twin-HMM-based audio-visual speech enhancement as a front-end for robust audio-visual speech recognition. Search on Bibsonomy INTERSPEECH The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
21Peng Shen, Satoshi Tamura, Satoru Hayamizu Audio-visual interaction in sparse representation features for noise robust audio-visual speech recognition. Search on Bibsonomy AVSP The full citation details ... 2013 DBLP  BibTeX  RDF
21Michel F. Valstar, Björn W. Schuller, Kirsty Smith, Florian Eyben, Bihan Jiang, Sanjay Bilakhia, Sebastian Schnieder, Roddy Cowie, Maja Pantic AVEC 2013: the continuous audio/visual emotion and depression recognition challenge. Search on Bibsonomy AVEC@ACM Multimedia The full citation details ... 2013 DBLP  DOI  BibTeX  RDF
21Marc Rébillat, Xavier Boutillon, Etienne Corteel, Brian F. G. Katz Audio, visual, and audio-visual egocentric distance perception by moving subjects in virtual environments. Search on Bibsonomy ACM Trans. Appl. Percept. The full citation details ... 2012 DBLP  DOI  BibTeX  RDF
21Natalie Fecher The 'Audio-Visual Face Cover Corpus': Investigations into audio-visual speech and speaker recognition when the speaker's face is occluded by facewear. Search on Bibsonomy INTERSPEECH The full citation details ... 2012 DBLP  DOI  BibTeX  RDF
21Wei Jiang 0001, Alexander C. Loui Audio-visual grouplet: temporal audio-visual interactions for general video concept classification. Search on Bibsonomy ACM Multimedia The full citation details ... 2011 DBLP  DOI  BibTeX  RDF
21Satoshi Tamura, Masato Ishikawa, Takashi Hashiba, Shin'ichi Takeuchi, Satoru Hayamizu A robust audio-visual speech recognition using audio-visual voice activity detection. Search on Bibsonomy INTERSPEECH The full citation details ... 2010 DBLP  DOI  BibTeX  RDF
21Yuki Denda, Takanobu Nishiura, Yoichi Yamashita Omnidirectional Audio-Visual Talker Localization Based on Dynamic Fusion of Audio-Visual Features Using Validity and Reliability Criteria. Search on Bibsonomy IEICE Trans. Inf. Syst. The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
21King-Shy Goh, Koji Miyahara, Regunathan Radhakrishnan, Ziyou Xiong, Ajay Divakaran Audio-visual event detection based on mining of semantic audio-visual labels. Search on Bibsonomy Storage and Retrieval Methods and Applications for Multimedia The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
21Gerasimos Potamianos, Chalapathy Neti, Sabine Deligne Joint audio-visual speech processing for recognition and enhancement. Search on Bibsonomy AVSP The full citation details ... 2003 DBLP  BibTeX  RDF
21Martin Heckmann, Frédéric Berthommier, Christophe Savariaux, Kristian Kroschel Effects of image distortions on audio-visual speech recognition. Search on Bibsonomy AVSP The full citation details ... 2003 DBLP  BibTeX  RDF
21Jing Huang 0019, Gerasimos Potamianos, Chalapathy Neti Improving audio-visual speech recognition with an infrared headset. Search on Bibsonomy AVSP The full citation details ... 2003 DBLP  BibTeX  RDF
21Tomoaki Yoshinaga, Satoshi Tamura, Koji Iwano, Sadaoki Furui Audio-visual speech recognition using lip movement extracted from side-face images. Search on Bibsonomy AVSP The full citation details ... 2003 DBLP  BibTeX  RDF
Displaying result #101 - #200 of 3474 (100 per page; Change: )
Pages: [<<][1][2][3][4][5][6][7][8][9][10][11][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license