|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 800 occurrences of 559 keywords
|
|
|
Results
Found 3474 publication records. Showing 3474 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
155 | Hector Ouilhet |
Google Sky Map: using your phone as an interface. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Mobile HCI ![In: Proceedings of the 12th Conference on Human-Computer Interaction with Mobile Devices and Services, Mobile HCI 2010, Lisbon, Portugal, September 7-10, 2010, pp. 419-422, 2010, ACM, 978-1-60558-835-3. The full citation details ...](Pics/full.jpeg) |
2010 |
DBLP DOI BibTeX RDF |
|
92 | Wei Jiang 0001, Courtenay V. Cotton, Shih-Fu Chang, Dan Ellis, Alexander C. Loui |
Short-term audio-visual atoms for generic video concept classification. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ACM Multimedia ![In: Proceedings of the 17th International Conference on Multimedia 2009, Vancouver, British Columbia, Canada, October 19-24, 2009, pp. 5-14, 2009, ACM, 978-1-60558-608-3. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
audio-visual codebook, joint audio-visual analysis, short-term audio-visual atom, semantic concept detection |
78 | Niall A. Fox, Brian A. O'Mullane, Richard B. Reilly |
Audio-Visual Speaker Identification via Adaptive Fusion Using Reliability Estimates of Both Modalities. ![Search on Bibsonomy](Pics/bibsonomy.png) |
AVBPA ![In: Audio- and Video-Based Biometric Person Authentication, 5th International Conference, AVBPA 2005, Hilton Rye Town, NY, USA, July 20-22, 2005, Proceedings, pp. 787-796, 2005, Springer, 3-540-27887-7. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
68 | Mustafa Nazmi Kaynak, Qi Zhi, Adrian David Cheok, Kuntal Sengupta, Jian Zhang, Chi Chung Ko |
Analysis of lip geometric features for audio-visual speech recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Syst. Man Cybern. Part A ![In: IEEE Trans. Syst. Man Cybern. Part A 34(4), pp. 564-570, 2004. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
64 | Timothy J. Hazen, Kate Saenko, Chia-Hao La, James R. Glass |
A segment-based audio-visual speech recognizer: data collection, development, and initial experiments. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMI ![In: Proceedings of the 6th International Conference on Multimodal Interfaces, ICMI 2004, State College, PA, USA, October 13-15, 2004, pp. 235-242, 2004, ACM, 1-58113-995-0. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
audio-visual corpora, audio-visual speech recognition |
63 | David Grelaud, Nicolas Bonneel, Michael Wimmer 0001, Manuel Asselot, George Drettakis |
Efficient and practical audio-visual rendering for games using crossmodal perception. ![Search on Bibsonomy](Pics/bibsonomy.png) |
SI3D ![In: Proceedings of the 2009 Symposium on Interactive 3D Graphics, SI3D 2009, February 27 - March 1, 2009, Boston, Massachusetts, USA, pp. 177-182, 2009, ACM, 978-1-60558-429-4. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
audio-visual rendering, crossmodal perception |
60 | Toshiko Isei-Jaakkola |
Cognition and Physio-acoustic Correlates - Audio and Audio-visual Effects of a Short English Emotional Statement: On JL2, FL2 and EL1. ![Search on Bibsonomy](Pics/bibsonomy.png) |
FinTAL ![In: Advances in Natural Language Processing, 5th International Conference on NLP, FinTAL 2006, Turku, Finland, August 23-25, 2006, Proceedings, pp. 161-173, 2006, Springer, 3-540-37334-9. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
58 | Giulia Garau, Sileye O. Ba, Hervé Bourlard, Jean-Marc Odobez |
Investigating the use of visual focus of attention for audio-visual speaker diarisation. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ACM Multimedia ![In: Proceedings of the 17th International Conference on Multimedia 2009, Vancouver, British Columbia, Canada, October 19-24, 2009, pp. 681-684, 2009, ACM, 978-1-60558-608-3. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
audio-visual speaker diarisation, visual focus of attention |
58 | Niall A. Fox, Richard B. Reilly |
Audio-Visual Speaker Identification Based on the Use of Dynamic Audio and Visual Features. ![Search on Bibsonomy](Pics/bibsonomy.png) |
AVBPA ![In: Audio-and Video-Based Biometrie Person Authentication, 4th International Conference, AVBPA 2003, Guildford, UK, June 9-11, 2003 Proceedings, pp. 743-751, 2003, Springer, 3-540-40302-7. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP DOI BibTeX RDF |
|
57 | Jong-Seok Lee, Cheol Hoon Park |
Temporal filtering of visual speech for audio-visual speech recognition in acoustically and visually challenging environments. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMI ![In: Proceedings of the 9th International Conference on Multimodal Interfaces, ICMI 2007, Nagoya, Aichi, Japan, November 12-15, 2007, pp. 220-227, 2007, ACM, 978-1-59593-817-6. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
late integration, neural network, feature extraction, hidden Markov model, noise-robustness, audio-visual speech recognition, temporal filtering |
56 | Marco Cristani, Manuele Bicego, Vittorio Murino |
Audio-Visual Event Recognition in Surveillance Video Sequences. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Multim. ![In: IEEE Trans. Multim. 9(2), pp. 257-267, 2007. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
52 | Vasil Khalidov, Florence Forbes, Miles E. Hansard, Elise Arnaud, Radu Horaud |
Detection and localization of 3d audio-visual objects using unsupervised clustering. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMI ![In: Proceedings of the 10th International Conference on Multimodal Interfaces, ICMI 2008, Chania, Crete, Greece, October 20-22, 2008, pp. 217-224, 2008, ACM, 978-1-60558-198-9. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
audio-visual clustering, binaural hearing, mixture models, stereo vision |
52 | Jörn Anemüller, Jörg-Hendrik Bach, Barbara Caputo, Michal Havlena, Jie Luo, Hendrik Kayser, Bastian Leibe, Petr Motlícek, Tomás Pajdla, Misha Pavel, Akihiko Torii, Luc Van Gool, Alon Zweig, Hynek Hermansky |
The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMI ![In: Proceedings of the 10th International Conference on Multimodal Interfaces, ICMI 2008, Chania, Crete, Greece, October 20-22, 2008, pp. 289-292, 2008, ACM, 978-1-60558-198-9. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
sensor platform, multimodal interaction, event detection, augmented cognition, audio-visual |
52 | C. Mario Christoudias, Kate Saenko, Louis-Philippe Morency, Trevor Darrell |
Co-Adaptation of audio-visual speech and gesture classifiers. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMI ![In: Proceedings of the 8th International Conference on Multimodal Interfaces, ICMI 2006, Banff, Alberta, Canada, November 2-4, 2006, pp. 84-91, 2006, ACM, 1-59593-541-X. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
audio-visual speech and gesture, adaptation, semi-supervised learning, human-computer interfaces, co-training |
52 | Benjamin Senechal, Denis Pellerin, Laurent Besacier, Isabelle Simand, Stéphane Bres |
Audio, video and audio-visual signatures for short video clip detection: experiments on Trecvid2003. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICME ![In: Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, ICME 2005, July 6-9, 2005, Amsterdam, The Netherlands, pp. 221-224, 2005, IEEE Computer Society, 0-7803-9331-7. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
audio-visual signature, video clip detection, Trecvid2003, spatio-temporal signature, gray level centroid, video database |
52 | Yu Cao 0002, Sung Baang, Shih-Hsi Liu, Ming Li 0007, Sanqing Hu |
Audio-visual event classification via spatial-temporal-audio words. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICPR ![In: 19th International Conference on Pattern Recognition (ICPR 2008), December 8-11, 2008, Tampa, Florida, USA, pp. 1-5, 2008, IEEE Computer Society, 978-1-4244-2175-6. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
52 | Jing Huang 0019, Etienne Marcheret, Karthik Visweswariah |
Rapid Feature Space Speaker Adaptation for Multi-Stream HMM-Based Audio-Visual Speech Recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICME ![In: Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, ICME 2005, July 6-9, 2005, Amsterdam, The Netherlands, pp. 338-341, 2005, IEEE Computer Society, 0-7803-9331-7. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
50 | Sofia Tsekeridou, Ioannis Pitas |
Audio-Visual Content Analysis for Content-Based Video Indexing. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMCS, Vol. 1 ![In: IEEE International Conference on Multimedia Computing and Systems, ICMCS 1999, Florence, Italy, June 7-11, 1999. Volume I, pp. 667-672, 1999, IEEE Computer Society, 0-7695-0253-9. The full citation details ...](Pics/full.jpeg) |
1999 |
DBLP DOI BibTeX RDF |
audio-visual content analysis, multimodal interaction, content-based indexing/retrieval |
50 | Marco Cristani, Manuele Bicego, Vittorio Murino |
Audio-Video Integration for Background Modelling. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ECCV (2) ![In: Computer Vision - ECCV 2004, 8th European Conference on Computer Vision, Prague, Czech Republic, May 11-14, 2004. Proceedings, Part II, pp. 202-213, 2004, Springer, 3-540-21983-8. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
50 | Ye Wang 0007, Bingjun Zhang, Olaf Schleusing |
Educational violin transcription by fusing multimedia streams. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ACM Multimedia EMME Workshop ![In: Proceedings of the International Workshop on Educational Multimedia and Multimedia Education 2007, Augsburg, Bavaria, Germany, September 28, 2007, pp. 57-66, 2007, ACM, 978-1-59593-783-4. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
audio-visual fusion, computer-assisted tutoring, detection function, note segmentation, music transcription, onset detection |
50 | Ming Liu 0009, Yun Fu 0001, Thomas S. Huang |
An audio-visual fusion framework with joint dimensionality reducton. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICASSP ![In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2008, March 30 - April 4, 2008, Caesars Palace, Las Vegas, Nevada, USA, pp. 4437-4440, 2008, IEEE, 1-4244-1484-9. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
49 | Louis H. Terry, Derek J. Shiell, Aggelos K. Katsaggelos |
Feature space video stream consistency estimation for dynamic stream weighting in audio-visual speech recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICIP ![In: Proceedings of the International Conference on Image Processing, ICIP 2008, October 12-15, 2008, San Diego, California, USA, pp. 1316-1319, 2008, IEEE, 978-1-4244-1765-0. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
49 | Pengyu Hong, Zhen Wen, Thomas S. Huang, Heung-Yeung Shum |
Real-Time Speech-Driven 3D Face Animation. ![Search on Bibsonomy](Pics/bibsonomy.png) |
3DPVT ![In: 1st International Symposium on 3D Data Processing Visualization and Transmission (3DPVT 2002), 19-21 June 2002, Padova, Italy, pp. 713-716, 2002, IEEE Computer Society, 0-7695-1521-5. The full citation details ...](Pics/full.jpeg) |
2002 |
DBLP DOI BibTeX RDF |
|
48 | Tsuhan Chen, Hans Peter Graf, Barry G. Haskell, Eric Petajan, Yao Wang 0001, Homer H. Chen, Wu Chou |
Speech-assisted lip synchronization in audio-visual communications. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICIP ![In: Proceedings 1995 International Conference on Image Processing, Washington, DC, USA, October 23-26, 1995, pp. 579-582, 1995, IEEE Computer Society, 0-8186-7310-9. The full citation details ...](Pics/full.jpeg) |
1995 |
DBLP DOI BibTeX RDF |
audio-visual systems, audio-visual communications, speech-assisted lip synchronization, speech information, speech-assisted frame-rate conversion, talking head video coding, image processing, image processing, video coding, synchronisation, videoconferencing, teleconferencing, speech processing, speech analysis, videotelephony, video telephony |
47 | Tanveer A. Faruquie, Abhik Majumdar, Nitendra Rajput, L. Venkata Subramaniam |
Large Vocabulary Audio-Visual Speech Recognition Using Active Shape Models. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICPR ![In: 15th International Conference on Pattern Recognition, ICPR'00, Barcelona, Spain, September 3-8, 2000., pp. 3110-3113, 2000, IEEE Computer Society, 0-7695-0750-6. The full citation details ...](Pics/full.jpeg) |
2000 |
DBLP DOI BibTeX RDF |
|
46 | Timothy J. Hazen |
Visual model structures and synchrony constraints for audio-visual speech recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Speech Audio Process. ![In: IEEE Trans. Speech Audio Process. 14(3), pp. 1082-1089, 2006. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
45 | Girija Chetty, Michael Wagner 0004 |
A Robust Speaking Face Modelling Approach Based on Multilevel Fusion. ![Search on Bibsonomy](Pics/bibsonomy.png) |
DICTA ![In: Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, DICTA 2007, 3-5 December 2007, Adelaide, Australia, pp. 408-415, 2007, IEEE Computer Society, 0-7695-3067-2. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
44 | Niall A. Fox, Richard B. Reilly |
Robust multi-modal person identification with tolerance of facial expression. ![Search on Bibsonomy](Pics/bibsonomy.png) |
SMC (1) ![In: Proceedings of the IEEE International Conference on Systems, Man & Cybernetics: The Hague, Netherlands, 10-13 October 2004, pp. 580-585, 2004, IEEE, 0-7803-8566-7. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
41 | Lei Xie 0001, Zhi-Qiang Liu |
Speech Animation Using Coupled Hidden Markov Models. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICPR (1) ![In: 18th International Conference on Pattern Recognition (ICPR 2006), 20-24 August 2006, Hong Kong, China, pp. 1128-1131, 2006, IEEE Computer Society, 0-7695-2521-0. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
41 | Ara V. Nefian, Luhong Liang, Tieyan Fu, Xiaoxing Liu |
A Bayesian Approach to Audio-Visual Speaker Identification. ![Search on Bibsonomy](Pics/bibsonomy.png) |
AVBPA ![In: Audio-and Video-Based Biometrie Person Authentication, 4th International Conference, AVBPA 2003, Guildford, UK, June 9-11, 2003 Proceedings, pp. 761-769, 2003, Springer, 3-540-40302-7. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP DOI BibTeX RDF |
|
40 | Lei Chen 0002, Sule Gündüz, M. Tamer Özsu |
Mixed Type Audio Classification with Support Vector Machine. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICME ![In: Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, ICME 2006, July 9-12 2006, Toronto, Ontario, Canada, pp. 781-784, 2006, IEEE Computer Society, 1-4244-0367-7. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
39 | Petar S. Aleksic, Aggelos K. Katsaggelos |
Comparison of MPEG-4 facial animation parameter groups with respect to audio-visual speech recognition performance. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICIP (3) ![In: Proceedings of the 2005 International Conference on Image Processing, ICIP 2005, Genoa, Italy, September 11-14, 2005, pp. 501-504, 2005, IEEE, 0-7803-9134-9. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
39 | Amitava Das |
Audio Visual Person Authentication by Multiple Nearest Neighbor Classifiers. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICB ![In: Advances in Biometrics, International Conference, ICB 2007, Seoul, Korea, August 27-29, 2007, Proceedings, pp. 1114-1123, 2007, Springer, 978-3-540-74548-8. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
audio-visual biometric authentication, feature extraction, face recognition, Multimodal, fusion, Speaker recognition, multiple classifiers, VQ |
39 | Indrani Medhi, Archana Prasad, Kentaro Toyama |
Optimal audio-visual representations for illiterate users of computers. ![Search on Bibsonomy](Pics/bibsonomy.png) |
WWW ![In: Proceedings of the 16th International Conference on World Wide Web, WWW 2007, Banff, Alberta, Canada, May 8-12, 2007, pp. 873-882, 2007, ACM, 978-1-59593-654-7. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
audio-visual icons, illiterate users, text-free user interfaces |
39 | Satoshi Nakamura 0001, Eli Yamamoto |
Speech-to-Lip Movement Synthesis by Maximizing Audio-Visual Joint Probability Based on the EM Algorithm. ![Search on Bibsonomy](Pics/bibsonomy.png) |
J. VLSI Signal Process. ![In: J. VLSI Signal Process. 27(1-2), pp. 119-126, 2001. The full citation details ...](Pics/full.jpeg) |
2001 |
DBLP DOI BibTeX RDF |
lip movement synthesis, audio-visual joint probability, speech recognition, EM algorithm |
39 | Wee Sun Lee, Michael R. Frater, Mark R. Pickering, John F. Arnold |
Robustness of multiplexing protocols for audio-visual services over wireless networks. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICIP (3) ![In: Proceedings 1997 International Conference on Image Processing, ICIP '97, Santa Barbara, California, USA, October 26-29, 1997, pp. 563-566, 1997, IEEE Computer Society, 0-8186-8183-7. The full citation details ...](Pics/full.jpeg) |
1997 |
DBLP DOI BibTeX RDF |
multiplexing protocols, audio-visual services, MPEG 2 systems, ITU-T H.223, high bit-error rates, packet-oriented multiplexing, packet header, forward error correcting codes, header information, FEC protection, quality of service, wireless networks, packet radio networks, wireless channels |
39 | Mike A. Oren, Chris Harding, Terri L. Bonebright |
Evaluation of spatial abilities within a 2D auditory platform game. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ASSETS ![In: Proceedings of the 10th International ACM SIGACCESS Conference on Computers and Accessibility, ASSETS 2008, Halifax, Nova Scotia, Canada, October 13-15, 2008, pp. 235-236, 2008, ACM, 978-1-59593-976-0. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
visual impairments, video games, spatial ability, audio games |
39 | Alexánder Ceballos, Juan-Bernardo Gómez, Flavio Prieto, Tanneguy Redarce |
Robot Command Interface Using an Audio-Visual Speech Recognition System. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CIARP ![In: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, 14th Iberoamerican Conference on Pattern Recognition, CIARP 2009, Guadalajara, Jalisco, Mexico, November 15-18, 2009. Proceedings, pp. 869-876, 2009, Springer, 978-3-642-10267-7. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
Speech recognition, MPEG-4, manipulator |
39 | Tomoaki Koiwa, Kazuhiro Nakadai, Jun-ichi Imura |
Coarse speech recognition by audio-visual integration based on missing feature theory. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IROS ![In: 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29 - November 2, 2007, Sheraton Hotel and Marina, San Diego, California, USA, pp. 1751-1756, 2007, IEEE, 978-1-4244-0912-9. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
39 | Atulya Velivelli, Chong-Wah Ngo, Thomas S. Huang |
Detection of Documentary Scene Changes by Audio-Visual Fusion. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CIVR ![In: Image and Video Retrieval, Second International Conference, CIVR 2003, Urbana-Champaign, IL, USA, July 24-25, 2003, Proceedings, pp. 227-237, 2003, Springer, 3-540-40634-4. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP DOI BibTeX RDF |
|
39 | Hari Sundaram, Lexing Xie, Shih-Fu Chang |
A utility framework for the automatic generation of audio-visual skims. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ACM Multimedia ![In: Proceedings of the 10th ACM International Conference on Multimedia 2002, Juan les Pins, France, December 1-6, 2002., pp. 189-198, 2002, ACM, 1-58113-620-X. The full citation details ...](Pics/full.jpeg) |
2002 |
DBLP DOI BibTeX RDF |
|
38 | Emily Mower, Maja J. Mataric, Shrikanth S. Narayanan |
Selection of Emotionally Salient Audio-Visual Features for Modeling Human Evaluations of Synthetic Character Emotion Displays. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ISM ![In: Tenth IEEE International Symposium on Multimedia (ISM2008), December 15-17, 2008, Berkeley, California, USA, pp. 190-195, 2008, IEEE Computer Society, 978-0-7695-3454-1. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
38 | Ilse Ravyse, Dongmei Jiang, Xiaoyue Jiang, Guoyun Lv, Yunshu Hou, Hichem Sahli, Rongchun Zhao |
DBN Based Models for Audio-Visual Speech Analysis and Recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
PCM ![In: Advances in Multimedia Information Processing - PCM 2006, 7th Pacific Rim Conference on Multimedia, Hangzhou, China, November 2-4, 2006, Proceedings, pp. 19-30, 2006, Springer, 3-540-48766-2. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
38 | Marc A. Al-Hames, Thomas Hain, Jan Cernocký, Sascha Schreiber, Mannes Poel, Ronald Müller, Sébastien Marcel, David A. van Leeuwen, Jean-Marc Odobez, Sileye O. Ba, Hervé Bourlard, Fabien Cardinaux, Daniel Gatica-Perez, Adam Janin, Petr Motlícek, Stephan Reiter, Steve Renals, Jeroen van Rest, Rutger Rienks, Gerhard Rigoll, Kevin Smith 0001, Andrew H. C. Thean, Pavel Zemcík |
Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers. ![Search on Bibsonomy](Pics/bibsonomy.png) |
MLMI ![In: Machine Learning for Multimodal Interaction, Third International Workshop, MLMI 2006, Bethesda, MD, USA, May 1-4, 2006, Revised Selected Papers, pp. 24-35, 2006, Springer, 3-540-69267-3. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
38 | Renaud Séguier, David Mercier |
Audio-Visual Speech Recognition One Pass Learning with Spiking Neurons. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICANN ![In: Artificial Neural Networks - ICANN 2002, International Conference, Madrid, Spain, August 28-30, 2002, Proceedings, pp. 1207-1212, 2002, Springer, 3-540-44074-7. The full citation details ...](Pics/full.jpeg) |
2002 |
DBLP DOI BibTeX RDF |
|
38 | Ferda Ofli, Yasemin Demir, Engin Erzin, Yücel Yemez, A. Murat Tekalp |
Multicamera Audio-Visual Analysis of Dance Figures. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICME ![In: Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007, July 2-5, 2007, Beijing, China, pp. 1703-1706, 2007, IEEE Computer Society, 1-4244-1017-7. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
38 | Ziyou Xiong |
Audio-visual sports highlights extraction using Coupled Hidden Markov Models. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Pattern Anal. Appl. ![In: Pattern Anal. Appl. 8(1-2), pp. 62-71, 2005. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
38 | Thierry Guiard-Marigny, Nicolas Tsingos, Ali Adjoudani, Christian Benoît, Marie-Paule Gascuel |
3D Models Of The Lips For Realistic Speech Animation. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CA ![In: Computer Animation 1996, CA 1996, Geneva, Switzerland, June 3-4, 1996, pp. 80-89, 1996, IEEE Computer Society, 0-8186-7588-8. The full citation details ...](Pics/full.jpeg) |
1996 |
DBLP DOI BibTeX RDF |
audio-visual systems, realistic images, lip modelling, realistic speech animation, audio visual articulatory speech synthesizer, border contours, vermilion zone, lip shape, speaker dependent conformations, natural lips, French speaker, reference lip shapes, lip contact, natural languages, computer animation, 3D models, implicit surfaces, speech synthesis, volumetric model, continuous functions, human face, geometrical analysis, algebraic equations |
37 | Lucas D. Terissi, Juan Carlos Gómez |
Audio-to-Visual Conversion Via HMM Inversion for Speech-Driven Facial Animation. ![Search on Bibsonomy](Pics/bibsonomy.png) |
SBIA ![In: Advances in Artificial Intelligence - SBIA 2008, 19th Brazilian Symposium on Artificial Intelligence, Savador, Brazil, October 26-30, 2008. Proceedings, pp. 33-42, 2008, Springer, 978-3-540-88189-6. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
Audio-Visual Speech Processing, Hidden Markov Models, Facial Animation |
37 | Jeho Nam, A. Enis Çetin, Ahmed H. Tewfik |
Speaker Identification and Video Analysis for Hierarchical Video Shot Classification . ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICIP (2) ![In: Proceedings 1997 International Conference on Image Processing, ICIP '97, Santa Barbara, California, USA, October 26-29, 1997, pp. 550-553, 1997, IEEE Computer Society, 0-8186-8183-7. The full citation details ...](Pics/full.jpeg) |
1997 |
DBLP DOI BibTeX RDF |
hierarchical video shot classification, visual data tracks, audio data tracks, visual stream analysis, voiced phonemes tracking, speech data, audio-visual objects indexing, video shots matching, browsing, video analysis, content-based retrieval, multimedia databases, video databases, speaker recognition, speaker identification, clustering technique, content-based indexing, video data, audio signal, 3D wavelet transform, speaker changes detection |
37 | Rebecca Lunsford, Sharon L. Oviatt |
Human perception of intended addressee during computer-assisted meetings. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMI ![In: Proceedings of the 8th International Conference on Multimodal Interfaces, ICMI 2006, Banff, Alberta, Canada, November 2-4, 2006, pp. 20-27, 2006, ACM, 1-59593-541-X. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
acoustic-prosodic cues, dialogue style, human-computer teamwork, intended addressee, open-microphone engagement, gaze, multiparty interaction |
37 | Hendrik Knoche, Hermann de Meer, David Kirsh |
Compensating for low frame rates. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CHI Extended Abstracts ![In: Extended Abstracts Proceedings of the 2005 Conference on Human Factors in Computing Systems, CHI 2005, Portland, Oregon, USA, April 2-7, 2005, pp. 1553-1556, 2005, ACM, 1-59593-002-7. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
audio-visual integration, skew, frame rates, speech perception |
37 | Aristodemos Pnevmatikakis, John Soldatos 0001, Fotios Talantzis, Lazaros Polymenakos |
Robust multimodal audio-visual processing for advanced context awareness in smart spaces. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Pers. Ubiquitous Comput. ![In: Pers. Ubiquitous Comput. 13(1), pp. 3-14, 2009. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
|
37 | Le Xin, Jianhua Tao 0001, Tieniu Tan |
Dynamic Audio-Visual Mapping using Fused Hidden Markov Model Inversion Method. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICIP (3) ![In: Proceedings of the International Conference on Image Processing, ICIP 2007, September 16-19, 2007, San Antonio, Texas, USA, pp. 293-296, 2007, IEEE, 978-1-4244-1436-9. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
37 | Aristodemos Pnevmatikakis, John Soldatos 0001, Fotios Talantzis, Lazaros Polymenakos |
Robust Multimodal Audio-Visual Processing for Advanced Context Awareness in Smart Spaces. ![Search on Bibsonomy](Pics/bibsonomy.png) |
AIAI ![In: Artificial Intelligence Applications and Innovations, 3rd IFIP Conference on Artificial Intelligence Applications and Innovations (AIAI) 2006, June 7-9, 2006, Athens, Greece, pp. 290-301, 2006, Springer, 0-387-34223-0. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
37 | Shengli Fu, Ricardo Gutierrez-Osuna, Anna Esposito, Praveen K. Kakumanu, Oscar N. Garcia |
Audio/visual mapping with cross-modal hidden Markov models. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Multim. ![In: IEEE Trans. Multim. 7(2), pp. 243-252, 2005. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
37 | Zhihong Zeng, Jilin Tu, Ming Liu 0009, Thomas S. Huang |
Multi-stream Confidence Analysis for Audio-Visual Affect Recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ACII ![In: Affective Computing and Intelligent Interaction, First International Conference, ACII 2005, Beijing, China, October 22-24, 2005, Proceedings, pp. 964-971, 2005, Springer, 3-540-29621-2. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
37 | Walid Karam, Chafic Mokbel, Hanna Greige, Guido Aversano, Catherine Pelachaud, Gérard Chollet |
An Audio-Visual Imposture Scenario by Talking Face Animation. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Summer School on Neural Networks ![In: Nonlinear Speech Modeling and Applications, Advanced Lectures and Revised Selected Papers, 9th International Summer School `Neural Nets E.R. Caianiello` on Nonlinear Speech Processing: Algorithms and Analysis, Vietri sul Mare, Salerno, Italy, September 13-18, 2004, pp. 365-369, 2004, Springer, 3-540-27441-3. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
37 | Satoshi Nakamura 0001 |
Fusion of Audio-Visual Information for Integrated Speech Processing. ![Search on Bibsonomy](Pics/bibsonomy.png) |
AVBPA ![In: Audio- and Video-Based Biometric Person Authentication, Third International Conference, AVBPA 2001 Halmstad, Sweden, June 6-8, 2001, Proceedings, pp. 127-143, 2001, Springer, 3-540-42216-1. The full citation details ...](Pics/full.jpeg) |
2001 |
DBLP DOI BibTeX RDF |
|
36 | Tanveer A. Faruquie, Ashish Kapoor, Rohit J. Kate, Nitendra Rajput, L. Venkata Subramaniam |
Audio Driven Facial Animation For Audio-Visual Reality. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICME ![In: Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, ICME 2001, August 22-25, 2001, Tokyo, Japan, 2001, IEEE Computer Society, 0-7695-1198-8. The full citation details ...](Pics/full.jpeg) |
2001 |
DBLP DOI BibTeX RDF |
|
36 | Kate Saenko, Trevor Darrell, James R. Glass |
Articulatory features for robust visual speech recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMI ![In: Proceedings of the 6th International Conference on Multimodal Interfaces, ICMI 2004, State College, PA, USA, October 13-15, 2004, pp. 152-158, 2004, ACM, 1-58113-995-0. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
articulatory features, speechreading, visual feature extraction, multimodal interfaces, audio-visual speech recognition |
35 | Wichian Sittiprapaporn, Jun Soo Kwon |
Brain Electric Microstate and Perception of Simultaneously Audiovisual Presentation. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICANN (1) ![In: Artificial Neural Networks - ICANN 2009, 19th International Conference, Limassol, Cyprus, September 14-17, 2009, Proceedings, Part I, pp. 345-355, 2009, Springer, 978-3-642-04273-7. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
Audiovisual perception, Microstate, Cognition, Brain, Event-related potential (ERP) |
35 | Bernt Schiele, Nuria Oliver, Tony Jebara, Alex Pentland |
An Interactive Computer Vision System DyPERS: Dynamic Personal Enhanced Reality System. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICVS ![In: Computer Vision Systems, First International Conference, ICVS '99, Las Palmas, Gran Canaria, Spain, January 13-15, 1999, Proceedings, pp. 51-65, 1999, Springer, 3-540-65459-3. The full citation details ...](Pics/full.jpeg) |
1999 |
DBLP DOI BibTeX RDF |
|
34 | Hari Krishna Maganti, Daniel Gatica-Perez, Iain McCowan |
Speech Enhancement and Recognition in Meetings With an Audio-Visual Sensor Array. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Speech Audio Process. ![In: IEEE Trans. Speech Audio Process. 15(8), pp. 2257-2269, 2007. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
34 | Olivier Martin 0001, Irene Kotsia, Benoît Macq, Ioannis Pitas |
The eNTERFACE'05 Audio-Visual Emotion Database. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICDE Workshops ![In: Proceedings of the 22nd International Conference on Data Engineering Workshops, ICDE 2006, 3-7 April 2006, Atlanta, GA, USA, pp. 8, 2006, IEEE Computer Society, 0-7695-2571-7. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
34 | Mat Felthousen |
Combining audio/visual and computing support. ![Search on Bibsonomy](Pics/bibsonomy.png) |
SIGUCCS ![In: Proceedings of the 33rd Annual ACM SIGUCCS Conference on User Services 2005, Monterey, CA, USA, November 6-9, 2005, pp. 75-82, 2005, ACM, 1-59593-200-3. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
AV equipment, design, reliability, standardization, computer labs, budgeting |
34 | Myung-Won Kim, Joung Woo Ryu, Eun Ju Kim |
Speech Recognition by Integrating Audio, Visual and Contextual Features Based on Neural Networks. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICNC (2) ![In: Advances in Natural Computation, First International Conference, ICNC 2005, Changsha, China, August 27-29, 2005, Proceedings, Part II, pp. 155-164, 2005, Springer, 3-540-28325-0. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
34 | Lei Xie 0001, Zhi-Qiang Liu |
Multi-stream Articulator Model with Adaptive Reliability Measure for Audio Visual Speech Recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMLC ![In: Advances in Machine Learning and Cybernetics, 4th International Conference, ICMLC 2005, Guangzhou, China, August 18-21, 2005, Revised Selected Papers, pp. 994-1004, 2005, Springer, 3-540-33584-6. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
34 | Raffay Hamid, Aaron F. Bobick, Anthony J. Yezzi |
Audio-visual flow -a variational approach to multi-modal flow estimation. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICIP ![In: Proceedings of the 2004 International Conference on Image Processing, ICIP 2004, Singapore, October 24-27, 2004, pp. 2563-2566, 2004, IEEE, 0-7803-8554-3. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
34 | Iain A. Matthews, Gerasimos Potamianos, Chalapathy Neti, Juergen Luettin |
A Comparison Of Model And Transform-Based Visual Features For Audio-Visual LVCSR. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICME ![In: Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, ICME 2001, August 22-25, 2001, Tokyo, Japan, 2001, IEEE Computer Society, 0-7695-1198-8. The full citation details ...](Pics/full.jpeg) |
2001 |
DBLP DOI BibTeX RDF |
|
33 | Walter Allasia, Fabrizio Falchi, Francesco Gallo, Mouna Kacimi, Aaron Kaplan, Jonathan Mamou, Yosi Mass, Nicola Orio |
Audio-Visual Content Analysis in P2P Networks: The SAPIR Approach. ![Search on Bibsonomy](Pics/bibsonomy.png) |
DEXA Workshops ![In: 19th International Workshop on Database and Expert Systems Applications (DEXA 2008), 1-5 September 2008, Turin, Italy, pp. 610-614, 2008, IEEE Computer Society, 978-0-7695-3299-8. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
Audio-visual content analysis, SAPIR, P2P |
33 | Zhihong Zeng, ZhenQiu Zhang, Brian Pianfetti, Jilin Tu, Thomas S. Huang |
Audio-visual affect recognition in activation-evaluation space. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICME ![In: Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, ICME 2005, July 6-9, 2005, Amsterdam, The Netherlands, pp. 828-831, 2005, IEEE Computer Society, 0-7803-9331-7. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
bimodal affect recognition, audio-visual affect recognition, psychological research, activation-evaluation space, Fisher boosting learning algorithm, human-computer interaction, HCI |
33 | Raphaël Troncy, Jean Carrive |
A reduced yet extensible audio-visual description language. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ACM Symposium on Document Engineering ![In: Proceedings of the 2004 ACM Symposium on Document Engineering, Milwaukee, Wisconsin, USA, October 28-30, 2004, pp. 87-89, 2004, ACM, 1-58113-938-1. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
audio-visual description language, semantic web, semantics, knowledge representation, structure, MPEG-7, descriptor |
33 | Andreas Tirakis, Panagiotis Katalagarianos, Michael Papathomas, Christodoulos Hamilakis |
Distributed Audio-Visual Archives Network (DiVAN). ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMCS, Vol. 2 ![In: IEEE International Conference on Multimedia Computing and Systems, ICMCS 1999, Florence, Italy, June 7-11, 1999. Volume II, pp. 1086-1088, 1999, IEEE Computer Society. The full citation details ...](Pics/full.jpeg) |
1999 |
DBLP DOI BibTeX RDF |
Distributed audio-visual archive, Extensible Data Model, OMG-CORBA infrastructure, Content-based access |
33 | Satoshi Nakamura 0001, Ken'ichi Kumatani, Satoshi Tamura |
Multi-Modal Temporal Asynchronicity Modeling by Product HMMs for Robust. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMI ![In: 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 14-16 October 2002, Pittsburgh, PA, USA, pp. 305-312, 2002, IEEE Computer Society, 0-7695-1834-6. The full citation details ...](Pics/full.jpeg) |
2002 |
DBLP DOI BibTeX RDF |
|
33 | Kyoung-Ho Choi, Ying Luo 0011, Jenq-Neng Hwang |
Hidden Markov Model Inversion for Audio-to-Visual Conversion in an MPEG-4 Facial Animation System. ![Search on Bibsonomy](Pics/bibsonomy.png) |
J. VLSI Signal Process. ![In: J. VLSI Signal Process. 29(1-2), pp. 51-61, 2001. The full citation details ...](Pics/full.jpeg) |
2001 |
DBLP DOI BibTeX RDF |
HMMI, audio-to-visual conversion, MPEG-4, facial animation |
32 | Simon Lucey, Tsuhan Chen, Sridha Sridharan, Vinod Chandran |
Integration strategies for audio-visual speech processing: applied to text-dependent speaker recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Multim. ![In: IEEE Trans. Multim. 7(3), pp. 495-506, 2005. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
32 | Simon Lucey, Tsuhan Chen |
Improved Audio-Visual Speaker Recognition via the Use of a Hybrid Combination Strategy. ![Search on Bibsonomy](Pics/bibsonomy.png) |
AVBPA ![In: Audio-and Video-Based Biometrie Person Authentication, 4th International Conference, AVBPA 2003, Guildford, UK, June 9-11, 2003 Proceedings, pp. 929-936, 2003, Springer, 3-540-40302-7. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP DOI BibTeX RDF |
|
32 | Zeljko Obrenovic, Dusan Starcevic, Emil Jovanov |
Toward Optimization of Multimodal User Interfaces for Tactical Audio Applications. ![Search on Bibsonomy](Pics/bibsonomy.png) |
User Interfaces for All ![In: Universal Access: Theoretical Perspectives, Practice, and Experience, 7th ERCIM International Workshop on User Interfaces for All, Paris, France, October 24-25, 2002, Revised Papers, pp. 287-298, 2002, Springer, 3-540-00855-1. The full citation details ...](Pics/full.jpeg) |
2002 |
DBLP DOI BibTeX RDF |
|
32 | Yan Li, Heung-Yeung Shum |
Learning dynamic audio-visual mapping with input-output Hidden Markov models. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Multim. ![In: IEEE Trans. Multim. 8(3), pp. 542-549, 2006. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
32 | Gareth J. F. Jones, Richard J. Edens |
Automated Alignment and Annotation of Audio-Visual Presentations. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ECDL ![In: Research and Advanced Technology for Digital Libraries, 6th European Conference, ECDL 2002, Rome, Italy, September 16-18, 2002, Proceedings, pp. 276-291, 2002, Springer, 3-540-44178-6. The full citation details ...](Pics/full.jpeg) |
2002 |
DBLP DOI BibTeX RDF |
|
32 | Laurent Girin |
Joint matrix quantization of face parameters and LPC coefficients for low bit rate audiovisual speech coding. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Speech Audio Process. ![In: IEEE Trans. Speech Audio Process. 12(3), pp. 265-276, 2004. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
32 | Jeongsoo Choi, Se Jin Park, Minsu Kim, Yong Man Ro |
AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2312.02512, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
32 | Milos Zelezný, Petr Císar |
Czech audio-visual speech corpus of a car driver for in-vehicle audio-visual speech recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
AVSP ![In: AVSP 2003 - International Conference on Audio-Visual Speech Processing, St. Jorioz, France, September 4-7, 2003, pp. 169-173, 2003, ISCA. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP BibTeX RDF |
|
32 | Laurent Girin, Jean-Luc Schwartz, Gang Feng 0002 |
Can the visual input make the audio signal "pop out" in noise ? a first study of the enhancement of noisy VCV acoustic sequences by audio-visual fusion. ![Search on Bibsonomy](Pics/bibsonomy.png) |
AVSP ![In: ESCA Workshop on Audio-Visual Speech Processing, AVSP '97, Rhodes, Greece, September 26-27, 1997, pp. 37-40, 1997, ISCA. The full citation details ...](Pics/full.jpeg) |
1997 |
DBLP BibTeX RDF |
|
31 | Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano |
Sound and Visual Tracking for Humanoid Robot. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Appl. Intell. ![In: Appl. Intell. 20(3), pp. 253-266, 2004. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
robot audition, audio-visual integration, audio-visual tracking, computational auditory scene analysis |
31 | Kyoung-Ho Choi, Jong-Hoon Lee |
Constrained optimization for a speech driven talking head. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ISCAS (2) ![In: Proceedings of the 2003 International Symposium on Circuits and Systems, ISCAS 2003, Bangkok, Thailand, May 25-28, 2003, pp. 560-563, 2003, IEEE, 0-7803-7761-3. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP DOI BibTeX RDF |
|
31 | Adam Nash |
Real time art engines 3: post-convergent creative practice in MUVEs. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IE ![In: Proceedings of the 4th Australasian Conference on Interactive Entertainment, IE 2007, 3-5 December 2007, Melbourne, Australia, pp. 19, 2007, ACM, 978-1-921166-87-7. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP BibTeX RDF |
audio-visual composition, post-convergent space, Second Life, multi-user virtual environments |
31 | Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez, Iain McCowan |
Multimodal multispeaker probabilistic tracking in meetings. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMI ![In: Proceedings of the 7th International Conference on Multimodal Interfaces, ICMI 2005, Trento, Italy, October 4-6, 2005, pp. 183-190, 2005, ACM, 1-59593-028-0. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
audio-visual speaker tracking, particle filters, MCMC |
30 | Brit Susan Jensen, Mikael B. Skov, Nissan Thiruravichandran |
Studying driver attention and behaviour for three configurations of GPS navigation in real traffic driving. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CHI ![In: Proceedings of the 28th International Conference on Human Factors in Computing Systems, CHI 2010, Atlanta, Georgia, USA, April 10-15, 2010, pp. 1271-1280, 2010, ACM, 978-1-60558-929-9. The full citation details ...](Pics/full.jpeg) |
2010 |
DBLP DOI BibTeX RDF |
eye glances, navigation guide, output modalities, gps, driving, field experiment, in-vehicle systems |
30 | Martin J. Hicks, Sarah Nichols 0001, Claire O'Malley |
Comparing the roles of 3D representations in audio and audio-visual collaborations. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Virtual Real. ![In: Virtual Real. 7(3-4), pp. 148-163, 2004. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
Collaborative problem solving, Information visualisation, 3D representations |
30 | Louis H. Terry, Aggelos K. Katsaggelos |
A phone-viseme dynamic Bayesian network for audio-visual automatic speech recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICPR ![In: 19th International Conference on Pattern Recognition (ICPR 2008), December 8-11, 2008, Tampa, Florida, USA, pp. 1-4, 2008, IEEE Computer Society, 978-1-4244-2175-6. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
30 | Paisarn Muneesawang, Tahir Amin, Ling Guan |
Audio Visual Cues for Video Indexing and Retrieval. ![Search on Bibsonomy](Pics/bibsonomy.png) |
PCM (1) ![In: Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30 - December 3, 2004, Proceedings, Part I, pp. 642-649, 2004, Springer, 3-540-23974-X. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
30 | Marios Kyperountas, Constantine Kotropoulos, Ioannis Pitas |
Enhanced Eigen-Audioframes for Audiovisual Scene Change Detection. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Multim. ![In: IEEE Trans. Multim. 9(4), pp. 785-797, 2007. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
30 | Kai Nickel, Tobias Gehrig, Hazim Kemal Ekenel, John W. McDonough, Rainer Stiefelhagen |
An Audio-Visual Particle Filter for Speaker Tracking on the CLEAR'06 Evaluation Dataset. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CLEAR ![In: Multimodal Technologies for Perception of Humans, First International Evaluation Workshop on Classification of Events, Activities and Relationships, CLEAR 2006, Southampton, UK, April 6-7, 2006, Revised Selected Papers, pp. 69-80, 2006, Springer, 978-3-540-69567-7. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
29 | Gerasimos Potamianos, Chalapathy Neti |
Improved ROI and within frame discriminant features for lipreading. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICIP (3) ![In: Proceedings of the 2001 International Conference on Image Processing, ICIP 2001, Thessaloniki, Greece, October 7-10, 2001, pp. 250-253, 2001, IEEE, 0-7803-6725-1. The full citation details ...](Pics/full.jpeg) |
2001 |
DBLP DOI BibTeX RDF |
|
29 | Tian Gan, Wolfgang Menzel, Jianwei Zhang 0001 |
Using the Tandem Approach for AF Classification in an AVSR System. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ISNN (2) ![In: Advances in Neural Networks - ISNN 2008, 5th International Symposium on Neural Networks, ISNN 2008, Beijing, China, September 24-28, 2008, Proceedings, Part II, pp. 830-839, 2008, Springer, 978-3-540-87733-2. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
Articulatory Features, MLP, Audio Visual Speech Recognition |
29 | Dong Zhang 0001, Daniel Gatica-Perez, Samy Bengio |
Semi-supervised meeting event recognition with adapted HMMs. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICME ![In: Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, ICME 2005, July 6-9, 2005, Amsterdam, The Netherlands, pp. 1102-1105, 2005, IEEE Computer Society, 0-7803-9331-7. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
audio-visual event, semisupervised framework, meeting event recognition, HMM adaptation technique |
28 | Gerald Friedland, Chuohao Yeo, Hayley Hung |
Visual speaker localization aided by acoustic models. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ACM Multimedia ![In: Proceedings of the 17th International Conference on Multimedia 2009, Vancouver, British Columbia, Canada, October 19-24, 2009, pp. 195-202, 2009, ACM, 978-1-60558-608-3. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
visual localization, multimodal integration, speaker diarization |
28 | Elise Arnaud, Heidi Christensen, Yan-Chen Lu, Jon Barker, Vasil Khalidov, Miles E. Hansard, Bertrand Holveck, Hervé Mathieu, Ramya Narasimha, Elise Taillant, Florence Forbes, Radu Horaud |
The CAVA corpus: synchronised stereoscopic and binaural datasets with head movements. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMI ![In: Proceedings of the 10th International Conference on Multimodal Interfaces, ICMI 2008, Chania, Crete, Greece, October 20-22, 2008, pp. 109-116, 2008, ACM, 978-1-60558-198-9. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
binaural hearing, database, stereo vision |
Displaying result #1 - #100 of 3474 (100 per page; Change: ) Pages: [ 1][ 2][ 3][ 4][ 5][ 6][ 7][ 8][ 9][ 10][ >>] |
|