|
|
|
|
Venues (Conferences, Journals, ...)
|
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 34 occurrences of 32 keywords
|
|
|
|
|
Results
Found 86 publication records. Showing 86 according to the selection in the facets
| Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
| 2 | Krzysztof Slot, Jaroslaw Cichosz, Lukasz Bronakowski |
Emotion Recognition with Poincare Mapping of Voiced-Speech Segments of Utterances.  |
ICAISC  |
2008 |
DBLP DOI BibTeX RDF |
Emotional Speech Recognition, Poincaré Maps, Feature Selection |
| 2 | Viktor Rozgic, Kyu Jeong Han, Panayiotis G. Georgiou, Shrikanth S. Narayanan |
Multimodal Speaker Segmentation in Presence of Overlapped Speech Segments.  |
ISM  |
2008 |
DBLP DOI BibTeX RDF |
|
| 2 | Tomohiro Nakatani, Takafumi Hikichi, Keisuke Kinoshita, Takuya Yoshioka, Marc Delcroix, Masato Miyoshi, Biing-Hwang Juang |
Robust blind dereverberation of speech signals based on characteristics of short-time speech segments.  |
ISCAS  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Fernando Perdigão, Cláudio Neves, Luís Sá |
Pathological Voice Detection using Turbulent Speech Segments.  |
BIOSIGNALS  |
2012 |
DBLP BibTeX RDF |
|
| 1 | Viktor Rozgic, Kyu Jeong Han, Panayiotis G. Georgiou, Shrikanth Narayanan |
Multimodal Speaker Segmentation and Identification in Presence of Overlapped Speech Segments.  |
Journal of Multimedia  |
2010 |
DBLP DOI BibTeX RDF |
|
| 1 | Sree Harsha Yella, Vasudeva Varma, Kishore Prahallad |
Prominence based scoring of speech segments for automatic speech-to-speech summarization.  |
INTERSPEECH  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Richard Dufour, Fethi Bougares, Yannick Estève, Paul Deléglise |
Unsupervised model adaptation on targeted speech segments for LVCSR system combination.  |
INTERSPEECH  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Thomas Ewender, Beat Pfister |
Accurate pitch marking for prosodic modification of speech segments.  |
INTERSPEECH  |
2010 |
DBLP BibTeX RDF |
|
| 1 | Kyu Jeong Han, Shrikanth S. Narayanan |
Improved speaker diarization of meeting speech with recurrent selection of representative speech segments and participant interaction pattern modeling.  |
INTERSPEECH  |
2009 |
DBLP BibTeX RDF |
|
| 1 | Kentaro Ishizuka, Shoko Araki, Kazuhiro Otsuka, Tomohiro Nakatani, Masakiyo Fujimoto |
A speaker diarization method based on the probabilistic fusion of audio-visual location information.  |
ICMI  |
2009 |
DBLP DOI BibTeX RDF |
multi-modal systems, multi-party conversation analysis, speaker diarization |
| 1 | Gerald Friedland, Luke R. Gottlieb, Adam Janin |
Joke-o-mat: browsing sitcoms punchline by punchline.  |
ACM Multimedia  |
2009 |
DBLP DOI BibTeX RDF |
acoustic event detection, speaker ID, video navigation |
| 1 | Deepu Vijayasenan, Fabio Valente, Hervé Bourlard |
An Information Theoretic Approach to Speaker Diarization of Meeting Data.  |
IEEE Transactions on Audio, Speech & Language Processing  |
2009 |
DBLP DOI BibTeX RDF |
|
| 1 | Mohamed Chetouani, Ammar Mahdhaoui, Fabien Ringeval |
Time-Scale Feature Extractions for Emotional Speech Characterization.  |
Cognitive Computation  |
2009 |
DBLP DOI BibTeX RDF |
Time-scales analysis, Statistical fusion, Data-driven approach, Feature extraction, Emotional speech |
| 1 | Esfandiar Zavarehei, Saeed Vaseghi |
Interpolation of Lost Speech Segments Using LP-HNM Model With Codebook Post-Processing.  |
IEEE Transactions on Multimedia  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Soheil Shafiee, Farshad Almasganj, Ayyoob Jafari |
Speech/non-speech segments detection based on chaotic and prosodic features.  |
INTERSPEECH  |
2008 |
DBLP BibTeX RDF |
|
| 1 | Jun Du, Qiang Huo |
A feature compensation approach using piecewise linear approximation of an explicit distortion model for noisy speech recognition.  |
ICASSP  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Cong Li, Zhijian Ou, Wei Hu, Tao Wang, Yimin Zhang |
Caption-aided speech detection in videos.  |
ICASSP  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Konstantin Markov, Shun Nakamura |
Language identification with dynamic hidden Markov network.  |
ICASSP  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Akihiro Yoshida, Hideyuki Mizuno, Kazunori Mano |
Segment selection method based on tonal validity evaluation using machine learning for concatenative speech synthesis.  |
ICASSP  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Kyu Jeong Han, Panayiotis G. Georgiou, Shrikanth S. Narayanan |
The SAIL speaker diarization system for analysis of spontaneous meetings.  |
MMSP  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | K. J. Han, S. Kim, S. S. Narayanan |
Strategies to Improve the Robustness of Agglomerative Hierarchical Clustering Under Data Source Variation for Speaker Diarization.  |
IEEE Transactions on Audio, Speech & Language Processing  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Tomohiro Nakatani, Biing-Hwang Juang, Takuya Yoshioka, Keisuke Kinoshita, Marc Delcroix, Masato Miyoshi |
Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model.  |
IEEE Transactions on Audio, Speech & Language Processing  |
2008 |
DBLP DOI BibTeX RDF |
|
| 1 | Marion Dohen |
Speech through the Ear, the Eye, the Mouth and the Hand.  |
COST 2102 School (Vietri)  |
2008 |
DBLP DOI BibTeX RDF |
multimodal speech, auditory-visual perception, sensori-motor, hand-mouth coordination, speech and language development, pointing, prosody |
| 1 | Shuang Zhang, Wei Hu, Tao Wang, Jia Liu, Yimin Zhang |
Speaker Clustering Aided by Visual Dialogue Analysis.  |
PCM  |
2008 |
DBLP DOI BibTeX RDF |
speech segmentation, Speaker clustering, dialogue analysis |
| 1 | Esfandiar Zavarehei, Saeed Vaseghi |
Interpolation of lost speech segments using LP-HNM model with codebook-mapping post-processing.  |
ASRU  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | David R. Hill |
Speaker Classification Concepts: Past, Present and Future.  |
Speaker Classification  |
2007 |
DBLP DOI BibTeX RDF |
voice morphing, socio-phonetics, speech forensics, speech research tools, speech segments, speech prosody, formant sensitivity analysis, dialogue dynamics, gnuspeech, face recognition, rhythm, emotional intelligence, intonation, speaker classification, mimicry, impersonation |
| 1 | Hiroyuki Sakai, Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano, Akinobu Lee |
Voice activity detection applied to hands-free spoken dialogue robot based on decoding using acoustic and language model.  |
ROBOCOMM  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Cheng-Hsiung Hsieh, Ting-Yu Feng, Ren-Hsien Huang |
Voice Activity Detection Based on GM(1, 1) Model.  |
ACIS-ICIS  |
2007 |
DBLP DOI BibTeX RDF |
Voice activity detection (VAD), GM(1,1) model, signal/noise estimation, G.729, GSM AMR, grey model |
| 1 | Liang Wang 0003, Eliathamby Ambikairajah, Eric H. C. Choi |
A Novel Method for Automatic Tonal and Non-Tonal Language Classification.  |
ICME  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Wai Nang Chan, Nengheng Zheng, Tan Lee |
Discrimination Power of Vocal Source and Vocal Tract Related Features for Speaker Segmentation.  |
IEEE Transactions on Audio, Speech & Language Processing  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | Chung-Hsien Wu, Chia-Hsin Hsieh, Chien-Lin Huang |
Speech Sentence Compression Based on Speech Segment Extraction and Concatenation.  |
IEEE Transactions on Multimedia  |
2007 |
DBLP DOI BibTeX RDF |
|
| 1 | S. Mostafa Rahimi Azghadi, Mohammad Reza Bonyadi, Hamed Shah-Hosseini |
Gender Classification Based on FeedForward Backpropagation Neural Network.  |
AIAI  |
2007 |
DBLP DOI BibTeX RDF |
pitch features, Fast Fourier Transform, Gender classifications, Backpropagation neural network |
| 1 | Anthony Brew, Marco Grimaldi, Padraig Cunningham |
An evaluation of one-class classification techniques for speaker verification.  |
Artif. Intell. Rev.  |
2007 |
DBLP DOI BibTeX RDF |
One-class classifiers, Gaussian mixture models, Speaker verification |
| 1 | Yusuke Hiwasaki, Toru Morinaga, Jotaro Ikedo, Akitoshi Kataoka |
Measuring the Perceived Importance of Speech Segments for Transmission over IP Networks.  |
IEICE Transactions  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Zhen-Hua Ling, Ren-Hua Wang |
HMM-based unit selection using frame sized speech segments.  |
INTERSPEECH  |
2006 |
DBLP BibTeX RDF |
|
| 1 | Antonio Camarena-Ibarrola, Edgar Chávez |
Using a new Discretization of the Fourier Transform to Discriminate Voiced From Unvoiced Speech.  |
ENC  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Lei Chen 0002, Sule Gündüz, M. Tamer Özsu |
Mixed Type Audio Classification with Support Vector Machine.  |
ICME  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Shinya Mori, Tsuyoshi Moriyama, Shinji Ozawa |
Emotional Speech Synthesis using Subspace Constraints in Prosody.  |
ICME  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Huazhong Ning, Wei Xu, Yihong Gong, Thomas S. Huang |
Improving Speaker Diarization by Cross EM Refinement.  |
ICME  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Avishay Amsalem, Ilan D. Shallom |
Time Frequency Representation for Speech Recognition.  |
ITRE  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | S. Gazor, R. R. Far |
Adaptive maximum windowed likelihood multicomponent AM-FM signal decomposition.  |
IEEE Transactions on Audio, Speech & Language Processing  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Yoshifumi Nagata, T. Fujioka, Masato Abe |
Speech enhancement based on auto gain control.  |
IEEE Transactions on Audio, Speech & Language Processing  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Geert Rombouts, Toon van Waterschoot, Kris Struyve, Marc Moonen |
Acoustic feedback cancellation for long acoustic paths using a nonstationary source model.  |
IEEE Transactions on Signal Processing  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | Jing Deng, Thomas Fang Zheng, Wenhu Wu |
UBM Based Speaker Segmentation and Clustering for 2-Speaker Detection.  |
ISCSLP  |
2006 |
DBLP DOI BibTeX RDF |
Multi-speaker, Speaker segmentation, Speaker clustering, Speaker Detection |
| 1 | France Mihelic, Janez Zibert |
Robust Speech Detection Based on Phoneme Recognition Features.  |
TSD  |
2006 |
DBLP DOI BibTeX RDF |
|
| 1 | T. M. Sunil Kumar, T. V. Sreenivas |
Speech enhancement using Markov model of speech segments.  |
INTERSPEECH  |
2005 |
DBLP BibTeX RDF |
|
| 1 | José Anibal Arias |
Unsupervised identification of speech segments using kernel methods for clustering.  |
INTERSPEECH  |
2005 |
DBLP BibTeX RDF |
|
| 1 | Dimitrios Ververidis, Constantine Kotropoulos |
Emotional Speech Classification Using Gaussian Mixture Models and the Sequential Floating Forward Selection Algorithm.  |
ICME  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Ling Guo, Ying-Chun Shi, Xianzhong Zhou, Feng Zhang |
Location and Extraction of Broadcast in News Video Based on QGMM and BIC.  |
CIT  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Farook Sattar, Moe Pwint |
A new speech/non-speech classification method using minimal Walsh basis functions.  |
ISCAS  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Sanshzar Kettebekov, Mohammed Yeasin, Rajeev Sharma |
Prosody based audiovisual coanalysis for coverbal gesture recognition.  |
IEEE Transactions on Multimedia  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Vivek Tyagi, Christian Wellekens, Hervé Bourlard |
A Variable-Scale Piecewise Stationary Spectral Analysis Technique Applied to ASR.  |
MLMI  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | Yannis Pantazis, Yannis Stylianou |
On the Detection of Discontinuities in Concatenative Speech Synthesis.  |
WNSP  |
2005 |
DBLP DOI BibTeX RDF |
|
| 1 | F. Coldefy, Patrick Bouthemy |
Unsupervised soccer video abstraction based on pitch, dominant color and camera motion analysis.  |
ACM Multimedia  |
2004 |
DBLP DOI BibTeX RDF |
non verbal speech classification, video summarization |
| 1 | Ying Li, Chitra Dorai |
Analyzing discussion scene contents in instructional videos.  |
ACM Multimedia  |
2004 |
DBLP DOI BibTeX RDF |
discussion pattern detection, discussion scene, instructional video content analysis, e-learning, speaker clustering |
| 1 | László Tóth, Gábor Gosztolya |
Replicator Neural Networks for Outlier Modeling in Segmental Speech Recognition.  |
ISNN  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Walid Karam, Chafic Mokbel, Hanna Greige, Guido Aversano, Catherine Pelachaud, Gérard Chollet |
An Audio-Visual Imposture Scenario by Talking Face Animation.  |
Summer School on Neural Networks  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Weifeng Li, Takanori Nishino, Chiyomi Miyajima, Katsunobu Itou, Kazuya Takeda, Fumitada Itakura |
In-Car Speech Recognition Using Distributed Multiple Microphones.  |
PCM  |
2004 |
DBLP DOI BibTeX RDF |
|
| 1 | Shi-Huang Chen, Jhing-Fa Wang |
Speech Enhancement Using Perceptual Wavelet Packet Decomposition and Teager Energy Operator.  |
VLSI Signal Processing  |
2004 |
DBLP DOI BibTeX RDF |
perceptual wavelet packet decomposition (PWPD), teager energy operator (TEO), time-adaptive thresholding (TAT), speech enhancement |
| 1 | E. Mendoza, G. Carballo, A. Cruz, M. D. Fresneda, J. Muñoz, V. Marrero |
Temporal variability in speech segments of Spanish: context and speaker related differences.  |
Speech Communication  |
2003 |
DBLP DOI BibTeX RDF |
|
| 1 | Sanshzar Kettebekov, Mohammed Yeasin, Rajeev Sharma |
Improving Continuous Gesture Recognition with Spoken Prosody.  |
CVPR  |
2003 |
DBLP DOI BibTeX RDF |
|
| 1 | Olivia Donnellan, Elmar Jung, Eugene Coyle |
Speech-Adaptive Time-Scale Modification for Computer Assisted Language-Learning.  |
ICALT  |
2003 |
DBLP DOI BibTeX RDF |
|
| 1 | Dijana Petrovska-Delacrétaz, Asmaa El Hannani, Gérard Chollet |
Searching through a Speech Memory for Text-Independent Speaker Verification.  |
AVBPA  |
2003 |
DBLP DOI BibTeX RDF |
|
| 1 | Robert Batusek, Pavel Gaura |
A Comparison of Unit Selection Techniques in Limited Domain Speech Synthesis.  |
TSD  |
2003 |
DBLP DOI BibTeX RDF |
|
| 1 | Konstantin Biatov |
Large Text and Audio Data Alignment for Multimedia Applications.  |
TSD  |
2003 |
DBLP DOI BibTeX RDF |
|
| 1 | Waleed H. Abdulla |
HMM-based techniques for speech segments extraction.  |
Scientific Programming  |
2002 |
DBLP BibTeX RDF |
|
| 1 | Carlos Lima, Luís B. Almeida, João L. Monteiro |
Improving the role of unvoiced speech segments by spectral normalisation in robust speech recognition.  |
INTERSPEECH  |
2002 |
DBLP BibTeX RDF |
|
| 1 | Weifeng Lee, C. Chandra Sekhar, Kazuya Takeda, Fumitada Itakura |
Recognition of continuous speech segments of monophone units using support vector machines.  |
INTERSPEECH  |
2002 |
DBLP BibTeX RDF |
|
| 1 | Hari Sundaram, Lexing Xie, Shih-Fu Chang |
A utility framework for the automatic generation of audio-visual skims.  |
ACM Multimedia  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | Juan Manuel Huerta |
Alignment-based codeword-dependent cepstral normalization.  |
IEEE Transactions on Speech and Audio Processing  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | K. Ashouri, M. Amini, M. H. Savoji |
Non-linear Prediction of Speech Signal Using Artificial Neural Nets.  |
EurAsia-ICT  |
2002 |
DBLP DOI BibTeX RDF |
|
| 1 | Harald Gustafsson, Ingvar Claesson, Ulf A. Lindgren |
Speech Bandwidth Extension.  |
ICME  |
2001 |
DBLP DOI BibTeX RDF |
|
| 1 | Sofia Tsekeridou, Stelios Krinidis, Ioannis Pitas |
Scene Change Detection Based on Audio-Visual Analysis and Interaction.  |
Theoretical Foundations of Computer Vision  |
2000 |
DBLP DOI BibTeX RDF |
|
| 1 | Xihong Wang, Stephen A. Zahorian, Stefan Auberg |
Analysis of speech segments using variable spectral/temporal resolution.  |
ICSLP  |
1996 |
DBLP BibTeX RDF |
|
| 1 | Ramesh R. Sarukkai, Dana H. Ballard |
The distance set representation of speech segments.  |
EUROSPEECH  |
1995 |
DBLP BibTeX RDF |
|
| 1 | Oded Ghitza, M. Mohan Sondhi |
On the perceptual distance between speech segments.  |
ICSLP  |
1994 |
DBLP BibTeX RDF |
|
| 1 | Minoru Tsuzaki, Hiroaki Kato, Masako Tanaka |
Effects of acoustic discontinuity and phonemic deviation on the apparent duration of speech segments.  |
ICSLP  |
1994 |
DBLP BibTeX RDF |
|
| 1 | V. Ralph Algazi, Kathy L. Brown, Michael J. Ready, David H. Irvine, Christie L. Cadwell, Sang Chung |
Transform representation of the spectra of acoustic speech segments with applications. I. General approach and application to speech recognition.  |
IEEE Transactions on Speech and Audio Processing  |
1993 |
DBLP DOI BibTeX RDF |
|
| 1 | V. Ralph Algazi, Kathy L. Brown, Michael J. Ready, David H. Irvine, Christie L. Cadwell, Sang Chung |
Transform representation of the spectra of acoustic speech segments with applications. II. Speech analysis, synthesis, and coding.  |
IEEE Transactions on Speech and Audio Processing  |
1993 |
DBLP DOI BibTeX RDF |
|
| 1 | Barry Arons |
Hyperspeech.  |
INTERCHI  |
1993 |
DBLP DOI BibTeX RDF |
speech applications, speech recognition, hypermedia, speech synthesis, speech user interfaces, conversational interfaces, speech as data |
| 1 | Henk van den Heuvel, Toni C. M. Rietveld |
Speaker related variability in cepstral representations of dutch speech segments.  |
ICSLP  |
1992 |
DBLP BibTeX RDF |
|
| 1 | Herbert Gish, Kenney Ng, Jan Robin Rohlicek |
Secondary processing using speech segments for an HMM word spotting system.  |
ICSLP  |
1992 |
DBLP BibTeX RDF |
|
| 1 | Henk van den Heuvel, Bert Cranen, Toni C. M. Rietveld |
Speaker related variability in the durations of dutch speech segments.  |
EUROSPEECH  |
1991 |
DBLP BibTeX RDF |
|
| 1 | María José Castro, Francisco Casacuberta |
The Use of Multilayer Perceptrons in Isolated Word Recognition.  |
IWANN  |
1991 |
DBLP DOI BibTeX RDF |
|
| 1 | J. P. Eatock, J. S. D. Mason |
Automatically focusing on good discriminating speech segments in speaker recognition.  |
ICSLP  |
1990 |
DBLP BibTeX RDF |
|
| 1 | B. Yegnanarayana, K. V. Madhu Murthy |
Analysis of short time speech segments based on linear prediction.  |
EUROSPEECH  |
1989 |
DBLP BibTeX RDF |
|
Displaying result #1 - #86 of 86 (100 per page; Change: )
|
|