Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
88 | Takashi Muroi, Tetsuya Takiguchi, Yasuo Ariki |
Speaker Independent Phoneme Recognition Based on Fisher Weight Map. ![Search on Bibsonomy](Pics/bibsonomy.png) |
MUE ![In: 2008 International Conference on Multimedia and Ubiquitous Engineering (MUE 2008), 24-26 April 2008, Busan, Korea, pp. 253-257, 2008, IEEE Computer Society, 978-0-7695-3134-2. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
Fisher weight map, Local auto-correlation feature, Local feature, Phoneme recognition |
76 | Nengheng Zheng, Ning Wang, Tan Lee, P. C. Ching |
Speaker Verification Using Complementary Information from Vocal Source and Vocal Tract. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ISCSLP ![In: Chinese Spoken Language Processing, 5th International Symposium, ISCSLP 2006, Singapore, December 13-16, 2006, Proceedings, pp. 518-528, 2006, Springer, 3-540-49665-3. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
63 | Rajesh M. Hegde, Hema A. Murthy |
Cluster and Intrinsic Dimensionality Analysis of the Modified Group Delay Feature for Speaker Classification. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICONIP ![In: Neural Information Processing, 11th International Conference, ICONIP 2004, Calcutta, India, November 22-25, 2004, Proceedings, pp. 1172-1178, 2004, Springer, 3-540-23931-6. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
61 | Hossein Moeinzadeh, Mehdi Mohammadi, Ahmad Akbari, Babak Nasersharif |
Robust speech recognition using evolutionary class-dependent LDA. ![Search on Bibsonomy](Pics/bibsonomy.png) |
GECCO (Companion) ![In: Genetic and Evolutionary Computation Conference, GECCO 2009, Proceedings, Montreal, Québec, Canada, July 8-12, 2009, Companion Material, pp. 2109-2114, 2009, ACM, 978-1-60558-505-5. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
mfcc, particle swarm optimization, speech recognition, linear discriminate analysis, harmony search, transformation matrix |
61 | Jian Liu, Thomas Fang Zheng, Wenhu Wu |
Pitch Mean Based Frequency Warping. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ISCSLP ![In: Chinese Spoken Language Processing, 5th International Symposium, ISCSLP 2006, Singapore, December 13-16, 2006, Proceedings, pp. 87-94, 2006, Springer, 3-540-49665-3. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
frequency warping, MFCC, Pitch |
56 | Ben Milner, Jonathan Darch, Saeed Vaseghi |
Applying noise compensation methods to robustly predict acoustic speech features from MFCC vectors in noise. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICASSP ![In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2008, March 30 - April 4, 2008, Caesars Palace, Las Vegas, Nevada, USA, pp. 3945-3948, 2008, IEEE, 1-4244-1484-9. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
56 | Azam Sheikh Muhammad, Z. A. Mansoor, M. Shahzad Mughal, S. Mohsin |
Urdu Spoken Digits Recognition Using Classified MFCC and Backpropgation Neural Network. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CGIV ![In: 4th International Conference on Computer Graphics, Imaging and Visualization (CGIV 2007), August 14-16, 2007, Bangkok, Thailand, pp. 414-418, 2007, IEEE Computer Society, 0-7695-2928-3. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
Mel Frequency Cepsptral Coefficients, Urdu spoken digits recognition, Backprapagation |
56 | Sandipan Chakroborty, Anindya Roy, Sourav Majumdar, Goutam Saha 0001 |
Capturing Complementary Information via Reversed Filter Bank and Parallel Implementation with MFCC for Improved Text-Independent Speaker Identification. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICCTA ![In: 2007 International Conference on Computing: Theory and Applications (ICCTA 2007), 5-7 March 2007, Kolkata, India, pp. 463-467, 2007, IEEE Computer Society, 978-0-7695-2770-3. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
56 | Wei Han, Cheong-Fat Chan, Oliver Chiu-sing Choy, Kong-Pang Pun |
An efficient MFCC extraction method in speech recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ISCAS ![In: International Symposium on Circuits and Systems (ISCAS 2006), 21-24 May 2006, Island of Kos, Greece, 2006, IEEE, 0-7803-9389-9. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
51 | Wei Chu, Benoît Champagne 0001 |
A Noise-Robust FFT-Based Auditory Spectrum With Application in Audio Classification. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Speech Audio Process. ![In: IEEE Trans. Speech Audio Process. 16(1), pp. 137-150, 2008. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
51 | Amit S. Malegaonkar, Aladdin M. Ariyaeeinia, P. Sivakumaran, Surosh G. Pillay |
Discrimination Effectiveness of Speech Cepstral Features. ![Search on Bibsonomy](Pics/bibsonomy.png) |
BIOID ![In: Biometrics and Identity Management, First European Workshop, BIOID 2008, Roskilde, Denmark, May 7-9, 2008. Revised Selected Papers, pp. 91-99, 2008, Springer, 978-3-540-89990-7. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
51 | Hemant A. Patil, T. K. Basu |
Cepstral Domain Teager Energy for Identifying Perceptually Similar Languages. ![Search on Bibsonomy](Pics/bibsonomy.png) |
PReMI ![In: Pattern Recognition and Machine Intelligence, Second International Conference, PReMI 2007, Kolkata, India, December 18-22, 2007, Proceedings, pp. 455-462, 2007, Springer, 978-3-540-77045-9. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
51 | David Chow, Waleed H. Abdulla |
Speaker Identification Based on Log Area Ratio and Gaussian Mixture Models in Narrow-Band Speech: Speech Understanding / Interaction. ![Search on Bibsonomy](Pics/bibsonomy.png) |
PRICAI ![In: PRICAI 2004: Trends in Artificial Intelligence, 8th Pacific Rim International Conference on Artificial Intelligence, Auckland, New Zealand, August 9-13, 2004, Proceedings, pp. 901-908, 2004, Springer, 3-540-22817-9. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
51 | Mark D. Skowronski, John G. Harris |
Improving the filter bank of a classic speech feature extraction algorithm. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ISCAS (4) ![In: Proceedings of the 2003 International Symposium on Circuits and Systems, ISCAS 2003, Bangkok, Thailand, May 25-28, 2003, pp. 281-284, 2003, IEEE, 0-7803-7761-3. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP DOI BibTeX RDF |
|
49 | Dacheng Tao, Hao Liu 0007, Xiaoou Tang |
K-BOX: a query-by-singing based music retrieval system. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ACM Multimedia ![In: Proceedings of the 12th ACM International Conference on Multimedia, New York, NY, USA, October 10-16, 2004, pp. 464-467, 2004, ACM, 1-58113-893-8. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
music clustering, music segmentation, music retrieval, MFCC |
38 | Limin Hou, Juanmin Xie |
Compensating Function of Formant Instantaneous Characteristics in Speaker Identification. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IAS ![In: Proceedings of the Fifth International Conference on Information Assurance and Security, IAS 2009, Xi'An, China, 18-20 August 2009, pp. 744-747, 2009, IEEE Computer Society, 978-0-7695-3744-3. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
|
38 | Hui Gao, Shanguang Chen, Guangchuan Su |
Emotion classification of mandarin speech based on TEO nonlinear features. ![Search on Bibsonomy](Pics/bibsonomy.png) |
SNPD (3) ![In: Proceedings of the 8th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2007, July 30 - August 1, 2007, Qingdao, China, pp. 394-398, 2007, IEEE Computer Society, 0-7695-2909-7. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
38 | Shen Huang, Zheng Chen 0001, Yong Yu 0001, Wei-Ying Ma |
Multitype Features Coselection for Web Document Clustering. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Knowl. Data Eng. ![In: IEEE Trans. Knowl. Data Eng. 18(4), pp. 448-459, 2006. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
clustering, Web mining, feature evaluation and selection |
38 | Nengheng Zheng, P. C. Ching, Ning Wang, Tan Lee |
Integrating Complementary Features with a Confidence Measure for Speaker Identification. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ISCSLP ![In: Chinese Spoken Language Processing, 5th International Symposium, ISCSLP 2006, Singapore, December 13-16, 2006, Proceedings, pp. 549-557, 2006, Springer, 3-540-49665-3. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
38 | See-May Phoong, Georges Quénot, Eric Castelli |
Recognizing emotions for the audio-visual document indexing. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ISCC ![In: Proceedings of the 9th IEEE Symposium on Computers and Communications (ISCC 2006), June 28 - July 1, 2004, Alexandria, Egypt, pp. 580-584, 2004, IEEE Computer Society, 0-7803-8623-X. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
36 | Shuhei Hamawaki, Shintaro Funasawa, Jiro Katto, Hiromi Ishizaki, Keiichiro Hoashi, Yasuhiro Takishima |
Feature Analysis and Normalization Approach for Robust Content-Based Music Retrieval to Encoded Audio with Different Bit Rates. ![Search on Bibsonomy](Pics/bibsonomy.png) |
MMM ![In: Advances in Multimedia Modeling, 15th International Multimedia Modeling Conference, MMM 2009, Sophia-Antipolis, France, January 7-9, 2009. Proceedings, pp. 298-309, 2009, Springer, 978-3-540-92891-1. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
Content-based MIR Normalization, Mel-Frequency Cepstral Coefficient (MFCC) |
36 | Shi-Huang Chen, Shih-Hao Chen |
Content-based music genre classification using timbral feature vectors and support vector machine. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICIS ![In: Proceedings of the 2nd International Conference on Interaction Sciences: Information Technology, Culture and Human (ICIS 2009), Seoul, Korea, 24-26 November 2009, pp. 1095-1101, 2009, ACM, 978-1-60558-710-3. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
support vector machine (SVM), mel-frequency cepstral coefficient (MFCC), music genre classification |
36 | P. Chakraborty, F. Ahmed, Md. Monirul Kabir, M. Shahjahan, Kazuyuki Murase |
An Automatic Speaker Recognition System. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICONIP (1) ![In: Neural Information Processing, 14th International Conference, ICONIP 2007, Kitakyushu, Japan, November 13-16, 2007, Revised Selected Papers, Part I, pp. 517-526, 2007, Springer, 978-3-540-69154-9. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
MFCC- Mel-Frequency Cepstrum Co-efficient, DCT: Discrete cosine Transform, IIR:, Infinite impulse response, FIR:, Finite impulse response, FFT:, VQ:, Fast Fourier Transform, Vector Quantization |
36 | Jigish Trivedi, Anutosh Maitra, Suman K. Mitra |
A Hybrid Approach to Speaker Recognition in Multi-speaker Environment. ![Search on Bibsonomy](Pics/bibsonomy.png) |
PReMI ![In: Pattern Recognition and Machine Intelligence, First International Conference, PReMI 2005, Kolkata, India, December 20-22, 2005, Proceedings, pp. 272-275, 2005, Springer, 3-540-30506-8. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
Speech recognition, ICA, Vector Quantization, MFCC |
35 | Qin Li 0016, Yuze Yang, Tianxiang Lan, Huifeng Zhu, Qi Wei 0001, Fei Qiao, Xinjun Liu, Huazhong Yang |
MSP-MFCC: Energy-Efficient MFCC Feature Extraction Method With Mixed-Signal Processing Architecture for Wearable Speech Recognition Applications. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Access ![In: IEEE Access 8, pp. 48720-48730, 2020. The full citation details ...](Pics/full.jpeg) |
2020 |
DBLP DOI BibTeX RDF |
|
35 | Chandrasekhar Paseddula, Suryakanth V. Gangashetty |
DNN based Acoustic Scene Classification using Score Fusion of MFCC and Inverse MFCC. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICIIS ![In: 13th IEEE International Conference on Industrial and Information Systems, ICIIS 2018, Rupnagar, India, December 1-2, 2018, pp. 18-21, 2018, IEEE, 978-1-5386-8492-4. The full citation details ...](Pics/full.jpeg) |
2018 |
DBLP DOI BibTeX RDF |
|
35 | Soma Khan, Joyanta Basu, Milton Samirakshma Bepari |
Performance Evaluation of PBDP Based Real-Time Speaker Identification System with Normal MFCC vs MFCC of LP Residual Features. ![Search on Bibsonomy](Pics/bibsonomy.png) |
PerMIn ![In: Perception and Machine Intelligence - First Indo-Japan Conference, PerMIn 2012, Kolkata, India, January 12-13, 2012. Proceedings, pp. 358-366, 2012, Springer, 978-3-642-27386-5. The full citation details ...](Pics/full.jpeg) |
2012 |
DBLP DOI BibTeX RDF |
|
30 | Vrijendra Singh, Narendra Meena |
Engine Fault Diagnosis using DTW, MFCC and FFT. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IHCI ![In: Proceedings of the First International Conference on Intelligent Human Computer Interaction, IHCI 2009, January 20-23, 2009, Organized by the Indian Institute of Information Technology, Allahabad, India, pp. 83-94, 2009, Springer India, 978-81-8489-404-2. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
|
30 | Srinivasan Umesh, Rohit Sinha 0003 |
A Study of Filter Bank Smoothing in MFCC Features for Recognition of Children's Speech. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Speech Audio Process. ![In: IEEE Trans. Speech Audio Process. 15(8), pp. 2418-2430, 2007. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
30 | Xi Zhou, Yun Fu 0001, Ming Liu 0009, Mark Hasegawa-Johnson, Thomas S. Huang |
Robust Analysis and Weighting on MFCC Components for Speech Recognition and Speaker Identification. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICME ![In: Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007, July 2-5, 2007, Beijing, China, pp. 188-191, 2007, IEEE Computer Society, 1-4244-1017-7. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
25 | Nicolás Morales, Doroteo Torre Toledano, John H. L. Hansen, Javier Garrido Salas |
Feature Compensation Techniques for ASR on Band-Limited Speech. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Speech Audio Process. ![In: IEEE Trans. Speech Audio Process. 17(4), pp. 758-774, 2009. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
|
25 | Chang-Wen Hsu, Lin-Shan Lee |
Higher Order Cepstral Moment Normalization for Improved Robust Speech Recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Speech Audio Process. ![In: IEEE Trans. Speech Audio Process. 17(2), pp. 205-220, 2009. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
|
25 | Yan Guo, Bruno Gas |
Underwater transient and non transient signals classification using predictive neural networks. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IROS ![In: 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 11-15, 2009, St. Louis, MO, USA, pp. 2283-2288, 2009, IEEE, 978-1-4244-3803-7. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
|
25 | Marco Grimaldi, Fred Cummins |
Speaker Identification Using Instantaneous Frequencies. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Speech Audio Process. ![In: IEEE Trans. Speech Audio Process. 16(6), pp. 1097-1111, 2008. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
25 | Xiaodan Zhuang, Xi Zhou, Thomas S. Huang, Mark Hasegawa-Johnson |
Feature analysis and selection for acoustic event detection. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICASSP ![In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2008, March 30 - April 4, 2008, Caesars Palace, Las Vegas, Nevada, USA, pp. 17-20, 2008, IEEE, 1-4244-1484-9. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
25 | Yun Tang, Richard C. Rose |
A study of using locality preserving projections for feature extraction in speech recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICASSP ![In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2008, March 30 - April 4, 2008, Caesars Palace, Las Vegas, Nevada, USA, pp. 1569-1572, 2008, IEEE, 1-4244-1484-9. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
25 | Bo-Zhi Fu, Hong-Bin Zhang |
Feature Extraction Using Wavelet Packet Decomposition Based on MPEG-I. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CSSE (1) ![In: International Conference on Computer Science and Software Engineering, CSSE 2008, Volume 1: Artificial Intelligence, December 12-14, 2008, Wuhan, China, pp. 1048-1052, 2008, IEEE Computer Society, 978-0-7695-3336-0. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
|
25 | Panagiotis Moschonas, Constantine Kotropoulos |
Multimodal Speaker Identification Based on Text and Speech. ![Search on Bibsonomy](Pics/bibsonomy.png) |
BIOID ![In: Biometrics and Identity Management, First European Workshop, BIOID 2008, Roskilde, Denmark, May 7-9, 2008. Revised Selected Papers, pp. 100-109, 2008, Springer, 978-3-540-89990-7. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
multimodal speaker identification, probabilistic latent semantic indexing, speech, text, Mel-frequency cepstral coefficients, nearest neighbor classifier, convex combination |
25 | Hemant A. Patil, Robin Jain, Prakhar Kant Jain |
Identification of Speakers from Their Hum. ![Search on Bibsonomy](Pics/bibsonomy.png) |
TSD ![In: Text, Speech and Dialogue, 11th International Conference, TSD 2008, Brno, Czech Republic, September 8-12, 2008. Proceedings, pp. 461-468, 2008, Springer, 978-3-540-87390-7. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
mel cepstrum, hum, polynomial classifier, Speaker recognition, linear prediction |
25 | Ben Milner, Xu Shao |
Prediction of Fundamental Frequency and Voicing From Mel-Frequency Cepstral Coefficients for Unconstrained Speech Reconstruction. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Speech Audio Process. ![In: IEEE Trans. Speech Audio Process. 15(1), pp. 24-33, 2007. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
25 | Satya Dharanipragada, Umit H. Yapanel, Bhaskar D. Rao |
Robust Feature Extraction for Continuous Speech Recognition Using the MVDR Spectrum Estimation Method. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Speech Audio Process. ![In: IEEE Trans. Speech Audio Process. 15(1), pp. 224-234, 2007. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
25 | Eun Ho Kim, Kyung Hak Hyun, Soo-Hyun Kim, Yoon Keun Kwak |
Speech Emotion Recognition Using Eigen-FFT in Clean and Noisy Environments. ![Search on Bibsonomy](Pics/bibsonomy.png) |
RO-MAN ![In: IEEE RO-MAN 2007, 16th IEEE International Symposium on Robot & Human Interactive Communication, August 26-29, 2007, Jeju Island, South Korea, Proceedings, pp. 689-694, 2007, IEEE, 978-1-4244-1634-9. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
25 | Andrew Errity, John McKenna, Barry Kirkpatrick |
Manifold Learning-Based Feature Transformation for Phone Classification. ![Search on Bibsonomy](Pics/bibsonomy.png) |
NOLISP ![In: Advances in Nonlinear Speech Processing, International Conference on Non-Linear Speech Processing, NOLISP 2007, Paris, France, May 22-25, 2007, Revised Selected Papers, pp. 132-141, 2007, Springer, 978-3-540-77346-7. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
25 | Wei Chu, Benoît Champagne 0001 |
An Improved Implementation for an Auditory-Inspired FFT Model with Application in Audio Classification. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICME ![In: Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007, July 2-5, 2007, Beijing, China, pp. 196-199, 2007, IEEE Computer Society, 1-4244-1017-7. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
25 | Chang-Hsing Lee, Jau-Ling Shih, Kun-Ming Yu, Jung-Mau Su |
Automatic Music Genre Classification using Modulation Spectral Contrast Feature. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICME ![In: Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007, July 2-5, 2007, Beijing, China, pp. 204-207, 2007, IEEE Computer Society, 1-4244-1017-7. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
25 | Bong-Wan Kim, Dae-Lim Choi, Yong-Ju Lee |
Speech/Music Discrimination Using Mel-Cepstrum Modulation Energy. ![Search on Bibsonomy](Pics/bibsonomy.png) |
TSD ![In: Text, Speech and Dialogue, 10th International Conference, TSD 2007, Pilsen, Czech Republic, September 3-7, 2007, Proceedings, pp. 406-414, 2007, Springer, 978-3-540-74627-0. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
|
25 | Marcus Holmberg, David Gelbart, Werner Hemmert |
Automatic speech recognition with an adaptation model motivated by auditory processing. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Speech Audio Process. ![In: IEEE Trans. Speech Audio Process. 14(1), pp. 43-49, 2006. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
25 | Avishay Amsalem, Ilan D. Shallom |
Time Frequency Representation for Speech Recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ITRE ![In: ITRE 2006 - 4th International Conference on Information Technology: Research and Education, October 17-18, 2006, Tel Aviv, Israel, Proceedings, pp. 99-103, 2006, IEEE, 1-4244-0858-X. The full citation details ...](Pics/full.jpeg) |
2006 |
DBLP DOI BibTeX RDF |
|
25 | Jinfang Wang, Jinbao Wang |
Speaker Recognition Using Features Derived from Fractional Fourier Transform. ![Search on Bibsonomy](Pics/bibsonomy.png) |
AutoID ![In: Proceedings of the Fourth IEEE Workshop on Automatic Identification Advanced Technologies (AutoID 2005), 16-18 October 2005, Buffalo, NY, USA, pp. 95-100, 2005, IEEE Computer Society, 0-7695-2475-3. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
25 | Kevin M. Indrebo, Richard J. Povinelli, Michael T. Johnson |
Third-Order Moments of Filtered Speech Signals for Robust Speech Recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
NOLISP ![In: Nonlinear Analyses and Algorithms for Speech Processing, International Conference on Non-Linear Speech Processing, NOLISP 2005, Barcelona, Spain, April 19-22, 2005, Revised Selected Papers, pp. 277-283, 2005, Springer, 3-540-31257-9. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
25 | Andreas K. Maier, Christian Hacker, Stefan Steidl, Elmar Nöth, Heinrich Niemann |
Robust Parallel Speech Recognition in Multiple Energy Bands. ![Search on Bibsonomy](Pics/bibsonomy.png) |
DAGM-Symposium ![In: Pattern Recognition, 27th DAGM Symposium, Vienna, Austria, August 31 - September 2, 2005, Proceedings, pp. 133-140, 2005, Springer, 3-540-28703-5. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
25 | Roman Jarina, Michal Kuba, Martin Paralic |
Compact Representation of Speech Using 2-D Cepstrum - An Application to Slovak Digits Recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
TSD ![In: Text, Speech and Dialogue, 8th International Conference, TSD 2005, Karlovy Vary, Czech Republic, September 12-15, 2005, Proceedings, pp. 342-347, 2005, Springer, 3-540-28789-2. The full citation details ...](Pics/full.jpeg) |
2005 |
DBLP DOI BibTeX RDF |
|
25 | Pei Yin, Irfan A. Essa, James M. Rehg |
Asymmetrically Boosted HMM for Speech Reading. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CVPR (2) ![In: 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June - 2 July 2004, Washington, DC, USA, pp. 755-761, 2004, IEEE Computer Society, 0-7695-2158-4. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
25 | Gustavo Carneiro 0001, Allan D. Jepson |
Flexible Spatial Models for Grouping Local Image Features. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CVPR (2) ![In: 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June - 2 July 2004, Washington, DC, USA, pp. 747-754, 2004, IEEE Computer Society, 0-7695-2158-4. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
25 | Rong Zheng 0005, Shuwu Zhang, Bo Xu 0002 |
Improvement of Speaker Identification by Combining Prosodic Features with Acoustic Features. ![Search on Bibsonomy](Pics/bibsonomy.png) |
SINOBIOMETRICS ![In: Advances in Biometric Person Authentication, 5th Chinese Conference on Biometric Recognition, SINOBIOMETRICS 2004, Guangzhou, China, December 13-14, 2004, Proceedings, pp. 569-576, 2004, Springer, 3-540-24029-2. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
25 | Todor Ganchev, Mihalis Siafarikas, Nikos Fakotakis |
Speaker Verification Based on Wavelet Packets. ![Search on Bibsonomy](Pics/bibsonomy.png) |
TSD ![In: Text, Speech and Dialogue, 7th International Conference, TSD 2004, Brno, Czech Republic, September 8-11, 2004, Proceedings, pp. 299-306, 2004, Springer, 3-540-23049-1. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
25 | Ruhsar Soganci, Fikret S. Gürgen, Haluk Topcuoglu |
Parallel Implementation of a VQ-Based Text-Independent Speaker Identification. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ADVIS ![In: Advances in Information Systems, Third International Conference, ADVIS 2004, Izmir, Turkey, October 20-22, 2004, Proceedings, pp. 291-300, 2004, Springer, 3-540-23478-0. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
25 | Hemant A. Patil, T. K. Basu |
The Teager Energy Based Features for Identification of Identical Twins in Multi-lingual Environment. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICONIP ![In: Neural Information Processing, 11th International Conference, ICONIP 2004, Calcutta, India, November 22-25, 2004, Proceedings, pp. 333-337, 2004, Springer, 3-540-23931-6. The full citation details ...](Pics/full.jpeg) |
2004 |
DBLP DOI BibTeX RDF |
|
25 | Mohamed Kamal Omar, Mark Hasegawa-Johnson |
Approximately independent factors of speech using nonlinear symplectic transformation. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Speech Audio Process. ![In: IEEE Trans. Speech Audio Process. 11(6), pp. 660-671, 2003. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP DOI BibTeX RDF |
|
25 | Ningping Fan, Justinian P. Rosca |
Enhanced VQ-Based Algorithms for Speech Independent Speaker Identification. ![Search on Bibsonomy](Pics/bibsonomy.png) |
AVBPA ![In: Audio-and Video-Based Biometrie Person Authentication, 4th International Conference, AVBPA 2003, Guildford, UK, June 9-11, 2003 Proceedings, pp. 470-477, 2003, Springer, 3-540-40302-7. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP DOI BibTeX RDF |
|
25 | Murat Deviren, Khalid Daoudi |
Frequency and Wavelet Filtering for Robust Speech Recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICANN ![In: Artificial Neural Networks and Neural Information Processing - ICANN/ICONIP 2003, Joint International Conference ICANN/ICONIP 2003, Istanbul, Turkey, June 26-29, 2003, Proceedings, pp. 452-462, 2003, Springer, 3-540-40408-2. The full citation details ...](Pics/full.jpeg) |
2003 |
DBLP DOI BibTeX RDF |
|
25 | Pui-Fung Wong, Man-Hung Siu |
Integration of Tone Related Feature for Chinese Speech Recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICMI ![In: 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 14-16 October 2002, Pittsburgh, PA, USA, pp. 64-68, 2002, IEEE Computer Society, 0-7695-1834-6. The full citation details ...](Pics/full.jpeg) |
2002 |
DBLP DOI BibTeX RDF |
|
23 | Shih-Hao Chen, Shi-Huang Chen, Rodrigo Capobianco Guido |
Music Genre Classification Algorithm Based on Dynamic Frame Analysis and Support Vector Machine. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ISM ![In: 12th IEEE International Symposium on Multimedia, ISM 2010, Taichung, Taiwan, December 13-15, 2010, pp. 357-361, 2010, IEEE Computer Society, 978-1-4244-8672-4. The full citation details ...](Pics/full.jpeg) |
2010 |
DBLP DOI BibTeX RDF |
support vector machine (SVM), mel-frequency cepstral coefficient (MFCC), music genre classification |
23 | Orion Fausto Reyes-Galaviz, Carlos A. Reyes García |
Fuzzy Relational Compression Applied on Feature Vectors for Infant Cry Recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
MICAI ![In: MICAI 2009: Advances in Artificial Intelligence, 8th Mexican International Conference on Artificial Intelligence, Guanajuato, Mexico, November 9-13, 2009. Proceedings, pp. 420-431, 2009, Springer, 978-3-642-05257-6. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
Fuzzy Relational Product, Infant Cry Analysis, Feature Compression, MFCC, Time Delay Neural Networks |
23 | Thilo Stadelmann, Bernd Freisleben |
Unfolding speaker clustering potential: a biomimetic approach. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ACM Multimedia ![In: Proceedings of the 17th International Conference on Multimedia 2009, Vancouver, British Columbia, Canada, October 19-24, 2009, pp. 185-194, 2009, ACM, 978-1-60558-608-3. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
temporal context, GMM, speaker identification, MFCC, speaker diarization, one-class SVM, speaker clustering |
23 | Jian-wei Zhu, Shuifa Sun, Xiao-li Liu, Bang Jun Lei |
Pitch in Speaker Recognition. ![Search on Bibsonomy](Pics/bibsonomy.png) |
HIS (1) ![In: 9th International Conference on Hybrid Intelligent Systems (HIS 2009), August 12-14, 2009, Shenyang, China, pp. 33-36, 2009, IEEE Computer Society, 978-0-7695-3745-0. The full citation details ...](Pics/full.jpeg) |
2009 |
DBLP DOI BibTeX RDF |
endpoint detection, speaker recognition, MFCC, pitch, pitch contour |
23 | Stavros Ntalampiras, Ilyas Potamitis, Nikos Fakotakis |
Automatic Recognition of Urban Soundscenes. ![Search on Bibsonomy](Pics/bibsonomy.png) |
New Directions in Intelligent Interactive Multimedia ![In: New Directions in Intelligent Interactive Multimedia, pp. 147-153, 2008, Springer, 978-3-540-68126-7. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
Automatic audio recognition, MPEG-7 audio, Gaussian mixture model (GMM), MFCC, Computer Audition |
23 | Inge Gavat, Corneliu Octavian Dumitru |
The ASRS_RL - A Research Platform for Spoken Language Recognition and Understanding Experiments. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICCSA (2) ![In: Computational Science and Its Applications - ICCSA 2008, International Conference, Perugia, Italy, June 30 - July 3, 2008, Proceedings, Part II, pp. 1142-1157, 2008, Springer, 978-3-540-69840-1. The full citation details ...](Pics/full.jpeg) |
2008 |
DBLP DOI BibTeX RDF |
PLP, LPC coefficients, feature extraction, hidden Markov model, artificial neural network, Speech recognition, MFCC |
23 | Stefan Schacht, Jacques C. Koreman, Christoph Lauer, Andrew C. Morris, Dalei Wu, Dietrich Klakow |
Frame Based Features. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Speaker Classification (1) ![In: Speaker Classification I: Fundamentals, Features, and Methods, pp. 226-240, 2007, Springer, 978-3-540-74186-2. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
LPC, Feature Extraction, Wavelets, Speaker Identification, MFCC |
23 | Szabolcs Levente Tóth, David Sztahó, Klára Vicsi |
Speech Emotion Perception by Human and Machine. ![Search on Bibsonomy](Pics/bibsonomy.png) |
COST 2102 Workshop (Patras) ![In: Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction, COST Action 2102 International Conference, Patras, Greece, October 29-31, 2007. Revised Papers, pp. 213-224, 2007, Springer, 978-3-540-70871-1. The full citation details ...](Pics/full.jpeg) |
2007 |
DBLP DOI BibTeX RDF |
Human Speech Perception, Hidden Markov Models, Emotion Recognition, Automatic Speech Recognition, Speech Technology, MFCC |
18 | Cai Li, Haochang Zhi, Kaiyue Yang, Junyi Qian, Zhihao Yan, Lixuan Zhu, Chao Chen, Xi Wang 0009, Weiwei Shan |
A 0.61-μW Fully Integrated Keyword-Spotting ASIC With Real-Point Serial FFT-Based MFCC and Temporal Depthwise Separable CNN. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE J. Solid State Circuits ![In: IEEE J. Solid State Circuits 59(3), pp. 867-877, March 2024. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
18 | Siba Prasad Mishra, Pankaj Warule, Suman Deb |
Speech emotion recognition using MFCC-based entropy feature. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Signal Image Video Process. ![In: Signal Image Video Process. 18(1), pp. 153-161, February 2024. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
18 | Mahendra Kumar Gourisaria, Rakshit Agrawal, Manoj Sahni, Pradeep Kumar Singh 0001 |
Comparative analysis of audio classification with MFCC and STFT features using machine learning techniques. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Discov. Internet Things ![In: Discov. Internet Things 4(1), pp. 1, December 2024. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
18 | Wei Pei, Yan Li, Peng Wen, Fuwen Yang, Xiaopeng Ji |
An automatic method using MFCC features for sleep stage classification. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Brain Informatics ![In: Brain Informatics 11(1), pp. 6, December 2024. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
18 | S. Johanan Joysingh, P. Vijayalakshmi 0001, T. Nagarajan 0001 |
Significance of Chirp MFCC as a Feature in Speech and Audio Applications. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2402.12239, 2024. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
18 | T. M. Nithya, P. Dhivya, S. N. Sangeethaa, P. Rajesh Kanna |
TB-MFCC multifuse feature for emergency vehicle sound classification using multistacked CNN - Attention BiLSTM. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Biomed. Signal Process. Control. ![In: Biomed. Signal Process. Control. 88(Part A), pp. 105688, February 2024. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
18 | Dali Liu, Hongyuan Yang, Weimin Hou, Baozhu Wang |
A Novel Underwater Acoustic Target Recognition Method Based on MFCC and RACNN. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Sensors ![In: Sensors 24(1), pp. 273, 2024. The full citation details ...](Pics/full.jpeg) |
2024 |
DBLP DOI BibTeX RDF |
|
18 | Weiwei Shan, Junyi Qian, Lixuan Zhu, Jun Yang 0006, Cheng Huang 0005, Hao Cai |
AAD-KWS: A Sub-μ W Keyword Spotting Chip With an Acoustic Activity Detector Embedded in MFCC and a Tunable Detection Window in 28-nm CMOS. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE J. Solid State Circuits ![In: IEEE J. Solid State Circuits 58(3), pp. 867-876, March 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Mahmud Esad Arar, Herman Sedef |
An efficient lung sound classification technique based on MFCC and HDMR. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Signal Image Video Process. ![In: Signal Image Video Process. 17(8), pp. 4385-4394, November 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Kalyanapu Jagadeeshwar, T. Sreenivasarao, Padmaja Pulicherla, K. N. V. Satyanarayana, K. Mohana Lakshmi, Pala Mahesh Kumar |
ASERNet: Automatic speech emotion recognition system using MFCC-based LPC approach with deep learning CNN. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Int. J. Model. Simul. Sci. Comput. ![In: Int. J. Model. Simul. Sci. Comput. 14(4), pp. 2341029:1-2341029:22, August 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Nilashree Wankhede, Sushama Wagh |
Enhancing Biometric Speaker Recognition Through MFCC Feature Extraction and Polar Codes for Remote Application. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Access ![In: IEEE Access 11, pp. 133921-133930, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Suprava Patnaik |
Speech emotion recognition by using complex MFCC and deep sequential model. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Multim. Tools Appl. ![In: Multim. Tools Appl. 82(8), pp. 11897-11922, March 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Mainak Biswas, Saif Rahaman, Ali Ahmadian, Kamalularifin Subari, Pawan Kumar Singh |
Automatic spoken language identification using MFCC based time series features. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Multim. Tools Appl. ![In: Multim. Tools Appl. 82(7), pp. 9565-9595, March 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Shiying Zhang, Fukun Su, Yi Wang, Songping Mai, Kong-Pang Pun, Xian Tang |
A Low-Power Keyword Spotting System With High-Order Passive Switched-Capacitor Bandpass Filters for Analog-MFCC Feature Extraction. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IEEE Trans. Circuits Syst. I Regul. Pap. ![In: IEEE Trans. Circuits Syst. I Regul. Pap. 70(11), pp. 4235-4248, November 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Silvestre Carvalho, Elsa Ferreira Gomes |
Automatic Classification of Bird Sounds: Using MFCC and Mel Spectrogram Features with Deep Learning. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Vietnam. J. Comput. Sci. ![In: Vietnam. J. Comput. Sci. 10(1), pp. 39-54, February 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | A. Suresh Rao, A. Pramod Reddy, Pragathi Vulpala, K. Shwetha Rani, P. Hemalatha |
Deep learning structure for emotion prediction using MFCC from native languages. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Int. J. Speech Technol. ![In: Int. J. Speech Technol. 26(3), pp. 721-733, September 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Mahdi Barhoush, Ahmed Hallawa, Anke Schmeink |
Speaker identification and localization using shuffled MFCC features and deep learning. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Int. J. Speech Technol. ![In: Int. J. Speech Technol. 26(1), pp. 185-196, March 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Mohammad Reza Hasanabadi |
MFCC-GAN Codec: A New AI-based Audio Coding. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2310.14300, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Mohammad Reza Hasanabadi, Majid Behdad, Davood Gharavian |
MFCCGAN: A Novel MFCC-Based Speech Synthesizer Using Adversarial Learning. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CoRR ![In: CoRR abs/2306.12785, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Ahmad Abbaskhah, Hamed Sedighi, Hossein Marvi |
Infant cry classification by MFCC feature extraction with MLP and CNN structures. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Biomed. Signal Process. Control. ![In: Biomed. Signal Process. Control. 86(Part B), pp. 105261, September 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Furqan Rustam, Abid Ishaq, Muhammad Shadab Alam Hashmi, Hafeez Ur Rehman Siddiqui, Luis Alonso Dzul López, Juan Castanedo Galán, Imran Ashraf |
Railway Track Fault Detection Using Selective MFCC Features from Acoustic Data. ![Search on Bibsonomy](Pics/bibsonomy.png) |
Sensors ![In: Sensors 23(16), pp. 7018, August 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Tariq Al-Maashani, Israel Mendonça, Masayoshi Aritsugi |
Age Classification Based on Voice Using Mel-Spectrogram and MFCC. ![Search on Bibsonomy](Pics/bibsonomy.png) |
DSP ![In: 24th International Conference on Digital Signal Processing, DSP 2023, Rhodes (Rodos), Greece, June 11-13, 2023, pp. 1-5, 2023, IEEE, 979-8-3503-3959-8. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Haohai Yu, Keyan He, Chen Ling, Yang Liu, Dihu Chen, Tao Su |
A $2.81\mu \mathrm{W}$, Energy Efficient MFCC Feature Extractor for Keyword-Spotting in 65nm CMOS. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICCCS ![In: 8th International Conference on Computer and Communication Systems, ICCCS 2023, Guangzhou, China, April 21-23, 2023, pp. 56-61, 2023, IEEE, 978-1-6654-5612-8. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Chen Sun, Yujie Wan, Peizhi Zhu, Fanqiang Lin |
A CNN Model for Gas Pipeline Leakage Detection Based on MFCC Feature Extraction. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICNCC ![In: Proceedings of the 12th International Conference on Networks, Communication and Computing, ICNCC 2023, Osaka, Japan, December 15-17, 2023, pp. 288-293, 2023, ACM. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Ashwini S. Ganakwar, Santosh K. Maher, R. R. Deshmukh |
Enhancing Sanskrit Isolated Word Recognition: A Comparative Analysis of MFCC and SVM Feature Integration. ![Search on Bibsonomy](Pics/bibsonomy.png) |
O-COCOSDA ![In: 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2023, Delhi, India, December 4-6, 2023, pp. 1-6, 2023, IEEE, 979-8-3503-4402-8. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Mohammad Reza Hasanabadi, Majid Behdad, Davood Gharavian |
MFCCGAN: A Novel MFCC-Based Speech Synthesizer Using Adversarial Learning. ![Search on Bibsonomy](Pics/bibsonomy.png) |
ICASSP ![In: IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2023, Rhodes Island, Greece, June 4-10, 2023, pp. 1-5, 2023, IEEE, 978-1-7281-6327-7. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Nur Fatinah Hafiz, Syamsiah Mashohor, M. H. S. E. M. A. Shazril, Mohd Fadlee Abdul Rasid, Azizi Ali |
Comparison of Mel Frequency Cepstral Coefficient (MFCC) and Mel Spectrogram Techniques to Classify Industrial Machine Sound. ![Search on Bibsonomy](Pics/bibsonomy.png) |
SKIMA ![In: 15th International Conference on Software, Knowledge, Information Management and Applications, SKIMA 2023, Kuala Lumpur, Malaysia, December 8-10, 2023, pp. 273-278, 2023, IEEE, 979-8-3503-1655-1. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Dongyu Wang, Canghong Shi, Junrong Li, Jiaxin Gan, Xianhua Niu, Ling Xiong |
M-GFCC: Audio Copy-Move Forgery Detection Algorithm Based on Fused Features of MFCC and GFCC. ![Search on Bibsonomy](Pics/bibsonomy.png) |
IAIC (1) ![In: IAIC (1), pp. 220-234, 2023. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Liyuan Guo, Matthias Jobst, Johannes Partzsch, Stefan Scholze, Andreas Dixius, Matthias Lohrmann, Seyed Mohammad Ali Zeinolabedin, Christian Mayr 0001 |
A Low-Power Hardware Accelerator of MFCC Extraction for Keyword Spotting in 22nm FDSOI. ![Search on Bibsonomy](Pics/bibsonomy.png) |
AICAS ![In: 5th IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2023, Hangzhou, China, June 11-13, 2023, pp. 1-5, 2023, IEEE, 979-8-3503-3267-4. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|
18 | Cai Li, Haochang Zhi, Long Chen, Kaiyue Yang, Junyi Qian, Zhihao Yan, Lixuan Zhu, Weiwei Shan |
A 608nW Near-Microphone Keyword-Spotting Chip Using Real-Point Serial FFT-Based MFCC and Temporal Depthwise Separable CNN in 28nm CMOS. ![Search on Bibsonomy](Pics/bibsonomy.png) |
CICC ![In: IEEE Custom Integrated Circuits Conference, CICC 2023, San Antonio, TX, USA, April 23-26, 2023, pp. 1-2, 2023, IEEE, 979-8-3503-9948-6. The full citation details ...](Pics/full.jpeg) |
2023 |
DBLP DOI BibTeX RDF |
|