Publications

Ph.D. Thesis

Sethu, V., (2010) “Automatic emotion recognition: An investigation of acoustic and prosodic parameters”, PhD thesis, the School of Electrical Engineering and Telecommunications, the University of New South Wales (UNSW), Sydney, Australia. [Thesis]

Book Chapters

Sethu, V., Epps, J., and Ambikairajah, E., (2015) “Speech based emotion recognition”, in Ogunfunmi, T., Togneri, R., and Narasimha, M. (eds), Speech and Audio Processing for Coding, Enhancement and Recognition, Springer.

Ambikairajah, E., Sethu, V., Eaton, R., Sheng, M., (2013) “Evolving Use of Educational Technologies - Enhancing Lectures”, in Using Technology Tools to Innovate Assessment, Reporting, and Teaching Practices in Engineering Education, IGI Global.

Journal Articles

Sethu, V., Ambikairajah, E., Epps, J., (2013) “On the use of speech parameter contours for emotion recognition”, EURASIP Journal on Audio, Speech, and Music Processing, vol. 2013:19, [http://dx.doi.org/10.1186/1687-4722-2013-19]

Ambikairajah, E., Li, H., Wang, L., Yin, B., Sethu, V., (2011) “Language Identification: A Tutorial”, IEEE Circuits and Systems Magazine, vol. 11, no. 2, pp. 82-108. [http://dx.doi.org/10.1109/MCAS.2011.941081]

Le, P.N., Ambikairajah, E., Epps, J., Sethu, V., Choi, E.H.C., (2011) “Investigation of spectral centroid features for cognitive load classification”, Speech Communication, vol. 53, pp. 540-551. [http://dx.doi.org/10.1016/j.specom.2011.01.005]

Sethu, V., Ambikairajah, E., Ge, L. (2008) “Selective weighting of undecimated wavelet coefficients for noise reduction in SAR interferograms”, EURASIP Journal on Advances in Signal Processing, vol. 2008, Article ID 378092. [http://dx.doi.org/10.1155/2008/378092]

Meng, D., Sethu, V., Ambikairajah, E., Ge, L. (2007) “A novel technique for noise reduction in InSAR images”, IEEE Geoscience and Remote Sensing Letters, pp. 226-230. [http://dx.doi.org/10.1109/LGRS.2006.888845]

Conference Papers

Cummins, N., Epps, J., Sethu, V., & Krajewski, J., (2015) “Weighted Pairwise Gaussian Likelihood Regression for Depression Score Prediction”, to appear in the Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015. [Preprint]

Kua, K., Sethu, V., Le, P., & Ambikairajah, E. (2014) “The UNSW Submission to INTERSPEECH 2014 ComParE Cognitive Load Challenge.” Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech), Singapore. [Preprint]

Cummins, N., Sethu, V., Epps, J., & Krajewski, J. (2014) “Probabilistic Acoustic Volume Analysis for Speech Affected by Depression”, Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech), Singapore. [Preprint]

Cummins, N., Epps, J., Sethu, V., & Krajewski, J. (2014) “Variability compensation in small data: Oversampled extraction of I-vectors for the classification of depressed speech”, to appear in the Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014.

Sethu, V., Epps, J., Ambikairajah, E., & Li, H. (2013) “GMM Based Speaker Variability Compensated System for Interspeech 2013 ComParE Emotion Challenge”, Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech), Lyon, France. [Preprint]

Cummins, N., Epps, J., Sethu, V., Breakspear, M., & Goecke, R. (2013) “Modeling Spectral Variability for the Classification of Depressed Speech”, Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech), Lyon, France. [Preprint]

Cummins, N., Joshi, J., Dhall, A., Sethu, V., Goecke, R., and Epps, J., (2013) “Diagnosis of Depression by Behavioural Signals: A Multimodal Approach”, Proceedings of ACM Multimedia-2013. [Preprint] [http://dx.doi.org/10.1145/2512530.2512535]

Sethu, V., Epps, J., and Ambikairajah, E., (2013) “Speaker variability in speech based emotion models – Analysis and normalisation”, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, pp. 7522-7526. [Preprint] [http://dx.doi.org/10.1109/ICASSP.2013.6639125]

Ambikairajah, E., Kua, J. M. K., Sethu, V., and Li, H., (2012) “PNCC-ivector-SRC based speaker verification”, Proceedings of the 2012 Asia-Pacific Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 1-7. [Preprint]

Ding, N., Sethu, V., Epps, J. and Ambikairajah, E., (2012) “Speaker variability in emotion recognition – An adaptation based approach”, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2012, pp. 5101-5104. [Preprint] [http://dx.doi.org/10.1109/ICASSP.2012.6289068]

Le, P.N., Sethu, V., Ambikairajah, E., and Kua, J. M. K., (2011) “Investigation of the robustness of a non-uniform filterbank for cognitive load classification”, Proceedings of 8th International Conference on Information, Communications and Signal Processing (ICICS) 2011. [http://dx.doi.org/10.1109/ICICS.2011.6174268]

Ibrahim, R. K., Sethu, V. and Ambikairajah, E. (2010) “Novel delta zero crossing regression features for gait pattern classification”, Proceedings of Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2010, pp. 2427-2430. [http://dx.doi.org/10.1109/IEMBS.2010.5626275]

Sethu, V., Ambikairajah, E. and Epps, J. (2009) “Pitch contour parameterisation based on linear stylisation for emotion recognition”, Proceedings of INTERSPEECH-09, pp. 2011-2014. [Preprint]

Sethu, V., Ambikairajah, E., and Epps, J., (2009) “Speaker dependency of spectral features and speech production cues for automatic emotion classification”, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2009, pp. 4693-4696. [pdf] [http://dx.doi.org/10.1109/ICASSP.2009.4960678]

Sethu, V., Ambikairajah, E. and Epps, J. (2008) “Phonetic and speaker variations in automatic emotion classification”, Proceedings of INTERSPEECH-08, pp. 617-620. [Preprint]

Sethu, V., Ambikairajah, E., and Epps, J., (2008) “Empirical mode decomposition based weighted frequency feature for speech based emotion classification”, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2008, pp. 5017-5020. [pdf] [http://dx.doi.org/10.1109/ICASSP.2008.4518785]

Le, P. N., Ambikairajah, E., and Sethu, V., (2008) “Speech enhancement based on empirical mode decomposition”, Proceedings of 5th IASTED International Conference, pp. 207-210.

Sethu, V., Ambikairajah, E. and Epps, J. (2007) “Group Delay Features for Emotion Detection”, Proceedings of INTERSPEECH-07, pp. 2273-2276. [pdf]

Sethu, V., Ambikairajah, E. and Epps, J. (2007) “Speaker Normalisation for Speech-based Emotion Detection”, Proceedings of the 15th International Conference on Digital Signal Processing 2007, pp. 611-614. [pdf] [http://dx.doi.org/10.1109/ICDSP.2007.4288656]

Wang, Y., An, J., Sethu, V., and Ambikairajah, E. (2007) “Perceptually motivated pre-filter for speech enhancement using Kalman filtering,” Proceedings of the 6th IEEE International Conference on Information, Communications and Signal Processing. [http://dx.doi.org/10.1109/ICICS.2007.4449758]

Sethu, V., Ambikairajah, E., Ge, L. (2006) “Noise reduction in SAR interferograms using undecimated wavelet transform,” Proceedings of the 2nd International Symposium on Geo-Information for Disaster Management.