

Publications

Books

Cognitively Inspired Audiovisual Speech Filtering: Towards an Intelligent, Fuzzy Based, Multimodal, Two-Stage Speech Enhancement System

SpringerBriefs in Cognitive Computation, eBook available, August 2015

This book presents a summary of the cognitively inspired basis behind multimodal speech enhancement, covering the relationship between audio and visual modalities in speech, as well as recent research into audiovisual speech correlation. A number of audiovisual speech filtering approaches that make use of this relationship are also discussed. A novel multimodal speech enhancement system, making use of both visual and audio information to filter speech, is presented, and this book explores the extension of this system with the use of fuzzy logic to demonstrate an initial implementation of an autonomous, adaptive, and context aware multimodal system. This work also discusses the challenges of testing such a system and the limitations of many current audiovisual speech corpora, and outlines a suitable approach to developing a corpus designed to test this novel, cognitively inspired speech filtering system.

Publications

M. Fan, Q. Zhou, A. Abel, T.F. Zheng, R. Grishman. Probabilistic Belief Embedding for Large-scale Knowledge Population. Cognitive Computation, accepted July 2016.

Z. Yin, A. Abel, X. Zhang, B. Luo. Reversible Data Hiding in Encrypted Image Based on Block Histogram Shifting. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2016.

Z. Tu, A. Abel, L. Zhang, B. Luo, A. Hussain. A New Spatio-Temporal Saliency-Based Video Object Segmentation. Cognitive Computation, published online March 2016.

L. Xu, X. Niu, J. Xie, A. Abel. A Local-Global Mixed Kernel with Reproducing Property. Neurocomputing 168, pp. 190-199, 2015.

M.K. Yusof, A. Abel, M.Y. Saman, M.N.A. Rahman, J.A. Jusoh, S-Library: A Case Study of NFC Adoption for Information Science, New Library World, vol 116, no. 11/12, 2015.

A. Abel, D. Hunter, L. Smith. A Biologically Inspired Onset and Offset Speech Segmentation Approach. 2015 International Joint Conference on Neural Networks (IJCNN). IEEE, 2015.

S. Wang, T.J. Koickal, A. Hamilton, E. Mastropaolo, R. Cheung, A. Abel, L.S. Smith. A Power-Efficient Capacitive Read-Out Circuit with Parasitic-Cancellation for MEMS Cochlea Sensors. IEEE Transactions on Biomedical Circuits and Systems, vol. 10, no. 1, pp. 25-37.

A. Abel, A. Hussain, B. Luo, Cognitively Inspired Speech Processing for Multimodal Hearing Technology. IEEE CICARE 2014 (IEEE Symposium Series on Computational Intelligence), pp. 56-63, 2014.

A. Abel, A. Hussain. Novel Two-Stage Audiovisual Speech Filtering in Noisy Environments. Cognitive Computation, vol. 6, no. 2, pp. 200-217, 2014.

A. Abel and A. Hussain. Cognitively Inspired Fuzzy Based Audiovisual Speech Filtering. University of Stirling Technical Report, 2014.

A. Abel, A. Hussain, Q.D. Nguyen, F. Ringeval, M. Chetouani, and M. Milgram. Maximising audiovisual correlation with automatic lip tracking and vowel based segmentation. In Biometric ID Management and Multimodal Communication: Joint COST 2101 and 2102 International Conference, BioID_MultiComm 2009, Madrid, Spain, September 16-18, 2009, Proceedings, pages 65-72. Springer-Verlag, 2009.

S. Cifani, A. Abel, A. Hussain, S. Squartini, and F. Piazza. An investigation into audiovisual speech correlation in reverberant noisy environments. In Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions: COST Action 2102 International Conference Prague, Czech Republic, October 15-18, 2008, Revised Selected and Invited Papers, volume 5641, pages 331-343. Springer-Verlag, 2009.

A. Abel, A. Hussain. Multi-modal Speech Processing Methods: An Overview and Future Research Directions Using a MATLAB Based Audio-Visual Toolbox. Multimodal Signals: Cognitive and Algorithmic Issues: International School Vietri sul Mare, Italy, April 21-26, 2008. Revised Selected and Invited Papers, volume 5398, pages 121-129. Springer-Verlag, 2009.

M. Faundez-Zanuy, A. Hussain, J. Mekyska, E. Sesa-Nogueras, E. Monte-Moreno, A. Esposito, M. Chetouani, J. Garre-Olmo, A. Abel, Z. Smekal, K. Lopez-de-Ipiña. Biometric Applications Related to Human Beings: There Is Life beyond Security. Cognitive Computation, volume 4, pages 1-16. Springer-Verlag, 2012.

PhD Thesis

My completed PhD thesis is now available for viewing, and can be accessed here:

Towards an Intelligent Fuzzy Based Multimodal Two Stage Speech Enhancement System.

This thesis presents a novel two stage multimodal speech enhancement system, making use of both visual and audio information to filter speech, and explores the extension of this system with the use of fuzzy logic to demonstrate proof of concept for an envisaged autonomous, adaptive, and context aware multimodal system.