Sounds in real environments arise from many concurrent sources, with reverberation-induced smearing altering their spectrotemporal content. Ultimately, sound (including speech) arrives at the ear or the microphone as a time-varying signal whose components of interest lie roughly between 20 and 15,000 Hz: a mixture of all the sound sources, smeared by multiple reflections. How this signal should be processed to provide input to an interpreting system (which would presumably prefer to interpret only the signal of interest) is very much a matter of debate, particularly between those who use traditional MFCC techniques and those who prefer something more neurally inspired, such as sets of spike trains, perhaps together with feature detectors. How should the sounds from a particular source of interest be segregated? How can interpretation be made invariant to listening conditions? Whatever techniques are used, the result is a time series of some kind, and many neural techniques are of potential interest for different aspects of this problem, from deep neural networks to learning-based spiking neural systems.
This session builds on the IJCNN Special Session in 2011, organised by the late Harry Erwin.
Paper submission: January 15th, 2015 (extended to February 5th, 2015)
Paper decision notification: March 15th, 2015 (extended to March 25th, 2015)
Camera-ready submission: April 15th, 2015 (extended to April 25th, 2015)
Conference dates: July 12th–17th, 2015
Please contact the organisers if you have any questions at all!