Acoustic Source Localisation in constrained environments

dc.contributor.advisorBrown, Doctor Keith Edgar
dc.contributor.advisorSubr, Doctor Kartic
dc.contributor.authorVargas Vargas, Elizabeth
dc.date.accessioned2021-10-15T13:44:30Z
dc.date.available2021-10-15T13:44:30Z
dc.date.issued2020-02
dc.description.abstractAcoustic Source Localisation (ASL) is a problem with real-world applications across multiple domains, from smart assistants to acoustic detection and tracking. And yet, despite the level of attention in recent years, a technique for rapid and robust ASL remains elusive – not least in the constrained environments in which such techniques are most likely to be deployed. In this work, we seek to address some of these current limitations by presenting improvements to the ASL method for three commonly encountered constraints: the number and configuration of sensors; the limited signal sampling potentially available; and the nature and volume of training data required to accurately estimate Direction of Arrival (DOA) when deploying a particular supervised machine learning technique. In regard to the number and configuration of sensors, we find that accuracy can be maintained at state-of-the-art levels, Steered Response Power (SRP), while reducing computation sixfold, based on direct optimisation of well known ASL formulations. Moreover, we find that the circular microphone configuration is the least desirable as it yields the highest localisation error. In regard to signal sampling, we demonstrate that the computer vision inspired algorithm presented in this work, which extracts selected keypoints from the signal spectrogram, and uses them to select signal samples, outperforms an audio fingerprinting baseline while maintaining a compression ratio of 40:1. In regard to the training data employed in machine learning ASL techniques, we show that the use of music training data yields an improvement of 19% against a noise data baseline while maintaining accuracy using only 25% of the training data, while training with speech as opposed to noise improves DOA estimation by an average of 17%, outperforming the Generalised Cross-Correlation technique by 125% in scenarios in which the test and training acoustic environments are matched.en
dc.description.sponsorshipHeriot-Watt University James Watt Scholarship (JSW) in the School of Engineering & Physical Sciences.en
dc.identifier.urihttp://hdl.handle.net/10399/4352
dc.language.isoenen
dc.publisherHeriot-Watt Universityen
dc.publisherEngineering and Physical Sciencesen
dc.titleAcoustic Source Localisation in constrained environmentsen
dc.typeThesisen

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
VargasVargasE_0220_eps.pdf
Size:
25.85 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: