Seminar

Iran Roman, Lecturer at QMUL, on Advancing Multimodal Machine Perception Through Neural Dynamics and Music AI

January 26, 2026

Fri 6th Feb 2026, 11:30am, KCL Strand Campus, Strand Building S2.49

Iran Roman will give a talk at the KCL Strand Campus as part of the MARC Seminar Series, with refreshments provided courtesy of the NMES Research Culture Fund.

Title: Advancing Multimodal Machine Perception Through Neural Dynamics and Music AI

If you are unable to attend in person, you may use the following MS Teams link to attend the event virtually: MARC Seminar Talk – Iran Roman | Meeting-Join | Microsoft Teams

Abstract

His research integrates three complementary perspectives: multimodal machine perception, theoretical neuroscience, and music AI. In multimodal perception, he develops systems that process audio, visual, and spatial information to understand complex environments, from egocentric action recognition in augmented reality to spatial sound event localization. This work addresses fundamental challenges in cross-modal learning and real-time scene understanding. In theoretical neuroscience, he investigates how neural dynamics explain periodic behaviors, proposing that brain–body systems physically embody temporal structure through resonance and dynamical coupling rather than through predictive models. This framework reveals how spontaneous motor tempo affects synchronization and how delayed feedback shapes anticipatory behavior in rhythmic coordination. In music AI, he probes the perceptual and reasoning capabilities of large language models, revealing persistent gaps between symbolic and acoustic understanding. He evaluates fundamental music perception skills and develops benchmark datasets that expose limitations in current systems’ ability to truly “hear” rather than merely read music. Together, these perspectives advance understanding of intelligent systems that perceive, reason about, and interact with temporal multimedia information.

Speaker

Iran R. Roman is a Lecturer at the School of Electrical Engineering and Computer Science of Queen Mary University of London. Within Queen Mary, he is a member of the Center for Multimodal AI, Center for Digital Music, Center for Human-Centered Computing, the Computer Vision group, and the Cognitive Science group. His research area is machine perception, with the goal of creating algorithms that allow computers to perceive environments as living agents do. To this end, Iran has developed algorithms that leverage multimodal signals to sense, identify, and track objects in the real world. These algorithms draw inspiration from the neural mechanisms that allow living organisms to carry out similar tasks. Iran’s work has found applications in products at companies such as Apple, Tesla, Raytheon/BBN, and Plantronics. His research has been funded by the US National Science Foundation (NSF), the US Defense Advanced Research Projects Agency (DARPA), and the Howard Hughes Medical Institute (HHMI).

Tagged: LLM, Machine Perception, Multimodal, Music AI

Event

Evan Benway and Jeff Larson (CEO & CPO Moodsonic) on Adaptive Soundscaping as a Building System (2 Feb 2026)

Natalia Cotic January 26, 2026

Seminar

Nithya Shikarpur, MIT PhD student, on Generative Modelling and Interactive Performance for Hindustani Music (21 Oct 2025)

Natalia Cotic October 21, 2025

Seminar

DongMin Kim, student from Sogang University, on music computing with traditional Korean sources in DDH Lunchtime Seminar (29 May 2025)

Julie Meyer May 28, 2025

Seminar

Robert Laidlow, AI+ Fellow, on creative AI in musical composition and performance (28 Jan 2026)

E C January 30, 2026


🅜🅐🅡🅒 Music and Acoustics Research Centre
