For a list of completed PhD theses at C4DM, please visit our PhD theses page.

R Agrawal, D Wolff, and S Dixon. Structure-aware audio-to-score alignment using progressively dilated convolutional neural networks. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, Jun 2021. [ bib | DOI ]
H Bear, V Morfi, and E Benetos. An evaluation of data augmentation methods for sound scene geotagging. pages 581-585. Brno, Czech Republic, International Speech and Communication Association (ISCA), Aug 2021. [ bib | DOI | http ]
A Benito Temprano and AP McPherson. A TMR angle sensor for gesture acquisition and disambiguation on the electric guitar. University of Trento (Italy) [Online], Sep 2021. [ bib ]
RPP Bodo, E Benetos, and M Queiroz. A framework for music similarity and cover song identification. Tokyo, Japan, Nov 2021. [ bib | http ]
KW Cheuk, Y-J Luo, E Benetos, and D Herremans. Revisiting the onsets and frames model with additive attention. IEEE, Jul 2021. [ bib | DOI | http ]
A Clemente, MT Pearce, and M Nadal. Musical aesthetic sensitivity. Psychology of Aesthetics Creativity and the Arts, Mar 2021. [ bib | DOI ]
A Clemente, MT Pearce, M Skov, and M Nadal. Evaluative judgment across domains: Liking balance, contour, symmetry and complexity in melodies and visual designs. Brain and Cognition, 151, Jul 2021. [ bib | DOI ]
JT Colonel and J Reiss. Reverse engineering of a recording mix with differentiable digital signal processing. Journal of the Acoustical Society of America, 150(1):608-619, Jul 2021. [ bib | DOI ]
A Daniele, C Di Bernardi Luft, and N Bryan-Kinns. “What is human?” A Turing test for artistic creativity. volume 12693 LNCS, pages 396-411. Jan 2021. [ bib | DOI ]
R de Fleurian and MT Pearce. The relationship between valence and chills in music: A corpus analysis. i-Perception, 12(4):1-11, Jul 2021. [ bib | DOI ]
J Del-Bosque-Trevino, M Purver, and J Hough. Investigating the semantic wave in tutorial dialogues: An annotation scheme and corpus study on analogy components. Brandeis University, Waltham, MA, USA. [ bib ]
E Demirel, S Ahlbäck, and S Dixon. Low resource audio-to-lyrics alignment from polyphonic music recordings. volume 00, pages 586-590, Jun 2021. [ bib | DOI ]
Y Fang, J Ou, N Bryan-Kinns, Q Kang, J Zhang, and B Guo. Using vibrotactile device in music therapy to support wellbeing for people with Alzheimer’s disease. volume 261, pages 353-361. Jan 2021. [ bib | DOI ]
C Ford, N Bryan-Kinns, and C Nash. Creativity in children's digital music composition. NYU Shanghai, Shanghai, Jun 2021. [ bib ]
L Gabrielli, G Fazekas, and J Nam. Special issue on deep learning for applications in acoustics: Modeling, synthesis, and listening. Applied Sciences (Switzerland), 11(2):1-4, Jan 2021. [ bib | DOI ]
Y Gan, X Chen, Q Huang, M Purver, JR Woodward, J Xie, and P Huang. Towards robustness of text-to-SQL models against synonym substitution. In C Zong, F Xia, W Li, and R Navigli, editors, ACL/IJCNLP (1), pages 2505-2515. Association for Computational Linguistics, 2021. [ bib | http ]
M Graf, HC Opara, and M Barthet. An audio-driven system for real-time music visualisation. Jun 2021. [ bib | http ]
ETR Hall and MT Pearce. A model of large-scale thematic structure. Journal of New Music Research, 50(3):220-241, May 2021. [ bib | DOI ]
NC Hansen, H Kragness, P Vuust, L Trainor, and M Pearce. Predictive uncertainty underlies auditory-boundary perception. Psychological Science. [ bib ]
PMC Harrison, R Bianco, M Chait, and MT Pearce. Erratum: PPM-Decay: A computational model of auditory prediction with memory decay (PLoS Comput Biol (2021) 16: 11 (e1008304) DOI: 10.1371/journal.pcbi.1008304). PLoS Computational Biology, 17(5), May 2021. [ bib | DOI ]
B Hayes, C Saitis, and G Fazekas. Neural waveshaping synthesis. Online. [ bib | http ]
PGT Healey. Human-like communication. In Human-Like Machine Intelligence, pages 137-151. Jul 2021. [ bib | DOI ]
A Holzapfel, E Benetos, A Killick, and R Widdess. Humanities and engineering perspectives on music transcription. Digital Scholarship in the Humanities. [ bib | http ]
C Jing, N Bryan-Kinns, S Yang, J Zhi, and J Zhang. The influence of mobile phone location and screen orientation on driving safety and the usability of car-sharing software in-car use. International Journal of Industrial Ergonomics, 84, Jul 2021. [ bib | DOI ]
M Karan, P Khare, P Healey, and M Purver. Mitigating topic bias when detecting decisions in dialogue. Jul 2021. [ bib | .pdf ]
T Kirby and M Sandler. The evolution of drum modes with strike intensity: Analysis and synthesis using the discrete cosine transform. Journal of the Acoustical Society of America, 150(1):202, Jul 2021. [ bib | DOI | http ]
S Krishnan, D Carey, F Dick, and MT Pearce. Effects of statistical learning in passive and active contexts on reproduction and recognition of auditory sequences. Journal of Experimental Psychology: General. [ bib | DOI ]
MN Lefford, G Bromham, G Fazekas, and D Moffat. Context-aware intelligent mixing systems. AES: Journal of the Audio Engineering Society, 69(3):128-141, Mar 2021. [ bib | DOI ]
S Li, Y Jing, and G Fazekas. A novel dataset for the identification of computer generated melodies in the CSMT challenge. volume 761 LNEE, pages 177-186. Jan 2021. [ bib | DOI ]
A Liang, R Stewart, R Freire, and N Bryan-Kinns. Knit stretch sensor placement for body movement sensing. In TEI 2021 - Proceedings of the 15th International Conference on Tangible, Embedded, and Embodied Interaction, Feb 2021. [ bib | DOI ]
L Liu and E Benetos. From audio to music notation. In ER Miranda, editor, Handbook of Artificial Intelligence for Music, number 24 in Artificial Intelligence, pages 693-714. Springer International Publishing, Cham, Switzerland, 1st edition, Aug 2021. [ bib | DOI | http ]
L Liu, G-V Morfi, and E Benetos. Joint multi-pitch detection and score transcription for polyphonic piano music. Toronto, Canada, IEEE, Jun 2021. [ bib | DOI | http ]
S Löbbers, M Barthet, and G Fazekas. Sketching sounds: an exploratory study on sound-shape associations. Santiago de Chile, Chile. [ bib ]
I Manco, E Benetos, E Quinton, and G Fazekas. MusCaps: Generating captions for music audio. IEEE, Jul 2021. [ bib | DOI | http ]
J Miller, V Nicosia, and M Sandler. Discovering common practice: Using graph theory to compare harmonic sequences in musical audio collections. In ACM International Conference Proceeding Series, pages 93-97, Jul 2021. [ bib | DOI ]
V Morfi, RF Lachlan, and D Stowell. Deep perceptual embeddings for unlabelled animal sound events. Journal of the Acoustical Society of America, 150(1):2-11, Jul 2021. [ bib | DOI ]
G Moro and AP McPherson. Performer experience on a continuous keyboard instrument. Computer Music Journal, 44(2-3):69-91, Jul 2021. [ bib | DOI ]
AN Nagele, V Bauer, PGT Healey, JD Reiss, H Cooke, T Cowlishaw, C Baume, and C Pike. Interactive audio augmented reality in participatory performance. Frontiers in Virtual Reality, 1:610320, Feb 2021. [ bib | DOI ]
S Nasreen, J Hough, and M Purver. Rare-class dialogue act tagging for Alzheimer's disease diagnosis. Jul 2021. [ bib | .pdf ]
S Nasreen, M Rohanian, J Hough, and M Purver. Alzheimer’s dementia recognition from spontaneous speech using disfluency and interactional features. Frontiers in Computer Science, 3:640669-640669, Jun 2021. [ bib | DOI ]
A Nonnis and N Bryan-Kinns. Olly: A tangible for togetherness. International Journal of Human Computer Studies, 153, Apr 2021. [ bib | DOI | http ]
K O'Hanlon, E Benetos, and S Dixon. Detecting cover songs with pitch class key-invariant networks. Gold Coast, Queensland, Australia, IEEE, Oct 2021. [ bib | http ]
K O'Hanlon and M Sandler. FifthNet: Structured compact neural networks for automatic chord recognition. IEEE/ACM Transactions on Audio Speech and Language Processing, 29:2671-2682, Jan 2021. [ bib | DOI ]
Y Ozaki, J McBride, E Benetos, PQ Pfordresher, J Six, AT Tierney, P Proutskova, E Sakai, H Kondo, H Fukatsu, S Fujii, and PE Savage. Agreement among human and annotated transcriptions of global songs. International Society for Music Information Retrieval, Nov 2021. [ bib | http ]
EE Ozkan, T Gurion, J Hough, PGT Healey, and L Jamone. Specific hand motion patterns correlate to miscommunications during dyadic conversations. In IEEE International Conference on Development and Learning, ICDL 2021, Aug 2021. [ bib | DOI ]
S Park, PGT Healey, and A Kaniadakis. Should robots blush? In Conference on Human Factors in Computing Systems - Proceedings, May 2021. [ bib | DOI ]
A Pelicon, R Shekhar, M Martinc, B Škrlj, M Purver, and S Pollak. Zero-shot cross-lingual content filtering: Offensive language and hate speech detection. pages 30-34. Kyiv (online), Apr 2021. [ bib ]
A Pelicon, R Shekhar, B Škrlj, M Purver, and S Pollak. Investigating cross-lingual training for offensive language detection. PeerJ Comput. Sci., 7:e559-e559, Jun 2021. [ bib ]
LD Pham, H Phan, R Palaniappan, A Mertins, and I McLoughlin. CNN-MoE based framework for classification of respiratory anomalies and lung disease detection. IEEE Journal of Biomedical and Health Informatics, PP, Mar 2021. [ bib | DOI | http ]
H Phan, OY Chen, MC Tran, P Koch, A Mertins, and M De Vos. XSleepNet: Multi-view sequential model for automatic sleep staging. IEEE Trans Pattern Anal Mach Intell, PP, Mar 2021. [ bib | DOI | http ]
S Pollak, M Robnik-Šikonja, M Purver, M Boggia, R Shekhar, M Pranjić, S Salmela, I Krustok, T Paju, C-G Linden, L Leppänen, E Zosa, M Ulčar, L Freiental, S Traat, LA Cabrera-Diego, M Martinc, N Lavrač, B Škrlj, M Žnidaršič, A Pelicon, B Koloski, V Podpečan, J Kranjc, S Sheehan, E Boros, J Moreno, A Doucet, and H Toivonen. EMBEDDIA tools, datasets and challenges: Resources and hackathon contributions. pages 99-109. Kyiv (online), Apr 2021. [ bib ]
M Purver, M Sadrzadeh, R Kempson, G Wijnholds, and J Hough. Incremental composition in distributional semantics. Journal of Logic, Language and Information, Jul 2021. [ bib | DOI | http ]
DR Quiroga-Martinez, NC Hansen, A Højlund, M Pearce, E Brattico, E Holmes, K Friston, and P Vuust. Musicianship and melodic predictability enhance neural gain in auditory cortex during pitch deviance detection. Human Brain Mapping, Jan 2021. [ bib | DOI ]
A Ragano, E Benetos, and A Hines. More for less: Non-intrusive speech quality assessment with limited annotations. Jun 2021. [ bib | DOI ]
J Ratcliffe, F Soave, N Bryan-Kinns, L Tokarchuk, and I Farkhatdinov. Extended reality (XR) remote research: A survey of drawbacks and opportunities. Conference on Human Factors in Computing Systems - Proceedings, May 2021. [ bib | DOI ]
J Ratcliffe, F Soave, N Bryan-Kinns, L Tokarchuk, and I Farkhatdinov. Extended reality (XR) remote research: A survey of drawbacks and opportunities. In Y Kitamura, A Quigley, K Isbister, T Igarashi, P Bjørn, and SM Drucker, editors, CHI, pages 527:1-527:1. ACM, 2021. [ bib | http ]
J Ratcliffe, F Soave, M Hoover, FR Ortega, N Bryan-Kinns, L Tokarchuk, and I Farkhatdinov. Remote XR studies: Exploring three key challenges of remote XR experimentation. In Y Kitamura, A Quigley, K Isbister, and T Igarashi, editors, CHI Extended Abstracts, pages 121:1-121:1. ACM, 2021. [ bib | http ]
CN Reed and AP McPherson. Surface electromyography for sensing performance intention and musical imagery in vocalists. In TEI 2021 - Proceedings of the 15th International Conference on Tangible, Embedded, and Embodied Interaction, Feb 2021. [ bib | DOI ]
JD Reiss, HE Tez, and R Selfridge. A comparative perceptual evaluation of thunder synthesis techniques. In 150th Audio Engineering Society Convention, AES 2021, Jan 2021. [ bib ]
N Robson, N Bryan-Kinns, and A McPherson. On mediating space, sound and experience: interviews with situated sound art practitioners. Organised Sound: an international journal of music and technology, 28(1). [ bib ]
M Rohanian, J Hough, and M Purver. Multi-modal fusion with gating using audio, lexical and disfluency features for Alzheimer's dementia recognition from spontaneous speech. In CoRR, volume abs/2106.09668, 2021. [ bib ]
S Sarkar, E Benetos, and M Sandler. Vocal harmony separation using time-domain neural networks. pages 3515-3519. Brno, Czech Republic, Aug 2021. [ bib | DOI ]
R Shukla, R Stewart, and M Sandler. User HRTF selection for 3D auditory mixed reality. Online, Jun 2021. [ bib | DOI ]
S Singh, H Bear, and E Benetos. Prototypical networks for domain adaptation in acoustic scene classification. Toronto, Canada, IEEE, Jun 2021. [ bib | DOI | .html ]
S Skach, R Stewart, and PGT Healey. Sensing social behavior with smart trousers. IEEE Pervasive Computing, 20(3):30-40, Jul 2021. [ bib | DOI ]
F Soave, I Farkhatdinov, and N Bryan-Kinns. Multisensory teleportation in virtual reality applications. In Proceedings - 2021 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, VRW 2021, pages 377-379, Mar 2021. [ bib | DOI ]
F Soave, A Padma Kumar, N Bryan-Kinns, and I Farkhatdinov. Exploring terminology for perception of motion in virtual reality. In DIS 2021 - Proceedings of the 2021 ACM Designing Interactive Systems Conference: Nowhere and Everywhere, pages 171-179, Jun 2021. [ bib | DOI ]
CJ Steinmetz and JD Reiss. Pyloudnorm: A simple yet flexible loudness meter in Python. In 150th Audio Engineering Society Convention, AES 2021, Jan 2021. [ bib ]
V Subramanian, S Gururani, E Benetos, and M Sandler. Anomalous behaviour in loss-gradient based interpretability methods. May 2021. [ bib ]
M Tenderini, E De Leeuw, T Eilola, and M Pearce. Reduced cross-modal affective priming in the L2 of late bilinguals depends on L2 exposure. Journal of Experimental Psychology: Learning, Memory, and Cognition. [ bib ]
L Turchet, D Baker, and T Stockman. Musical haptic wearables for synchronisation of visually-impaired performers: A co-design approach. In IMX 2021 - Proceedings of the 2021 ACM International Conference on Interactive Media Experiences, pages 20-27, Jun 2021. [ bib | DOI ]
C Vahidi, G Fazekas, and C Saitis. A modulation front-end for music audio tagging. Jul 2021. [ bib ]
C Vianna Lordelo, E Benetos, S Dixon, and S Ahlbäck. Pitch-informed instrument assignment using a deep convolutional network with multiple kernel shapes. Nov 2021. [ bib | http ]
C Vianna Lordelo, E Benetos, S Dixon, S Ahlbäck, and P Ohlsson. Adversarial unsupervised domain adaptation for harmonic-percussive source separation. IEEE Signal Processing Letters, 28:81-85, Jan 2021. [ bib | DOI ]
S Yang, CN Reed, E Chew, and M Barthet. Examining emotion perception agreement in live music performance. IEEE Transactions on Affective Computing, Jan 2021. [ bib | DOI ]
Y Zhang, G Xia, M Levy, and S Dixon. COSMIC: A conversational interface for human-AI music co-creation. Apr 2021. [ bib ]
Y Zhao, C Wang, G Fazekas, E Benetos, and M Sandler. Violinist identification based on vibrato features. EURASIP, Aug 2021. [ bib | http ]

