Electronic Engineering and Computer Science

Social robots delivering public speeches have a wide range of practical applications as stand-ins for educators, experts, or entertainers. The goal of our work is to investigate how a social robot should be programmed to ...

Data-driven Behavioural and Affective Nudging of Online Learners: System Architecture and Design

Bourguet, M-L; Urakami, J; Venture, G (2022)

Social Robots that can Sense and Improve Student Engagement

Bourguet, M-L; Jin, Y; Shi, Y; Chen, Y; Rincon-Ardila, L; Venture, G (2020)

Virtual and Augmented Reality for Teaching Materials Science: a Students as Partners and as Producers Project

Bourguet, M-L; Wang, X; Ran, Y; Zhou, Z; Zhang, Y; Romero-Gonzalez, M (2020)

A Mapping Strategy for Interacting with Latent Audio Synthesis Using Artistic Materials

Zheng, S; Xambó, A; Bryan-Kinns, N; Explainable AI for the Arts Workshop 2024 (XAIxArts 2024)

This paper presents a mapping strategy for interacting with the latent spaces of generative AI models. Our approach involves using unsupervised feature learning to encode a human control space and mapping it to an audio ...

From Audio Encoders to Piano Judges: Benchmarking Performance Understanding for Solo Piano

Zhang, H; Liang, J; Dixon, S; International Society for Music Information Retrieval Conference

Computing: Looking Back and Moving Forward

Golec, M; Gill, SS; 21st International Conference on Smart Business Technologies (SCITEPRESS - Science and Technology Publications, 2024-07-15)

The Internet and computer commercialization have transformed the computing systems area over the past sixty years, affecting society. Computer systems have evolved to meet diverse social needs thanks to technological ...

Advancing AI in Music Composition: Refining the Generative Music Overpainting Task

Row, E; Fazekas, G; AES International Symposium on AI and the Musician

Real-time Timbre Remapping with Differentiable DSP

Shier, J; Saitis, C; Robertson, A; McPherson, A; New Interfaces for Musical Expression

Timbre is a primary mode of expression in diverse musical contexts. However, prevalent audio-driven synthesis methods predominantly rely on pitch and loudness envelopes, effectively flattening timbral expression from the ...

Improved Fine-Tuning by Better Leveraging Pre-Training Data

Liu, Z; Xu, Y; Xu, Y; Qian, Q; Li, H; Ji, X; Chan, AB; Jin, R (2022-01-01)

As a dominant paradigm, fine-tuning a pre-trained model on the target data is widely used in many deep learning applications, especially for small data sets. However, recent studies have empirically shown that training ...

Retrieval-Augmented Multiple Instance Learning

Cui, Y; Liu, Z; Chen, Y; Lu, Y; Yu, X; Liu, X; Kuo, TW; Rodrigues, MRD; Xue, CJ; Chan, AB (2023-01-01)

Multiple Instance Learning (MIL) is a crucial weakly supervised learning method applied across various domains, e.g., medical diagnosis based on whole slide images (WSIs). Recent advancements in MIL algorithms have yielded ...

A Secure Image Watermarking Framework with Statistical Guarantees via Adversarial Attacks on Secret Key Networks

Chen, F; Lin, W; Liu, Z; Chan, A; European Conference on Computer Vision

Resource-Efficient Convolutional Networks: A Survey on Model-, Arithmetic-, and Implementation-Level Techniques

Lee, JK; Mukhanov, L; Molahosseini, AS; Minhas, U; Hua, Y; Martinez Del Rincon, J; Dichev, K; Hong, CH; Vandierendonck, H (2023-07-13)

Convolutional neural networks (CNNs) are used in our daily life, including self-driving cars, virtual assistants, social network services, healthcare services, and face recognition, among others. However, deep CNNs demand ...