CloudAIBus: a testbed for AI based cloud computing environments

Velu, S; Gill, SS; Murugesan, SS; Wu, H; Li, X

Accepted version
Embargoed until: 2025-06-05

Publisher

Springer Science and Business Media LLC

Publisher URL

http://dx.doi.org/10.1007/s10586-024-04562-9

DOI

10.1007/s10586-024-04562-9

Journal

Cluster Computing

ISSN

1386-7857

Abstract

Smart resource allocation is essential for optimising the efficiency and utilisation of cloud computing, but it is challenging because traditional approaches often over-provision CPU resources, which leads to unnecessary cost. Recently developed Artificial Intelligence (AI) techniques have the potential to solve this problem efficiently; for example, deep learning models can forecast resource usage accurately, allowing resources to be distributed more efficiently. Despite these encouraging advances, the dynamic scaling potential of these AI models has not been thoroughly investigated. To address this gap, we developed CloudAIBus, a new testbed for AI-driven cloud computing environments that supports effective resource allocation. CloudAIBus employs a deep learning model named DeepAR to forecast CPU usage so that cost-effective resource allocation decisions can be made. We implement the DeepAR model using Amazon SageMaker, a platform that provides the infrastructure for scalable and efficient training. We evaluated the performance of the DeepAR-based resource management approach (CloudAIBus) using Google Colab, and the results show that the proposed approach outperforms the baselines (LSTM- and ARIMA-based resource management) in terms of Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE) and Mean Squared Error (MSE). Relative to the provisioning recorded in the GWA-T-12 dataset, the proposed approach cut the percentage of unused CPUs from 98.65% to 32.35%, which shows that its accurate predictions are effective at reducing over-provisioning.

URI

https://qmro.qmul.ac.uk/xmlui/handle/123456789/97360

Collections

Electronic Engineering and Computer Science

Licence information

This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: https://doi.org/10.1007/s10586-024-04562-9