TU Darmstadt / ULB / TUprints

Load Balancing in Compute Clusters With Delayed Feedback

Tahir, Anam ; Alt, Bastian ; Rizk, Amr ; Koeppl, Heinz (2024)
Load Balancing in Compute Clusters With Delayed Feedback.
In: IEEE Transactions on Computers, 2023, 72 (6)
doi: 10.26083/tuprints-00026536
Article, Secondary publication, Publisher's Version

[img] Text
Load_Balancing_in_Compute_Clusters_With_Delayed_Feedback.pdf
Copyright Information: CC BY 4.0 International - Creative Commons, Attribution.

Download (1MB)
Item Type: Article
Type of entry: Secondary publication
Title: Load Balancing in Compute Clusters With Delayed Feedback
Language: English
Date: 30 September 2024
Place of Publication: Darmstadt
Year of primary publication: 2023
Place of primary publication: [Erscheinungsort nicht ermittelbar]
Publisher: IEEE
Journal or Publication Title: IEEE Transactions on Computers
Volume of the journal: 72
Issue Number: 6
Collation: 13 Seiten
DOI: 10.26083/tuprints-00026536
Corresponding Links:
Origin: Secondary publication service
Abstract:

Load balancing arises as a fundamental problem, underlying the dimensioning and operation of many computing and communication systems, such as job routing in data center clusters, multipath communication, Big Data and queueing systems. In essence, the decision-making agent maps each arriving job to one of the possibly heterogeneous servers while aiming at an optimization goal such as load balancing, low average delay or low loss rate. One main difficulty in finding optimal load balancing policies here is that the agent only partially observes the impact of its decisions, e.g., through the delayed acknowledgements of the served jobs. In this paper, we provide a partially observable (PO) model that captures the load balancing decisions in parallel buffered systems under limited information of delayed acknowledgements. We present a simulation model for this PO system to find a load balancing policy in real-time using a scalable Monte Carlo tree search algorithm. We numerically show that the resulting policy outperforms other limited information load balancing strategies such as variants of Join-the-Most-Observations and has comparable performance to full information strategies like: Join-the-Shortest-Queue, Join-the-Shortest-Queue(d) and Shortest-Expected-Delay. Finally, we show that our approach can optimise the real-time parallel processing by using network data provided by Kaggle.

Uncontrolled Keywords: Parallel systems, load balancing, partial observability
Status: Publisher's Version
URN: urn:nbn:de:tuda-tuprints-265367
Classification DDC: 000 Generalities, computers, information > 004 Computer science
600 Technology, medicine, applied sciences > 621.3 Electrical engineering, electronics
Divisions: 18 Department of Electrical Engineering and Information Technology > Self-Organizing Systems Lab
Date Deposited: 30 Sep 2024 09:48
Last Modified: 29 Oct 2024 08:12
URI: https://tuprints.ulb.tu-darmstadt.de/id/eprint/26536
PPN: 522453295
Export:
Actions (login required)
View Item View Item