Data-driven hospitals staff and resources allocation using agent-based simulation and deep reinforcement learning

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Hospital staff and resources allocation (HSRA) is a critical challenge in healthcare systems, as it involves balancing the demands of patients, the availability of resources, and the need to provide high-quality health in resource-bounded settings. Traditional approaches to HSRA have relied on manual planning and ad-hoc adjustments, which can be time-consuming and usually lead to sub-optimal outcomes. Recent studies show that machine learning solutions are able to produce better HSRA results compared to manual planning. However, these outcomes usually focused on a single hospital and objective. In this paper, we solve the HSRA task using a novel agent-based simulation with a deep reinforcement learning agent. We used real-world data to generate a wide range of synthetic instances that were used to train the HSRA agent. Our results show that the proposed model is able to achieve better outcomes in terms of patient treatment success and cost-effectiveness compared to previous resource allocation algorithms. We show that different planning horizons obtain similar performance in handling anomalies. In addition, we show a second-order polynomial connection between the patient treatment success and both the hospital's initial budget and funding over time. These results suggest that our approach has the potential to improve the efficiency and effectiveness of HSRA in healthcare systems.

Original languageEnglish
Article number106783
JournalEngineering Applications of Artificial Intelligence
Volume126
DOIs
StatePublished - Nov 2023
Externally publishedYes

Keywords

  • Agent-based simulation
  • Clinical resource allocation
  • Deep reinforcement learning
  • Sim-to-real transfer

Fingerprint

Dive into the research topics of 'Data-driven hospitals staff and resources allocation using agent-based simulation and deep reinforcement learning'. Together they form a unique fingerprint.

Cite this