Patient-Specific Sedation Management via Deep Reinforcement Learning

Introduction: Developing reliable medication dosing guidelines is challenging because individual dose-response relationships are mitigated by both static (e. g., demographic) and dynamic factors (e.g., kidney function). In recent years, several data-driven medication dosing models have been proposed...

Full description

Saved in:
Bibliographic Details
Main Authors: Niloufar Eghbali (Author), Tuka Alhanai (Author), Mohammad M. Ghassemi (Author)
Format: Book
Published: Frontiers Media S.A., 2021-03-01T00:00:00Z.
Subjects:
Online Access:Connect to this object online.
Tags: Add Tag
No Tags, Be the first to tag this record!

MARC

LEADER 00000 am a22000003u 4500
001 doaj_a81846f9b8264a9c8f8c99cabaf726b2
042 |a dc 
100 1 0 |a Niloufar Eghbali  |e author 
700 1 0 |a Tuka Alhanai  |e author 
700 1 0 |a Mohammad M. Ghassemi  |e author 
245 0 0 |a Patient-Specific Sedation Management via Deep Reinforcement Learning 
260 |b Frontiers Media S.A.,   |c 2021-03-01T00:00:00Z. 
500 |a 2673-253X 
500 |a 10.3389/fdgth.2021.608893 
520 |a Introduction: Developing reliable medication dosing guidelines is challenging because individual dose-response relationships are mitigated by both static (e. g., demographic) and dynamic factors (e.g., kidney function). In recent years, several data-driven medication dosing models have been proposed for sedatives, but these approaches have been limited in their ability to assess interindividual differences and compute individualized doses.Objective: The primary objective of this study is to develop an individualized framework for sedative-hypnotics dosing.Method: Using publicly available data (1,757 patients) from the MIMIC IV intensive care unit database, we developed a sedation management agent using deep reinforcement learning. More specifically, we modeled the sedative dosing problem as a Markov Decision Process and developed an RL agent based on a deep deterministic policy gradient approach with a prioritized experience replay buffer to find the optimal policy. We assessed our method's ability to jointly learn an optimal personalized policy for propofol and fentanyl, which are among commonly prescribed sedative-hypnotics for intensive care unit sedation. We compared our model's medication performance against the recorded behavior of clinicians on unseen data.Results: Experimental results demonstrate that our proposed model would assist clinicians in making the right decision based on patients' evolving clinical phenotype. The RL agent was 8% better at managing sedation and 26% better at managing mean arterial compared to the clinicians' policy; a two-sample t-test validated that these performance improvements were statistically significant (p < 0.05).Conclusion: The results validate that our model had better performance in maintaining control variables within their target range, thereby jointly maintaining patients' health conditions and managing their sedation. 
546 |a EN 
690 |a medication dosing 
690 |a personalized medicine 
690 |a deep reinforcement learning 
690 |a propofol 
690 |a sedation management 
690 |a Medicine 
690 |a R 
690 |a Public aspects of medicine 
690 |a RA1-1270 
690 |a Electronic computers. Computer science 
690 |a QA75.5-76.95 
655 7 |a article  |2 local 
786 0 |n Frontiers in Digital Health, Vol 3 (2021) 
787 0 |n https://www.frontiersin.org/articles/10.3389/fdgth.2021.608893/full 
787 0 |n https://doaj.org/toc/2673-253X 
856 4 1 |u https://doaj.org/article/a81846f9b8264a9c8f8c99cabaf726b2  |z Connect to this object online.