BMC Medical Informatics and Decision Making

Table 4 Expected return and mortality under different policies

From: Towards more efficient and robust evaluation of sepsis treatment with deep reinforcement learning

Policies	\({\textbf{V}}_{{\textbf{DR}}}\)	Mortality
\(Reward_{3.0}\)	\(-0.0284\)	14.5% ± 0.6%
\(Reward_{3.0^+}\)	\(-0.1800\)	17.2% ± 0.5%
\(Reward_{4.0}\)	\(-0.0253\)	14.7% ± 0.6%
\(Reward_{3.0+3.0^+}\)	0.0291	14.1% ± 0.6%
\(Reward_{3.0+4.0}\)	0.0365	13.9% ± 0.5%
\(Reward_{3.0^++4.0}\)	\({\textbf {0.2307}}\)	11.3% ± 0.4%
\(Reward_{all}\)	0.1546	12.2% ± 0.4%
Clinician	\(-0.0294\)	14.5% ± 0.5%

Bold indicates the best performance; italics indicate the 95% confidence interval.

ISSN: 1472-6947
