Fig. 3From: Towards more efficient and robust evaluation of sepsis treatment with deep reinforcement learningLeft: The negative relationship between cumulative average \(Q (s_t, a_t)\) value and mean of patients mortality; Right: The training loss using different reward functionsBack to article page