Fig. 7
From: A dosing strategy model of deep deterministic policy gradient algorithm for sepsis patients
Relationship between the training time and the number of training steps for the two models in the same environment. For the same number of steps, and with the same parameters and configuration, the model developed by the research teams from MIT and ICL required 1.7 times greater training time than our model.