Fig. 1From: Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIVThe medication regimen a before learning; b-c during learning; and d after learningBack to article page