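To reach step N, your last move must start from either step N-1 (a single step) or step N-2 (a double step). These two cases never overlap, so the number of ways to reach step N is the sum of the number of ways to reach step N-1 and the number of ways to reach step N-2: ways(N) = ways(N-1) + ways(N-2).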
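The recurrence can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `count_ways` is hypothetical, and the base cases (one way to stand at step 0, one way to reach step 1) are not given in the text and are assumed here.

```python
def count_ways(n: int) -> int:
    """Count the ways to reach step n when each move climbs 1 or 2 steps."""
    if n < 0:
        raise ValueError("n must be non-negative")
    # Assumed base cases (not stated in the text): ways(0) = 1, ways(1) = 1.
    prev, curr = 1, 1
    for _ in range(2, n + 1):
        # ways(i) = ways(i - 1) + ways(i - 2)
        prev, curr = curr, prev + curr
    return curr
```

Keeping only the two most recent values is enough, since each step depends on just the two before it. With these base cases, `count_ways(2)` is 2 and `count_ways(5)` is 8.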
tai_checkpoint_dir = path_list['model_path_tai_PPO_h']
def PPO_tai_episoid(existing_env=None, total_episode=0, episode=0, log_writer_tai=None, log_file_latest_tai ...