    Implement `--save-period` locally (#5047) · 5afc9c25
    Glenn Jocher committed
    This PR adds a new training argument, `--save-period`, which saves a training checkpoint every `x` epochs. For example, to save a checkpoint every 50 epochs:
    ```
    python train.py --save-period 50  # saves epoch50.pt, epoch100.pt, epoch150.pt, ... etc.
    ```
    
    These checkpoints are saved in addition to the existing last.pt and best.pt checkpoints and do not affect their behavior. The default value is -1, i.e. disabled.
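
As a rough illustration of the behavior described above, the periodic-save decision can be sketched as follows. This is a minimal standalone sketch, not the code in train.py: the helper name `periodic_checkpoint_path`, the `weights` directory, and the help text are hypothetical, and the real training loop may count epochs differently.

```python
import argparse
from pathlib import Path
from typing import Optional

# Flag name and default (-1 = disabled) follow the commit message;
# the help text is an assumption made for this sketch.
parser = argparse.ArgumentParser()
parser.add_argument("--save-period", type=int, default=-1,
                    help="save a checkpoint every x epochs (disabled if < 1)")


def periodic_checkpoint_path(epoch: int, save_period: int,
                             weights_dir: Path) -> Optional[Path]:
    """Return the periodic checkpoint path for this epoch, or None.

    Hypothetical helper: a checkpoint is due when the period is positive
    and the epoch is a positive multiple of it.
    """
    if save_period > 0 and epoch > 0 and epoch % save_period == 0:
        return weights_dir / f"epoch{epoch}.pt"
    return None


# Simulate 150 epochs with --save-period 50 (no files are written).
opt = parser.parse_args(["--save-period", "50"])
saved = []
for epoch in range(1, 151):
    path = periodic_checkpoint_path(epoch, opt.save_period, Path("weights"))
    if path is not None:
        saved.append(path.name)

print(saved)  # ['epoch50.pt', 'epoch100.pt', 'epoch150.pt']
```

With the default of -1 the helper always returns `None`, so only last.pt and best.pt would be written.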
train.py 31.2 KB