    Improved model+EMA checkpointing (#2292) · ec1d8496
    Glenn Jocher committed
    * Enhanced model+EMA checkpointing
    
    * update
    
    * bug fix
    
    * bug fix 2
    
    * always save optimizer
    
    * ema half
    
    * remove model.float()
    
    * model half
    
    * carry ema/model in fp32
    
    * rm model.float()
    
    * both to float always
    
    * cleanup
    
    * cleanup
test.py 16.0 KB