Skip to content

V3.5.0

Change Log

Feature

Update

  • Update Muon optimizer. (#355, #356) * support decoupled weight decay. * adjust default hyperparameters the same as the original implementation. * support adjusted lr from the Moonlight. you can use it by setting use_adjusted_lr=True.
  • Tune the performance of the coupled Newton iteration method by 5% increase. (#360)
  • Update SCION optimizer. (#361) * add scale parameter. * update get_lmo_direction.

Fix

  • bias_correction2 in ScheduleFreeRAdam optimizer. (#354)
  • potential bug in SPAM optimizer. (#365)
  • initialize the z state within the step() of the ScheduleFreeWrapper. (#363, #366)