Anthropic has been adding so many features to Claude, I had to give it a try for myself ...
AdamW: A standard optimizer used to train deep learning models. Muon: A newer optimizer that Netflix found performs better ...