@karpathy
@giffmana @ChengleiSi It's a commit that lowered val loss but *increased* the wall clock time, so it gets rejected for being slower. In this version a commit must improve one, the other, or both. In my (new) autoresearch repo I have an alternative approach where you *always* train for e.g. 5 minutes and try to reduce val loss as much as possible. Possibly less confusing, but it has its own issues too.
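
A rough sketch of how the two acceptance rules could look side by side. This is only one reading of the post, not code from either repo: the `Run` type, the function names, and the numbers are all hypothetical, and the first rule assumes "improve one, the other, or both" also means the other metric may not get worse.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Run:
    """Hypothetical record of one training run (names are illustrative)."""
    val_loss: float      # validation loss at the end of the run
    wall_clock_s: float  # total training time in seconds


def accept_both_metrics(baseline: Run, candidate: Run) -> bool:
    """Rule 1 (assumed reading): no metric may regress, at least one must improve."""
    no_worse = (candidate.val_loss <= baseline.val_loss
                and candidate.wall_clock_s <= baseline.wall_clock_s)
    strictly_better = (candidate.val_loss < baseline.val_loss
                       or candidate.wall_clock_s < baseline.wall_clock_s)
    return no_worse and strictly_better


def accept_fixed_budget(baseline_val_loss: float, candidate_val_loss: float) -> bool:
    """Rule 2: every run trains for the same fixed time, so only val loss is compared."""
    return candidate_val_loss < baseline_val_loss


# Made-up numbers, purely for illustration.
baseline = Run(val_loss=3.30, wall_clock_s=300.0)
lower_loss_but_slower = Run(val_loss=3.25, wall_clock_s=320.0)
lower_loss_same_time = Run(val_loss=3.25, wall_clock_s=300.0)
same_loss_but_faster = Run(val_loss=3.30, wall_clock_s=280.0)

rejected = accept_both_metrics(baseline, lower_loss_but_slower)
accepted_loss = accept_both_metrics(baseline, lower_loss_same_time)
accepted_speed = accept_both_metrics(baseline, same_loss_but_faster)
```

Under the first rule the commit described above (lower val loss, longer wall clock) is rejected, while under the fixed-budget rule the time is constant by construction and the comparison collapses to a single number.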