recnet_logoRecNet
Back
Yoav Artzi 4/8/2025
Again, not really a paper. But a wild ride through optimization. Pushing a GPT-2 replication to attain a target loss in 3(!) minutes. A lot to get inspired by.
Modded-NanoGPT
Modded-NanoGPTKeller Jordan and cogithub.com
1.17.1
Checking API service...
© 2024 RecNet. All rights reserved.