O(1) Memory Neural Network Training with Reversible Architectures
2 amazedsaint 1 6/21/2025, 8:09:09 PM github.com โ
Comments (1)
amazedsaint ยท 5h ago
Proposing a new architecture for O(1) memory neural network training. ZeroActivation enables training of arbitrarily deep neural networks with constant memory usage, regardless of depth. Perfect for Apple Silicon (M1/M2) users and anyone looking to train impossibly deep models. The kicker is a new model I developed for Reverse SSA, more on that soon - Thanks