A neural network learns division
by growing a Fourier Lissajous structure
mod-97 division · seed — · grokking step ≈ —
The 96 invertible remainders self-organize around two learned Fourier modes.
As they crystallize, the classes leave a cluster at origin and trace a 3D Lissajous
curve — every axis a real projection of WU.
96 classes (invertible remainders)
dlog-order trace
idealized (K₁, K₂) Lissajous reference
1. Before grokking — noise
step —
No structure. Points scatter; spectrum is flat.
2. Spectral ignition
step —
Modes k = —, —, — begin to grow. Torus emerges.
3. Algorithm locks in
step —
Rapid organization. The (—, —) knot crystallizes.
4. Inside the Lissajous
post-roll
The algorithm is stable. The camera flies inside the (K₁, K₂) Lissajous curve.