Non-record: Value Residual (-0.015 BPB) + Gated Attention (-0.003 BPB) with ablations #413

Open
anantdgoel wants to merge 1 commit into openai:main from anantdgoel:value-residual-gated-attention

Commits

Commits on Mar 22, 2026