pgptlformer-tinystories / re-pqt-rmsXrms-2fd4a7cb-930a-464b-90e0-eef9d7551d8d

Commit History

sling the illustrious and mysterious "attention_II" models. also some layerwise rmsnorm, qkprojection rmsnorm models, one twice as large as the other.
1f45909
verified

SQCU commited on