SQCU/pgptlformer-tinystories
Dataset: roneneldan/TinyStories
1f45909
pgptlformer-tinystories/re-pqt-rmsXrms-ATTNII-697f0113-bb05-480b-b6dc-42a97de0de3e
1 contributor
History: 1 commit
SQCU
sling the illustrious and mysterious "attention_II" models. also some layerwise rmsnorm, qkprojection rmsnorm models, one twice as large as the other.
1f45909
verified
23 days ago
state_step006250.pt
Safe
pickle
Detected Pickle imports (4): "collections.OrderedDict", "torch.FloatStorage", "torch.ByteStorage", "torch._utils._rebuild_tensor_v2"
338 MB
LFS
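
The import list above is the kind of report that can be produced by reading a pickle stream's opcodes without unpickling it. The following is a minimal sketch of that idea using only the Python standard library. It is not the Hub's actual scanner: the function name `list_pickle_imports` is hypothetical, the input is a small in-memory pickle rather than this checkpoint, and the handling of `STACK_GLOBAL` is a simplification that assumes the module and attribute names are the two most recently pushed strings.

```python
import pickle
import pickletools
from collections import OrderedDict


def list_pickle_imports(data):
    """Return the sorted 'module.name' references found in a pickle stream.

    The stream is only parsed, never executed, so nothing is imported
    or called while scanning.
    """
    imports = set()
    recent_strings = []  # string arguments seen so far, in order
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name" in one string.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are taken from the stack.
            # Simplifying assumption: they are the last two strings pushed.
            if len(recent_strings) >= 2:
                module, name = recent_strings[-2:]
                imports.add(f"{module}.{name}")
        elif isinstance(arg, str):
            recent_strings.append(arg)
    return sorted(imports)


# Toy payload standing in for a state dict. Protocol 2 is used here
# only as an illustrative choice for the example.
payload = pickle.dumps(OrderedDict(a=1), protocol=2)
print(list_pickle_imports(payload))
```

A scan like this only reports which names a pickle would import if loaded. Whether a given list is acceptable is still a judgment call, and applying it to a real `.pt` file would first require extracting the pickle stream from the checkpoint, which this sketch does not cover.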