Gemstone-512x13 / README.md
metadata
datasets:
  - allenai/dolma
language:
  - en
library_name: transformers
license: apache-2.0
tags:
  - causal-lm

Model Details

Training

Models were trained using litgpt and AxoNN on AMD MI250 GPUs.

Data

Training and validation data are taken from non-overlapping subsets of dolma.
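
As an illustration of what "non-overlapping subsets" means in practice, the sketch below shows one common way to split a corpus into disjoint training and validation sets by hashing document IDs. This is an assumption for illustration only and is not necessarily how the split for this model was produced; the names `assign_split` and `split_ids`, the `doc-…` IDs, and the `val_fraction` values are all hypothetical.

```python
import hashlib


def assign_split(doc_id: str, val_fraction: float = 0.01) -> str:
    """Deterministically map a document ID to 'train' or 'validation'.

    Hypothetical helper: the same ID always lands in the same split,
    so the two subsets cannot overlap.
    """
    digest = hashlib.sha256(doc_id.encode("utf-8")).digest()
    # Interpret the first 8 bytes as an integer in [0, 2**64), then scale to [0, 1).
    bucket = int.from_bytes(digest[:8], "big") / 2**64
    return "validation" if bucket < val_fraction else "train"


def split_ids(doc_ids, val_fraction: float = 0.01):
    """Partition an iterable of document IDs into (train, validation) lists."""
    train, val = [], []
    for doc_id in doc_ids:
        if assign_split(doc_id, val_fraction) == "validation":
            val.append(doc_id)
        else:
            train.append(doc_id)
    return train, val


# Made-up document IDs standing in for corpus entries.
ids = [f"doc-{i}" for i in range(10_000)]
train_ids, val_ids = split_ids(ids, val_fraction=0.05)
```

Because the assignment depends only on the document ID, the split is reproducible across runs and machines without storing an index of which documents went where.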