Gemstone-512x13 / README.md
smcleish's picture
Upload GemmaForCausalLM
9ff7a55 verified
|
raw
history blame
416 Bytes
---
datasets:
- allenai/dolma
language:
- en
library_name: transformers
license: apache-2.0
tags:
- causal-lm
---
## Model Details
### Training
Models trained using [litgpt](https://github.com/Lightning-AI/litgpt) and [AxoNN](https://github.com/axonn-ai/litgpt) on AMD MI250 GPUs.
### Data
Train and validation data is taken from non-overlapping subsets of [dolma](https://huggingface.co/datasets/allenai/dolma).