Habana
/

gpt2

regisss HF staff commited on Oct 28, 2022

Commit

88e164b

1 Parent(s): 30315c1

Change usage section

Files changed (1) hide show

README.md CHANGED Viewed

@@ -23,25 +23,23 @@ This enables to specify:
 ## Usage
 The model is instantiated the same way as in the Transformers library.
-The only difference is that there are a few new training arguments specific to HPUs:
-```
-from optimum.habana import GaudiTrainer, GaudiTrainingArguments
-from transformers import GPT2Tokenizer, GPT2Model
-tokenizer = GPT2Tokenizer.from_pretrained('gpt2')
-model = GPT2Model.from_pretrained('gpt2')
-args = GaudiTrainingArguments(
-    output_dir="/tmp/output_dir",
-    use_habana=True,
-    use_lazy_mode=True,
-    gaudi_config_name="Habana/gpt2",
-)
-trainer = GaudiTrainer(
-    model=model,
-    args=args,
-    tokenizer=tokenizer,
-)
-trainer.train()
 ```

 ## Usage
 The model is instantiated the same way as in the Transformers library.
+The only difference is that there are a few new training arguments specific to HPUs.
+[Here](https://github.com/huggingface/optimum-habana/blob/main/examples/language-modeling/run_clm.py) is a causal language modeling example script to pre-train/fine-tune a model. You can run it with GPT2 with the following command:
+```bash
+python run_clm.py \
+    --model_name_or_path gpt2 \
+    --dataset_name wikitext \
+    --dataset_config_name wikitext-2-raw-v1 \
+    --per_device_train_batch_size 4 \
+    --per_device_eval_batch_size 4 \
+    --do_train \
+    --do_eval \
+    --output_dir /tmp/test-clm \
+    --gaudi_config_name Habana/gpt2 \
+    --use_habana \
+    --use_lazy_mode \
+    --throughput_warmup_steps 2
 ```
+Check the [documentation](https://huggingface.co/docs/optimum/habana/index) out for more advanced usage and examples.