uer
/

gpt2-chinese-ancient

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

uer commited on Sep 4, 2023

Commit

21bcad2

·

1 Parent(s): 1752eb1

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -39,14 +39,14 @@ The model is pre-trained by [UER-py](https://github.com/dbiir/UER-py/) on [Tence
 ```
 python3 preprocess.py --corpus_path corpora/ancient_chinese.txt \
-                      --vocab_path models/google_zh_vocab.txt \
                       --dataset_path ancient_chinese_dataset.pt --processes_num 16 \
                       --seq_length 320 --data_processor lm
 ```
 ```
 python3 pretrain.py --dataset_path ancient_chinese_dataset.pt \
-                    --vocab_path models/google_zh_vocab.txt \
                     --config_path models/bert_base_config.json \
                     --output_model_path models/ancient_chinese_gpt2_model.bin \
                     --world_size 8 --gpu_ranks 0 1 2 3 4 5 6 7 \

 ```
 python3 preprocess.py --corpus_path corpora/ancient_chinese.txt \
+                      --vocab_path models/google_zh_ancient_vocab.txt \
                       --dataset_path ancient_chinese_dataset.pt --processes_num 16 \
                       --seq_length 320 --data_processor lm
 ```
 ```
 python3 pretrain.py --dataset_path ancient_chinese_dataset.pt \
+                    --vocab_path models/google_zh_ancient_vocab.txt \
                     --config_path models/bert_base_config.json \
                     --output_model_path models/ancient_chinese_gpt2_model.bin \
                     --world_size 8 --gpu_ranks 0 1 2 3 4 5 6 7 \