Update README.md
Browse files
README.md
CHANGED
@@ -39,14 +39,14 @@ The model is pre-trained by [UER-py](https://github.com/dbiir/UER-py/) on [Tence
|
|
39 |
|
40 |
```
|
41 |
python3 preprocess.py --corpus_path corpora/ancient_chinese.txt \
|
42 |
-
--vocab_path models/
|
43 |
--dataset_path ancient_chinese_dataset.pt --processes_num 16 \
|
44 |
--seq_length 320 --data_processor lm
|
45 |
```
|
46 |
|
47 |
```
|
48 |
python3 pretrain.py --dataset_path ancient_chinese_dataset.pt \
|
49 |
-
--vocab_path models/
|
50 |
--config_path models/bert_base_config.json \
|
51 |
--output_model_path models/ancient_chinese_gpt2_model.bin \
|
52 |
--world_size 8 --gpu_ranks 0 1 2 3 4 5 6 7 \
|
|
|
39 |
|
40 |
```
|
41 |
python3 preprocess.py --corpus_path corpora/ancient_chinese.txt \
|
42 |
+
--vocab_path models/google_zh_ancient_vocab.txt \
|
43 |
--dataset_path ancient_chinese_dataset.pt --processes_num 16 \
|
44 |
--seq_length 320 --data_processor lm
|
45 |
```
|
46 |
|
47 |
```
|
48 |
python3 pretrain.py --dataset_path ancient_chinese_dataset.pt \
|
49 |
+
--vocab_path models/google_zh_ancient_vocab.txt \
|
50 |
--config_path models/bert_base_config.json \
|
51 |
--output_model_path models/ancient_chinese_gpt2_model.bin \
|
52 |
--world_size 8 --gpu_ranks 0 1 2 3 4 5 6 7 \
|