hukuda222 commited on
Commit
0ab94bd
·
verified ·
1 Parent(s): 559a14d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -3
README.md CHANGED
@@ -1,3 +1,13 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - ja
5
+ datasets:
6
+ - cc100
7
+ ---
8
+
9
+ これはひらがなに変換して逆向きに並べ替えたデータセットで事前学習した言語モデルです。
10
+ ひらがなを文字単位でトークンに分割しているため、回文や川柳のような音の数を重視するタスクに適しています。
11
+
12
+ This is a language model pre-trained on a dataset converted into Japaneses-Hiragana and reversed.
13
+ Since it tokenizes Hiragana at the character level, it is suitable for tasks that emphasize the number of sounds, such as palindromes or senryu (a form of Japanese poetry).