hukuda222/hiragana-reverse-gpt2-xsmall


hukuda222 committed on Jan 19

Commit 0ab94bd · verified · 1 Parent(s): 559a14d

Update README.md

Files changed (1)

README.md +13 -3

@@ -1,3 +1,13 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+language:
+    - ja
+datasets:
+    - cc100
+---
+これはひらがなに変換して逆向きに並べ替えたデータセットで事前学習した言語モデルです。
+ひらがなを文字単位でトークンに分割しているため、回文や川柳のような音の数を重視するタスクに適しています。
+This is a language model pre-trained on a dataset whose text was converted to Japanese hiragana and then reversed.
+Since it tokenizes Hiragana at the character level, it is suitable for tasks that emphasize the number of sounds, such as palindromes or senryu (a form of Japanese poetry).
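
The README describes two ideas: splitting hiragana into one token per character, and reversing the text. The sketch below illustrates both in plain Python, assuming the simplest possible reading of that description. It is not the model's actual tokenizer or preprocessing pipeline; the function names and the `SMALL_KANA` set are hypothetical, and the mora count treats small kana such as ゃ, ゅ, ょ as adding no sound of their own.

```python
# Illustrative sketch only; NOT the model's actual tokenizer or preprocessing.
# All names below are hypothetical helpers introduced for this example.

# Small kana that attach to the preceding character and add no mora of their own.
SMALL_KANA = set("ゃゅょぁぃぅぇぉゎ")


def char_tokenize(text: str) -> list[str]:
    """Split a hiragana string into one token per character."""
    return list(text)


def reverse_text(text: str) -> str:
    """Reverse a string character by character.

    This is a naive reversal: a small kana ends up before its base character.
    How the real training data handles that case is not stated in the README.
    """
    return text[::-1]


def is_palindrome(text: str) -> bool:
    """Return True if the string reads the same in both directions."""
    return text == reverse_text(text)


def count_morae(text: str) -> int:
    """Approximate the number of sounds by counting non-small kana."""
    return sum(1 for ch in char_tokenize(text) if ch not in SMALL_KANA)


if __name__ == "__main__":
    print(char_tokenize("しんぶんし"))
    print(reverse_text("さくら"))
    print(is_palindrome("しんぶんし"))
    print(count_morae("がっこう"))
```

With one token per character, a palindrome check or a 5-7-5 sound count reduces to simple list operations, which is why character-level tokenization suits these tasks.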