Update README.md
Browse files
README.md
CHANGED
@@ -88,7 +88,7 @@ Consider splitting long sequences to process them separately.
|
|
88 |
|
89 |
## Training and evaluation data
|
90 |
|
91 |
-
The model was fine-tuned and evaluated on
|
92 |
*DTA reviEvalCorpus* is a parallel corpus of German texts from the period between 1780 to 1899, that aligns sentences in historical spelling of with their normalizations.
|
93 |
The training set contains 96 documents with 4.6M source tokens, the dev and test set contain 13 documents (405K tokens) and 12 documents (381K tokens), respectively.
|
94 |
For more information, see the [dataset card](https://huggingface.co/datasets/ybracke/dta-reviEvalCorpus-v1) of the corpus.
|
|
|
88 |
|
89 |
## Training and evaluation data
|
90 |
|
91 |
+
The model was fine-tuned and evaluated on the [DTA reviEvalCorpus](https://huggingface.co/datasets/ybracke/dta-reviEvalCorpus-v1).
|
92 |
*DTA reviEvalCorpus* is a parallel corpus of German texts from the period between 1780 to 1899, that aligns sentences in historical spelling of with their normalizations.
|
93 |
The training set contains 96 documents with 4.6M source tokens, the dev and test set contain 13 documents (405K tokens) and 12 documents (381K tokens), respectively.
|
94 |
For more information, see the [dataset card](https://huggingface.co/datasets/ybracke/dta-reviEvalCorpus-v1) of the corpus.
|