Update README.md
Browse files
README.md
CHANGED
@@ -30,6 +30,9 @@ Not yet published dataset created from the Historical Land Registry of the city
|
|
30 |
Timeframe: 1400-1700. Language: Early New High German. 661 documents in train, 83 in dev.
|
31 |
Language model based on the full HLRB corpus until 1800, appr. 120k documents.
|
32 |
|
|
|
|
|
|
|
33 |
The training data was prepared in a special way to accommodate nested annotation. See the linked paper for more information.
|
34 |
|
35 |
|
|
|
30 |
Timeframe: 1400-1700. Language: Early New High German. 661 documents in train, 83 in dev.
|
31 |
Language model based on the full HLRB corpus until 1800, appr. 120k documents.
|
32 |
|
33 |
+
The documents were annotated according to the [BeNASch annotation guidelines](https://dhbern.github.io/BeNASch/).
|
34 |
+
For this model, a simplified tagset was used.
|
35 |
+
|
36 |
The training data was prepared in a special way to accommodate nested annotation. See the linked paper for more information.
|
37 |
|
38 |
|