iprada commited on
Commit
b9717c4
·
verified ·
1 Parent(s): 2619cca

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md CHANGED
@@ -30,6 +30,9 @@ Not yet published dataset created from the Historical Land Registry of the city
30
  Timeframe: 1400-1700. Language: Early New High German. 661 documents in train, 83 in dev.
31
  Language model based on the full HLRB corpus until 1800, appr. 120k documents.
32
 
 
 
 
33
  The training data was prepared in a special way to accommodate nested annotation. See the linked paper for more information.
34
 
35
 
 
30
  Timeframe: 1400-1700. Language: Early New High German. 661 documents in train, 83 in dev.
31
  Language model based on the full HLRB corpus until 1800, appr. 120k documents.
32
 
33
+ The documents were annotated according to the [BeNASch annotation guidelines](https://dhbern.github.io/BeNASch/).
34
+ For this model, a simplified tagset was used.
35
+
36
  The training data was prepared in a special way to accommodate nested annotation. See the linked paper for more information.
37
 
38