SIGS Kernkorpus

The corpus is annotated as follows:

  • tokenized (graphical and syntactic)
  • POS
  • Lemma
  • Animacy
  • Segmentation into syntactic units

Our data is based upon witch interrogation protocols, edited in Macha et al. (2005). The Kernkorpus consists of 18 protocols equally distributed across geographical space and time.

As a visualization of the corpus there is an interactive map.

Place Region Time
Jever NW 1593
Meldorf NW 1618
Alme NW 1630
Perleberg NO 1588
Güstrow NO 1615
Stralsund NO 1630
Hamm MW 1592
Gaugrehweiler MW 1610
Lemberg MW 1630
Georgenthal MO 1597
Rosenburg MO 1618
Ostrau MO 1628
Riedlingen SW 1596
Günzburg SW 1613
Baden-Baden SW 1628
München SO 1600
Schweinfurt SO 1616
Bamberg SW 1628