Recommended for phonological/metrical queries referencing segments and syllables.
Click here to start ANNIS web.
There are two similar version of the corpus in the portal (see also Annotation 4: Syllables and Meter):
A video tutorial for the ANNIS portal is in preparation. In the meantime, here are some tips to get you started with the corpus:
Among the less obvious features of ANNIS are:
Recommended for simpler, non-phonological queries
Click here to download corpus.
ogr040-txm.zip
archive to any location on your computer.File > Load
and load the file OGR040.txm
contained in the .zip.By default, the concordance and index will show a basic normalized form of each word.
This can be changed to the diplomatic form by setting the concordance to display
dipl
rather than word
as the basic form.
A diplomatic edition is also included. This can be enabled by selecting dipl
as
well as or instead of default
in the bottom left of the edition window.
You can try out some of the sample queries given here.
The TXM binary, including the HTML edition of the corpus, is generated from the TEI P5 sources by adding the files in this zip archive to the folder containing the TEI XML files and running TXM’s “XML Zero + CSV” import.
Click here to download corpus.
If you need to use the corpus offline, you can also — with a little effort — install ANNIS locally.
ogr040-annis.zip
archive to any location on your computer.relANNIS/ogr040
. Repeat for relANNIS/ogrsv040
.Click here to download corpus.
The TEI P5 archive encodes each text and all annotation in a single TEI-compliant XML file. Lines, Words and Segments are encoded inline while Syllables are encoded in standoff.