Quick access the corpus:
Read more about the different versions:
Recommended for phonological/metrical queries referencing segments and syllables.
Click here to start ANNIS web.
There are two similar version of the corpus in the portal (see also Annotation 4: Syllables and Meter):
A video tutorial for the ANNIS portal is in preparation. In the meantime, here are some tips to get you started with the corpus:
Among the less obvious features of ANNIS are:
Recommended for simpler, non-phonological queries
Click here to download corpus.
ogr100-txm.zip
archive to any location on your computer.File > Load
and load the file OGR100.txm
contained in the .zip.By default, the concordance and index will show a basic normalized form of each word.
This can be changed to the diplomatic form by setting the concordance to display
dipl
rather than word
as the basic form.
A diplomatic edition is also included. This can be enabled by selecting dipl
as
well as or instead of default
in the bottom left of the edition window.
You can try out some of the sample queries given here.
Click here to download corpus.
If you need to use the corpus offline, you can install ANNIS locally. With the release of ANNIS 4, this is now really easy!
localhost:5712
ogr100-annis.zip
archive to any location on your computer.Administration > Import Corpus
and load the folder ogr100/relANNIS.zip
. Repeat for ogrsv100/relANNIS.zip
.Click here to download corpus.
The TEI P5 archive encodes each text and all annotation in a single TEI-compliant XML file. Lines, Words and Segments are encoded inline while Syllables are encoded in standoff.