[1. Verse and prose] [2. Annotation available in all texts] [3. Metrical annotation in the OGRSV] [4. The TXM version]
Annotation only available in ANNIS. For the TXM version, see section 4 below.
The OGR corpus contains both verse and prose texts. While all texts contain syllables (and, to an extent, lines) metrical annotation is only found in syllabic verse texts.
The OGR corpus is available in two versions:
Syllables (syll
) are sequences of Segments. They are further subdivided into Onset, Nucleus
and Coda. This subdivision is given by the onc
annotation, which has the values O
, N
or C
.
syll_lstress
: indicates whether the syllable bears a lexical stress.
The primary stressed syllable in a polysyllable and monosyllabic prosodic words are both tagged
y
(yes); all other syllables are tagged n
(no).Lines (line
) are sequences both of Words and Segments and have two primary roles in the OGR:
line_ref
annotation contains a citable reference to the text. In verse texts
it indicates the line of verse; in prose texts the page, column, and manuscript line. Words
broken across two lines in the manuscript are fully included within the Line in which they begin.In the OGRSV corpus, the lines of verse are scanned. Syllables are annotated according to their metrical position and Lines are annotated according to number of syllables and the nature of the cesura and the rhyme. Additionally, a tree-based representation of the metrical structure is available.
A five-character string giving the metrical position of the syllable in the line.
01
, 02
, … 10
.
Elided or uncounted syllables are annotated --
.10
, 09
, … 01
.
Elided or uncounted syllables are annotated --
.c
) or the rhyme (r
), or neither (-
).
Note that the c
and r
annotations are found both on the tonic and on any following uncounted syllables.With a simple regex, syll_metpos
can easily be used to find key syllables in the line, e.g.
syll_metpos=/..01r/
syll_metpos=/0108./
syll_metpos=/----./
syll_metpos="----c"
A five-character string summarizing the key metrical properties of the line.
08
, 10
. If the line is irregular and could not
be scanned, this will give the actual number of syllables in the line rather than the target length.-
for irregular lines.-
for irregular lines.4
, or -
for irregular lines. Note that the scanner only checks for cæsura in
standard positions, so for a decasyllable the cæsura will be either 4
or -
.line_met
can be queried with a simple regex, e.g.:
line_met=/10.e./
line_met=/08f../
There is also a line_correct
tag, with values True
if the line could be scanned and False
if not.
The OGRSV corpus contains an alternative tree visualization of the metrical structure which can be queried
like a syntax tree using hierarchical operators (>
, >*
, etc.). In addition to the Line,
Syllable, ONC, and Segment layers, the tree also contains a halfline and an extra
subdivision of the syllable into Onset and Rhyme. The units in the tree can be queried using the
following cat
properties:
cat="line"
: a linecat="hl"
: a halfline/hemistichcat="sy"
: a counted syllablecat="el"
: an elided syllablecat="On"
: a syllable onset (Onset–Rhyme layer)cat="Rhy"
: syllable rhymecat="O"
: syllable onsetcat="N"
: syllable nucleuscat="C"
: syllable codaA reduced form of this annotation is given in word-level tags in the TXM version.
ref
gives a citable reference for each word in the text (cf. line_ref
above). It appears by default in the left-hand
column of the concordance.
A string representing the syllabification of the word. Segments are transcribed using the phonemic
symbol (see Segment-level annotation). For example, in the string
ʤ/y\S.'t/i\.z/(ə)\
:
.
separates syllables;/...\
delimit the nucleus;'
before a syllable denotes primary stress.Additional markup, such as the parentheses around the nucleus (...)
, exist to provide information
to the scansion algorithm: in this case, that the vowel is elidable.
A three-character string representing the prosody of the word.
cl
: a cliticox
: an oxytone, i.e. stressed on the final syllablepo
: a paroxytone, i.e. stressed on the penultimate syllablepp
: a proparoxytone, i.e. stressed on the antepenultimate syllableFor example for parler prosody="2ox"
(a two-syllable oxytone), for
femme prosody="2po"
(a two-syllable paroxytone) and for le prosody="1cl"
(a monosyllabic clitic).
See section 3.2 above.
A string representing the metrical position of all the syllables in the word. The
principle is similar to syll_metpos
but applied to the whole word.
metpos
counts forwards from the start of the line, soptem
backwards from the end of the line.r
(rhyme), c
(caesura) or -
(neither)For example, the word ancienur (Alexis, l. 1) occupies the last four metrical positions
of a decasyllabic line. It therefore has metpos="07.08.09.10.r"
and soptem="04.03.02.01.r
.
Tip: soptem
is metpos
backwards, to help remember which is which!