Difficult or Non-Decipherable Characters and Omissions
Difficult to read characters or strings (e. g. due to physical deficiencies of the original, weak pressure) are enclosed with the tag <supplied>
:
<supplied>[character or string]</supplied>
If the reading is uncertain for the supplemented string, a @cert
attribute in the supplied element can be used to display the degree of certainty for the supplement. The level of certainty is displayed as follows:
Expression | Meaning |
---|---|
<supplied>
|
Certainty/safe reading (for example, the character/string is still weakly printed, thus ensuring that the reading is assured) |
<supplied cert="high">
|
high certainty of the addition (e. g. the character/string is no longer recognizable, but the reading results from the context and is thus secured) |
<supplied cert="low">
|
low certainty of supplementation/uncertain reading (e. g. the character/string is no longer recognizable and has been reconstructed; other reconstructions are possible) |
If the characters are not recognizable and cannot be reconstructed, the tag <gap/>
is set to indicate the gap. Within the <gap>
tag, the attributes @unit
, @quantity
and @reason
can be used to display how many characters the gap contains, as well as the reason for the missing part:
<gap unit="chars" quantity="[number of characters missing]" reason="[reason for the defect]"/>
The @unit
attribute can have the following values:
chars
|
characters |
lines
|
lines |
pages
|
pages |
words
|
words |
The @reason
attribute can have the following values:
lost
|
damage |
illegible
|
illegible |
fm
|
foreign language material |
insignificant
|
as unimportant for the corpus classified text |