Dealing with Diacritics
Diacritics, which can be mapped to a distinct modern equivalent, are normalized accordingly. This applies to the following characters, for example:
Historical Spelling | Realization as |
---|---|
aͤ, oͤ, uͤ | ä, ö, ü |
ſ | s |
ſs, ſz | ß |
Ꝛ, ꝛ | R, r |
Diacritics for which no unambiguous equivalent can be found (e. g. context-related ů) are not normalized. In this case, the corresponding Unicode character is used.