Dealing with Diacritics

Diacritics, which can be mapped to a distinct modern equivalent, are normalized accordingly. This applies to the following characters, for example:

Historical Spelling	Realization as
aͤ, oͤ, uͤ	ä, ö, ü
ſ	s
ſs, ſz	ß
Ꝛ, ꝛ	R, r

Diacritics for which no unambiguous equivalent can be found (e. g. context-related ů) are not normalized. In this case, the corresponding Unicode character is used.