Sort order

DEX

DEX = „Explicative Dictionary of the Romanian Language“

It is for Romanian what Oxford/Webster dictionaries are for English.

Published by the Romanian Academy

DEX 1
DEX 2
DEX 3
DEX 4
DEX 5

Orto

„Orthographic, Orthoepic And Punctuation Guide Of The Romanian Language“

Published by the Romanian Academy

In picture 4 the 1st paragraph says „The Romanian alphabet is formed from 31 characters, with the following order:“

Orto 1
Orto 2
Orto 3
Orto 4
Orto 5

RoEng

„Romanian-English, English-Romanian Dictionary“

Published by the „Teora“ publishing house. It is a publishing house that concentrates a lot on technical books, computers, programming, etc.

Most of the work is done on computers. As a result, you may notice the S and T with comma under used properly on the cover and on first page, but the incorrect cedilla forms in the content pages.

RoEng 1
RoEng 2
RoEng 3
RoEng 4

The characters

The correct Romanian characters:

CharUnicodeDescLCMapString
ĂU+0102LATIN CAPITAL LETTER A WITH BREVE0x0e 0x08 0x01 0x01 0x12 0x01 0x01 0x00
ăU+0103LATIN SMALL LETTER A WITH BREVE0x0e 0x08 0x01 0x01 0x01 0x01 0x00
ÂU+00C2LATIN CAPITAL LETTER A WITH CIRCUMFLEX0x0e 0x02 0x01 0x12 0x01 0x12 0x01 0x01 0x00
âU+00E2LATIN SMALL LETTER A WITH CIRCUMFLEX0x0e 0x02 0x01 0x12 0x01 0x01 0x01 0x00
ÎU+00CELATIN CAPITAL LETTER I WITH CIRCUMFLEX0x0e 0x34 0x01 0x01 0x12 0x01 0x01 0x00
îU+00EELATIN SMALL LETTER I WITH CIRCUMFLEX0x0e 0x34 0x01 0x01 0x01 0x01 0x00
ȘU+0218LATIN CAPITAL LETTER S WITH COMMA BELOW0x01 0x01 0x01 0x01 0x00
șU+0219LATIN SMALL LETTER S WITH COMMA BELOW0x01 0x01 0x01 0x01 0x00
ȚU+021ALATIN CAPITAL LETTER T WITH COMMA BELOW0x01 0x01 0x01 0x01 0x00
țU+021BLATIN SMALL LETTER T WITH COMMA BELOW0x01 0x01 0x01 0x01 0x00

The incorrect forms:

CharUnicodeDescLCMapString
ŞU+015ELATIN CAPITAL LETTER S WITH CEDILLA0x0e 0x98 0x01 0x01 0x12 0x01 0x01 0x00
şU+015FLATIN SMALL LETTER S WITH CEDILLA0x0e 0x98 0x01 0x01 0x01 0x01 0x00
ŢU+0162LATIN CAPITAL LETTER T WITH CEDILLA0x0e 0x9e 0x01 0x01 0x12 0x01 0x01 0x00
ţU+0163LATIN SMALL LETTER T WITH CEDILLA0x0e 0x9e 0x01 0x01 0x01 0x01 0x00

Windows incorrect sort order:

CharLCMapString
a0x0e 0x02 0x01 0x01 0x01 0x01 0x00
â0x0e 0x02 0x01 0x12 0x01 0x01 0x01 0x00
ă0x0e 0x08 0x01 0x01 0x01 0x01 0x00
b0x0e 0x09 0x01 0x01 0x01 0x01 0x00

All LCMapString keys are generated using LCMapString with LCID=0x418 (Romanian, Romania) and LCMAP_SORTKEY, on Window XP Pro SP2.

The incorrect forms are there since ISO-8859-2, and for all processing should be considered equivalent to the correct ones. (although the glyph for T/t cedilla was often using the comma)

Sometimes it is impossible to say if the correct character is used, beacuse the glyphs are also wrong.
The gliphs in some fonts have comma for the cedilla characters, some have cedilla for the comma characters.
Also, the comma forms are missing from most fonts.

UnicodeArial glyphsArial Unicode glyphsTahoma
S cedillaS cedillaS commaS cedilla
s cedillas cedillas commas cedilla
T cedillaT commaT commaT cedilla
t cedillat commat commat cedilla
S commaS missingmissingS comma
s commas missingmissings comma
T commaT missingmissingT comma
t commat missingmissingt comma