helpinghand
search
needassistance
 

Mimer SQL Multilingual Support

Introduction

Mimer SQL has an extensive support for Unicode and SQL Collations, i.e. multilingual sorting, which makes Mimer SQL an excellent choice when developing applications that should handle texts according to local language rules or simultaneously handle texts in different languages.

Mimer SQL has built-in support for sorting in many languages including over 60 different European languages, Afrikaans, Arabic, Filipino, Hebrew, Thai and Vietnamese. From our Collation Charts Page we also provide downloadable collation definitions for languages such as Chinese, Japanese, Korean and several other languages. These collation definitions can easily be added to your running Mimer SQL system whenever needed with a simple CREATE COLLATION statement in SQL.

Predefined collations or customized ones?

The Unicode 4.0 character set has a code map covering more than one million different characters. The Unicode default sorting order - collation - defines a unified sorting order for all these characters.

Many languages in the world have deviations from the Unicode default sorting order. Therefore, Mimer SQL includes support for collations in more than 120 different languages, either as built-in definitions or as downloadable definitions. You may use several collations on the same data, and it is very simple to modify an existing collation.

If your specific language is missing among the more than 120 predefined ones, a new collation can easily be defined. And, you can also modify an existing collation to fit your specific sorting preferences or demands.

Using Predefined Collations

A collation can have four different comparison levels. For each language that is represented as a predefined collation in Mimer SQL, two levels are available - the primary level and the tertiary level. These levels are denoted by the "_1" and "_3" extensions along with the language name, for example "breton_1" and "breton_3". When using a predefined collation, the language name with one of these extensions should be given as the collation name in the COLLATE clause of the SQL statement.

Typically, the primary level is used to denote differences between base characters, for example, "a < b". This is the strongest difference, usually used in a WHERE clause. As an example, dictionaries are divided into different sections by the base character.

Upper and lower case differences in characters are distinguished at the tertiary level, for example, "ab < Ab < áb". This level is usually used in the ORDER BY clause. A tertiary difference is ignored when there is a primary or secondary difference anywhere in the strings.

For examples on use of various comparison levels, see the Collation FAQ.

Additional information and feedback

For a more thorough description on how to use collations in Mimer SQL, please see the Mimer SQL Version 9.2 Documentation Set. Or, take a look at the feature article Character data, Unicode and Collations that presents the Mimer SQL perspective on multilingual support.

A comprehensive collection of sort ordering information for various languages and the support for this in Mimer SQL can be found at the Mimer SQL Collation Charts Page.

You will find some quick tips on how to use collations in Mimer SQL among our Collation FAQ.

If you have any questions or comments on our collations charts, or if you would like to share your collation definition with the Mimer SQL community, please send us a mail.

 

 

Powered by Mimer SQL

Powered by Mimer SQL