40 lines
		
	
	
		
			1.7 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
			
		
		
	
	
			40 lines
		
	
	
		
			1.7 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
This directory holds configuration files that enable MySQL to work with
 | 
						|
different character sets.  It contains:
 | 
						|
 | 
						|
charset_name.xml
 | 
						|
    Each charset_name.xml file contains information for a simple character
 | 
						|
    set.  The information in the file describes character types,
 | 
						|
    lower- and upper-case equivalencies and sorting orders for the
 | 
						|
    character values in the set.
 | 
						|
 | 
						|
Index.xml
 | 
						|
    The Index.xml file lists all of the available charset configurations,
 | 
						|
    including collations.
 | 
						|
 | 
						|
    Each collation must have a unique number.  The number is stored
 | 
						|
    IN THE DATABASE TABLE FILES and must not be changed.
 | 
						|
 | 
						|
    The max-id attribute of the <charsets> element must be set to
 | 
						|
    the largest collation number.
 | 
						|
 | 
						|
Compiled in or configuration file?
 | 
						|
    When should a character set be compiled in to MySQL's string library
 | 
						|
    (libmystrings), and when should it be placed in a charset_name.xml
 | 
						|
    configuration file?
 | 
						|
 | 
						|
    If the character set requires the strcoll functions or is a
 | 
						|
    multi-byte character set, it MUST be compiled in to the string
 | 
						|
    library.  If it does not require these functions, it should be
 | 
						|
    placed in a charset_name.xml configuration file.
 | 
						|
 | 
						|
    If the character set uses any one of the strcoll functions, it
 | 
						|
    must define all of them.  Likewise, if the set uses one of the
 | 
						|
    multi-byte functions, it must define them all.  See the manual for
 | 
						|
    more information on how to add a complex character set to MySQL.
 | 
						|
 | 
						|
Syntax of configuration files
 | 
						|
    The syntax is very simple.  Words in <map> array elements are
 | 
						|
    separated by arbitrary amounts of whitespace. Each word must be a
 | 
						|
    number in hexadecimal format.  The ctype array has 257 words; the
 | 
						|
    other arrays (lower, upper, etc.) take up 256 words each after that.
 |