Content-type: text/html Man page of iconv_ja

iconv_ja

Section: Standards, Environments, and Macros (5)
Updated: 10 Jan 2003
Index Return to Main Contents
 

NAME

iconv_ja - code set conversions in ja locale  

DESCRIPTION

The following code set conversions are supported:

Code Set Conversions Supported

Source CodeTarget Code

eucJP
eucJP
eucJP
eucJP
eucJP
eucJP
eucJP
eucJP
eucJP
eucJP
SJIS
SJIS
SJIS
SJIS
SJIS
PCK
PCK
PCK
PCK
PCK
PCK
PCK
PCK
ISO-2022-JP
ISO-2022-JP
ISO-2022-JP
ISO-2022-JP
"Unicode"s
"Unicode"s
"Unicode"s
UTF-8
UTF-8
UTF-8-Java
UTF-8-Java
JIS7
jis
jis
jis
ibmj
ibmj
ibmj
ibmj-EBCDIK
ibmj-EBCDIK

Code Set Conversions Supported

Source CodeTarget Code

eucJP
eucJP
eucJP
eucJP
eucJP
eucJP
eucJP
eucJP
eucJP
eucJP
eucJP
eucJP
PCK
PCK
PCK
PCK
PCK
PCK
PCK
PCK
PCK
PCK
PCK
PCK
UTF-8
UTF-8
UTF-8
UTF-8
UTF-8
"Unicode"s
"Unicode"s
UTF-8
UTF-8
UTF-8
UTF-8
UTF-8
UTF-8
UTF-8
UTF-8
UTF-8-ms932
ibm930
ibm930
ibm930
ibm931
ibm931
ibm931
ibm939
ibm939
ibm939
ibm5026
ibm5026
ibm5026
ibm5035
ibm5035
ibm5035
FujitsuJEF-ascii-code
FujitsuJEF-ascii-code
FujitsuJEF-ascii-code
FujitsuJEF-kana-code
FujitsuJEF-kana-code
FujitsuJEF-kana-code
FujitsuJEF-ascii-face
FujitsuJEF-ascii-face
FujitsuJEF-ascii-face

Code Set Conversions Supported

Source CodeTarget Code

FujitsuJEF-kana-face
FujitsuJEF-kana-face
FujitsuJEF-kana-face
HitachiKEIS83
HitachiKEIS83
HitachiKEIS83
HitachiKEIS90
HitachiKEIS90
HitachiKEIS90UTF-8
NECJIPS
NECJIPS
NECJIPS
ms932
eucJP-ms

The descriptions of each code sets in the above table are as follows:

Description of Supported Code Sets

CodesetDescription

eucJP
PCK
SJIS
ISO-2022-JP

Description of Supported Code Sets

CodesetDescription

JIS7
ISO-2022-JP.RFC1468
jis
ibmj
ibmj-EBCDIK
"Unicode"s
UTF-8
UTF-8-Java

Description of Supported Code Sets

CodesetDescription

ibm930
ibm931
ibm939
ibm5026
ibm5035
FujitsuJEF-ascii-code
FujitsuJEF-ascii-face
FujitsuJEF-kana-code
FujitsuJEF-kana-face

Description of Supported Code Sets

CodesetDescription

HitachiKEIS8
HitachiKEIS90
NECJIPS
ms932
UTF-8-ms932

Conversions are performed as described below. For all conversions, if the source code set includes characters not included in the target code set, conversion and output for all such characters will be done using a substitute character.

eucJP to PCK (SJIS) and PCK (SJIS) to eucJP

Conversion between eucJP and PCK (SJIS) can be used to convert JIS X 0201, JIS X 0208, JIS X 0212, and user-defined and vendor-defined characters based on TOG Japanese Vendors Council (TOG/JVC) Recommended Code Set Conversion Specification between Japanese EUC and Shift-JIS. If input data which does not belong to the source code set is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion.

eucJP to ISO-2022-JP(JIS7) and ISO-2022-JP(JIS7) to eucJP

Conversion between eucJP and ISO-2022-JP(JIS7) can be used to convert JIS X 0201, JIS X 0208 and JIS X 0212. If input data which does not belong to the source code set is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion.

eucJP to ISO-2022-JP.RFC1468

Conversion from eucJP to ISO-2022-JP.RFC1468 can be used to convert JIS X 0201 (except for figure character set for katakana) and JIS X 0208. If JIS X 0201 (figure character set for katakana), JIS X 0212, a user-defined, or a vendor-defined character is encountered among input data, it will be replaced with the substitute character ` ? ' (0x3f). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion.

eucJP to jis and jis to eucJP

Conversion between eucJP and jis is provided for the compatibility with ujtojis7() and jis7touj() libraries, and euctojis and jistoeuc utilities. It is extended to handle JIS X 0212. See jistoeuc(1).

eucJP to "Unicode"s and "Unicode"s to eucJP

Conversion between eucJP and UTF-8 can be used to convert JIS X 0201, JIS X 0208, JIS X 0212, a user-defined, and a vendor-defined character. If input data which does not have the corresponding character in the target code set is encountered, it will be replaced with the substitute character (eucJP: ` ? ' (0x3f), UTF-8: U+FFFD (0xefbfbd)). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion.

eucJP to UTF-8-Java and UTF-8-Java to eucJP

Conversion between eucJP and UTF-8-Java can be used to convert JIS X 0201, JIS X 0208, and JIS X 0212. If a user-defined or vendor-defined character is encountered among input data, it will be replaced with the substitute character (eucJP: ` ? ' (0x3f), UTF-8: U+FFFD (0xefbfbd)). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion.

eucJP to ibmj and ibmj to eucJP

Conversion between eucJP and ibmj is based on the IBM Kanji codebook (4th edition - September 1987), JIS X 0201, and JIS X 0208-1983. If you convert eucJP to ibmj, JISX 0201 and JIS X 0201 are all converted to substitute character.

eucJP to ibmj-EBCDIK and ibmj-EBCDIK to eucJP

Conversion between eucJP and ibmj-EBCDIK is based on the IBM Kanji codebook (4th edition - September 1987), JIS X 0201, and JIS X 0208-1983. If you convert eucJP to ibmj-EBCDIK, JISX 0201 and JIS X 0201 that have not correspondence characters with ibmj-EBCDIKare all converted to substitute character.

PCK (SJIS) to ISO-2022-JP and ISO-2022-JP to PCK (SJIS)

Conversion between PCK (SJIS) and ISO-2022-JP can be used to convert JIS X 0201, JIS X 0208, JIS X 0212, and user-defined and vendor-defined characters based on TOG Japanese Vendors Council (TOG/JVC) Recommended Code Set Conversion Specification between Japanese EUC and Shift-JIS. If input data which does not belong to the source code set is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion.

PCK (SJIS) to ISO-2022-JP.RFC1468

Conversion from PCK (SJIS) to ISO-2022-JP.RFC1468 can be used to convert JIS X 0201 (except for figure character set for katakana) and JIS X 0208. If JIS X 0201 (figure character set for katakana), a user-defined, or a vendor-defined character is encountered among input data, it will be replaced with the substitute character ` ? ' (0x3f). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion.

PCK (SJIS) to "Unicode"s and "Unicode"s to PCK (SJIS)

Conversion between PCK (SJIS) and UTF-8 can be used to convert JIS X 0201, JIS X 0208, a user-defined, and a vendor-defined character. If input data which does not have the corresponding character in the target code set is encountered, it will be replaced with the substitute character (PCK: ` ? ' (0x3f), UTF-8: U+FFFD (0xefbfbd)). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion.

PCK (SJIS) to UTF-8-Java and UTF-8-Java to PCK (SJIS)

Conversion between PCK (SJIS) and UTF-8-Java can be used to convert JIS X 0201 and JIS X 0208. If a user-defined or vendor-defined character is encountered among input data, it will be replaced with the substitute character (PCK: ` ? ' (0x3f), UTF-8: U+FFFD (0xefbfbd)). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion.

PCK (SJIS) to jis and jis to PCK (SJIS)

Conversion between PCK (SJIS) and jis is provided for the compatibility with sjtojis7() and jis7tosj() libraries , and sjtojis jistosj utilities. It is extended besed on TOG Japanese Vendors Council (TOG/JVC) Recommended Code Set Conversion Specification between Japanese EUC and Shift-JIS. See jistosj(1).

PCK (SJIS) to ibmj and ibmj to PCK (SJIS)

Conversion between PCK (SJIS) and ibmj is based on the IBM Kanji codebook (4th edition - September 1987), JIS X 0201, and JIS X 0208-1983. If you convert PCK (SJIS) to ibmj, all characters converted to JIS X 0212 by kana characters (0xa1 to 0xdf) and TOG Japanese Vendors Council (TOG/JVC) Recommended Code Set Conversion Specification between Japanese EUC and Shift-JIS are all converted to substitute character.

PCK to ibmj-EBCDIK and ibmj-EBCDIK to PCK

Conversion between PCK and ibmj-EBCDIK is based on the IBM Kanji codebook (4th edition - September 1987), JIS X 0201, and JIS X 0208-1983. If you convert PCK to ibmj-EBCDIK, all characters converted to JIS X 0212 by JIS X 0212 and TOG Japanese Vendors Council (TOG/JVC) Recommended Code Set Conversion Specification between Japanese EUC and Shift-JIS are all converted to substitute character.

ISO-2022-JP to UTF-8 and UTF-8 to ISO-2022-JP

Conversion between ISO-2022-JP and UTF-8 can be used to convert JIS X 0201, JIS X 0208, JIS X 0212, and user-defined and vendor-defined characters. If input data which does not have the corresponding character in the target code set is encountered, it will be replaced with the substitute character (ISO-2022-JP: ` ? ' (0x3f), UTF-8: U+FFFD (0xefbfbd)). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion.

UTF-8 to ISO-2022-JP.RFC1468

Conversion from UTF-8 to ISO-2022-JP.RFC1468 can be used to convert JIS X 0201 (except for figure character set for katakana) and JIS X 0208. If JIS X 0201 (figure character set for katakana), JIS X 0212, a user-defined, or a vendor-defined character is encountered among input data, it will be replaced with the substitute character ` ? ' (0x3f). If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops at the last point of successful conversion.

eucJP, PCK, UTF-8 to ibm930, ibm931, ibm939, ibm5026, ibm5035

Conversion from eucJP, PCK, or UTF-8 to ibm930,ibm931,ibm939, ibm5026,ibm5035 can be used to convert JIS X 0201, JIS X 0208, JIS X 0212, IBM extension characters, and user defined character. Input data which does not have corresponding character in the target code set is replaced with the substitute character. Since ibm931 does not support Kana characters in its single byte code set (SBCS), JIS X 0201 Kana characters are replaced with substitute characters in conversion to ibm931.

ibm930, ibm931, ibm939, ibm5026, or ibm5035 to eucJP, PCK, or UTF-8

Conversion from ibm930, ibm931, ibm939, ibm5026, or ibm5035 to eucJP, PCK, or UTF-8 can be used to convert SBCS/DBCS characters defined in input code set. Input data which does not have corresponding character in the target code set is replaced with the substitute character.

FujitsuJEF-ascii-code or FujitsuJEF-kana-code to eucJP, PCK, or UTF-8: eucJP, PCK, or UTF-8 to FujitsuJEF-ascii-code or FujitsuJEF-kana-code

Conversion from FujitsuJEF-ascii-code or FujitsuJEF-kana-code to eucJP, PCK, or UTF-8 and eucJP, PCK, or UTF-8 to FujitsuJEF-ascii-code or FujitsuJEF-kana-code can be used to convert JIS X 0201, JIS X 0208, and JIS X 0212 characters defined in input code set. Input data which does not have corresponding character in the target code set is replaced with the substitude character. Characters handled differently between JIS C 6226:1978 and JIS X 0208:1983 are converted based on code value. In conversion to FujitsuJEF-kana-code, JIS X 0201 Katakana characters are replaced with the substitute character.

FujitsuJEF-ascii-face or FujitsuJEF-kana-face to eucJP, PCK, or UTF-8: eucJP, PCK, UTF-8 to FujitsuJEF-ascii-face or FujitsuJEF-kana-face

Characters handled differently between JIS C 6226:1978 and JIS X 0208:1983 are converted based on character face. Other behavior is same as that of FujitsuJEF-ascii-code or FujitsuJEF-kana-code.

HitachiKEIS83 to eucJP, PCK, or UTF-8: eucJP, PCK, UTF-8 to HitachiKEIS83

Conversion from HitachiKEIS83 to eucJP, PCK, or UTF-8 and from eucJP, PCK, UTF-8 to HitachiKEIS83 can be used to convert JIS X 0201, JIS X 0208, and JIS X 0212 characters. Input data which does not have corresponding character in the target code set is replaced with the substitude character.

HitachiKEIS90 to eucJP, PCK, or UTF-8: eucJP, PCK, or UTF-8 to HitachiKEIS90

Behavior of conversion from HitachiKEIS90 to eucJP, PCK, or UTF-8 and from eucJP, PCK, or UTF-8 to HitachiKEIS90 is same as that of KEIS83 except that this conversion uses KEIS90 instead of KEIS83.

NECJIPS to eucJP, PCK, or UTF-8: eucJP, PCK, or UTF-8 to NECJIPS

Conversion from NECJIPS to eucJP, PCK, or UTF-8 and eucJP, PCK, or UTF-8 to NECJIPS can be used to convert JIS X 0201, JIS X 0208, and JIS X 0212 characters. Input data which does not have corresponding character in the target code set is replaced with the substitude character. EBCDIC Katakana (EK) code is used for single byte character set.

ms932 to "Unicode"s and "Unicode"s to ms932

Conversion between ms932 and UTF-8 is done using same way of mapping characters between the two codesets as Windows NT 3.51 does.

eucJP-ms to "Unicode"s and "Unicode"s to eucJP-ms

Conversion between eucJP and Unicodes is done using same way of mapping characters between the two codesets as Windows NT 3.51 does.

UTF-8 to UTF-8-ms932 and UTF-8-ms932 to UTF-8

This converts between "UTF-8" and "UTF-8-ms932", which are UTF-8 encoded Unicode converted from PCK, and that converted from ms932.

 

SEE ALSO

iconv(1), jistoeuc(1), jistosj(1), iconv(3C), iconv(5), iconv_unicode(5)


 

Index

NAME
DESCRIPTION
SEE ALSO

This document was created by man2html, using the manual pages.
Time: 02:39:51 GMT, October 02, 2010