Content-type: text/html
Man page of iconv_ja
iconv_ja
Section: Standards, Environments, and Macros (5)
Updated: 10 Jan 2003
Index
Return to Main Contents
NAME
iconv_ja - code set conversions in ja locale
DESCRIPTION
The following code set conversions are supported:
Code Set Conversions Supported |
|
|
Source Code | Target Code
|
|
eucJP |
eucJP |
eucJP |
eucJP |
eucJP |
eucJP |
eucJP |
eucJP |
eucJP |
eucJP |
SJIS |
SJIS |
SJIS |
SJIS |
SJIS |
PCK |
PCK |
PCK |
PCK |
PCK |
PCK |
PCK |
PCK |
ISO-2022-JP |
ISO-2022-JP |
ISO-2022-JP |
ISO-2022-JP |
"Unicode"s |
"Unicode"s |
"Unicode"s |
UTF-8 |
UTF-8 |
UTF-8-Java |
UTF-8-Java |
JIS7 |
jis |
jis |
jis |
ibmj |
ibmj |
ibmj |
ibmj-EBCDIK |
ibmj-EBCDIK |
|
Code Set Conversions Supported |
|
|
Source Code | Target Code
|
|
eucJP |
eucJP |
eucJP |
eucJP |
eucJP |
eucJP |
eucJP |
eucJP |
eucJP |
eucJP |
eucJP |
eucJP |
PCK |
PCK |
PCK |
PCK |
PCK |
PCK |
PCK |
PCK |
PCK |
PCK |
PCK |
PCK |
UTF-8 |
UTF-8 |
UTF-8 |
UTF-8 |
UTF-8 |
"Unicode"s |
"Unicode"s |
UTF-8 |
UTF-8 |
UTF-8 |
UTF-8 |
UTF-8 |
UTF-8 |
UTF-8 |
UTF-8 |
UTF-8-ms932 |
ibm930 |
ibm930 |
ibm930 |
ibm931 |
ibm931 |
ibm931 |
ibm939 |
ibm939 |
ibm939 |
ibm5026 |
ibm5026 |
ibm5026 |
ibm5035 |
ibm5035 |
ibm5035 |
FujitsuJEF-ascii-code |
FujitsuJEF-ascii-code |
FujitsuJEF-ascii-code |
FujitsuJEF-kana-code |
FujitsuJEF-kana-code |
FujitsuJEF-kana-code |
FujitsuJEF-ascii-face |
FujitsuJEF-ascii-face |
FujitsuJEF-ascii-face |
|
Code Set Conversions Supported |
|
|
Source Code | Target Code
|
|
FujitsuJEF-kana-face |
FujitsuJEF-kana-face |
FujitsuJEF-kana-face |
HitachiKEIS83 |
HitachiKEIS83 |
HitachiKEIS83 |
HitachiKEIS90 |
HitachiKEIS90 |
HitachiKEIS90UTF-8 |
NECJIPS |
NECJIPS |
NECJIPS |
ms932 |
eucJP-ms |
|
The descriptions of each code sets in the above table are as follows:
Description of Supported Code Sets |
|
|
Codeset | Description
|
|
eucJP |
PCK |
SJIS |
ISO-2022-JP |
|
Description of Supported Code Sets |
|
|
Codeset | Description
|
|
JIS7 |
ISO-2022-JP.RFC1468 |
jis |
ibmj |
ibmj-EBCDIK |
"Unicode"s |
|
|
|
|
UTF-8 |
UTF-8-Java |
|
Description of Supported Code Sets |
|
|
Codeset | Description
|
|
ibm930 |
ibm931 |
ibm939 |
ibm5026 |
ibm5035 |
FujitsuJEF-ascii-code |
FujitsuJEF-ascii-face |
FujitsuJEF-kana-code |
FujitsuJEF-kana-face |
|
Description of Supported Code Sets |
|
|
Codeset | Description
|
|
HitachiKEIS8 |
HitachiKEIS90 |
NECJIPS |
ms932 |
UTF-8-ms932 |
|
Conversions are performed as described below. For all conversions, if
the source code set includes characters not included in the target code set,
conversion and output for all such characters will be done using a substitute
character.
eucJP to PCK (SJIS) and PCK (SJIS) to eucJP
-
Conversion between eucJP and PCK (SJIS) can be used to convert
JIS X 0201, JIS X 0208, JIS X 0212, and user-defined and vendor-defined characters
based on TOG Japanese Vendors Council (TOG/JVC) Recommended Code
Set Conversion Specification between Japanese EUC and Shift-JIS.
If input data which does not belong to the source code set is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops
at the last point of successful conversion.
eucJP to ISO-2022-JP(JIS7) and ISO-2022-JP(JIS7)
to eucJP
-
Conversion between eucJP and ISO-2022-JP(JIS7)
can be used to convert JIS X 0201, JIS X 0208 and JIS X 0212. If input data
which does not belong to the source code set is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops
at the last point of successful conversion.
eucJP to ISO-2022-JP.RFC1468
-
Conversion from eucJP to ISO-2022-JP.RFC1468 can
be used to convert JIS X 0201 (except for figure character set for katakana)
and JIS X 0208. If JIS X 0201 (figure character set for katakana), JIS X
0212, a user-defined, or a vendor-defined character is encountered among input
data, it will be replaced with the substitute character ` ? ' (0x3f). If input
data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops
at the last point of successful conversion.
eucJP to jis and jis to eucJP
-
Conversion between eucJP and jis is provided for the compatibility
with ujtojis7() and jis7touj()
libraries, and euctojis and jistoeuc utilities. It is
extended to handle JIS X 0212. See jistoeuc(1).
eucJP to "Unicode"s and "Unicode"s
to eucJP
-
Conversion between eucJP and UTF-8 can be used to convert
JIS X 0201, JIS X 0208, JIS X 0212, a user-defined, and a vendor-defined
character. If input data which does not have the corresponding character
in the target code set is encountered, it will be replaced with the substitute
character (eucJP: ` ? ' (0x3f), UTF-8: U+FFFD (0xefbfbd)). If input
data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops
at the last point of successful conversion.
eucJP to UTF-8-Java and UTF-8-Java
to eucJP
-
Conversion between eucJP and UTF-8-Java can be
used to convert JIS X 0201, JIS X 0208, and JIS X 0212. If a user-defined
or vendor-defined character is encountered among input data, it will be replaced
with the substitute character (eucJP: ` ? ' (0x3f), UTF-8: U+FFFD (0xefbfbd)).
If input data which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops
at the last point of successful conversion.
eucJP to ibmj and ibmj to eucJP
-
Conversion between eucJP and ibmj is based on the IBM Kanji
codebook (4th edition - September 1987), JIS X 0201, and JIS X 0208-1983.
If you convert eucJP to ibmj, JISX 0201 and JIS X 0201 are all converted
to substitute character.
eucJP to ibmj-EBCDIK and ibmj-EBCDIK to eucJP
-
Conversion between eucJP and ibmj-EBCDIK is based on the IBM
Kanji codebook (4th edition - September 1987), JIS X 0201, and JIS
X 0208-1983. If you convert eucJP to ibmj-EBCDIK, JISX 0201 and JIS
X 0201 that have not correspondence characters with ibmj-EBCDIKare all converted
to substitute character.
PCK (SJIS) to ISO-2022-JP and ISO-2022-JP
to PCK (SJIS)
-
Conversion between PCK (SJIS) and ISO-2022-JP
can be used to convert JIS X 0201, JIS X 0208, JIS X 0212, and user-defined
and vendor-defined characters based on TOG Japanese Vendors Council
(TOG/JVC) Recommended Code Set Conversion Specification between Japanese EUC
and Shift-JIS. If input data which does not belong to
the source code set is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops
at the last point of successful conversion.
PCK (SJIS) to ISO-2022-JP.RFC1468
-
Conversion from PCK (SJIS) to ISO-2022-JP.RFC1468
can be used to convert JIS X 0201 (except for figure character set for katakana)
and JIS X 0208. If JIS X 0201 (figure character set for katakana), a user-defined,
or a vendor-defined character is encountered among input data, it will be
replaced with the substitute character ` ? ' (0x3f). If input data which does
not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops
at the last point of successful conversion.
PCK (SJIS) to "Unicode"s and "Unicode"s
to PCK (SJIS)
-
Conversion between PCK (SJIS) and UTF-8 can be used
to convert JIS X 0201, JIS X 0208, a user-defined, and a vendor-defined character.
If input data which does not have the corresponding character in the target
code set is encountered, it will be replaced with the substitute character
(PCK: ` ? ' (0x3f), UTF-8: U+FFFD (0xefbfbd)). If input data which does
not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops
at the last point of successful conversion.
PCK (SJIS) to UTF-8-Java and UTF-8-Java
to PCK (SJIS)
-
Conversion between PCK (SJIS) and UTF-8-Java can
be used to convert JIS X 0201 and JIS X 0208. If a user-defined or vendor-defined
character is encountered among input data, it will be replaced with the substitute
character (PCK: ` ? ' (0x3f), UTF-8: U+FFFD (0xefbfbd)). If input data
which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops
at the last point of successful conversion.
PCK (SJIS) to jis and jis to PCK (SJIS)
-
Conversion between PCK (SJIS) and jis is provided for the
compatibility with sjtojis7() and jis7tosj()
libraries , and sjtojis jistosj utilities.
It is extended besed on TOG Japanese Vendors Council (TOG/JVC)
Recommended Code Set Conversion Specification between Japanese EUC and Shift-JIS. See jistosj(1).
PCK (SJIS) to ibmj and ibmj to PCK (SJIS)
-
Conversion between PCK (SJIS) and ibmj is based on the IBM
Kanji codebook (4th edition - September 1987), JIS X 0201, and JIS
X 0208-1983. If you convert PCK (SJIS) to ibmj, all characters converted
to JIS X 0212 by kana characters (0xa1 to 0xdf) and TOG Japanese
Vendors Council (TOG/JVC) Recommended Code Set Conversion Specification between
Japanese EUC and Shift-JIS are all converted to substitute
character.
PCK to ibmj-EBCDIK and ibmj-EBCDIK to PCK
-
Conversion between PCK and ibmj-EBCDIK is based on the IBM
Kanji codebook (4th edition - September 1987), JIS X 0201, and JIS
X 0208-1983. If you convert PCK to ibmj-EBCDIK, all characters converted
to JIS X 0212 by JIS X 0212 and TOG Japanese Vendors Council
(TOG/JVC) Recommended Code Set Conversion Specification between Japanese EUC
and Shift-JIS are all converted to substitute character.
ISO-2022-JP to UTF-8 and UTF-8
to ISO-2022-JP
-
Conversion between ISO-2022-JP and UTF-8
can be used to convert JIS X 0201, JIS X 0208, JIS X 0212, and user-defined
and vendor-defined characters. If input data which does not have the corresponding
character in the target code set is encountered, it will be replaced with
the substitute character (ISO-2022-JP: ` ? ' (0x3f), UTF-8:
U+FFFD (0xefbfbd)). If input data which does not belong to these code sets
is encountered, iconv(3C) will
return EILSEQ for errno. iconv(1) stops at the last point of
successful conversion.
UTF-8 to ISO-2022-JP.RFC1468
-
Conversion from UTF-8 to ISO-2022-JP.RFC1468
can be used to convert JIS X 0201 (except for figure character set for katakana)
and JIS X 0208. If JIS X 0201 (figure character set for katakana), JIS X 0212,
a user-defined, or a vendor-defined character is encountered among input data,
it will be replaced with the substitute character ` ? ' (0x3f). If input data
which does not belong to these code sets is encountered, iconv(3C) will return EILSEQ for errno. iconv(1) stops
at the last point of successful conversion.
eucJP, PCK, UTF-8 to ibm930, ibm931, ibm939, ibm5026,
ibm5035
-
Conversion from eucJP, PCK, or UTF-8 to ibm930,ibm931,ibm939,
ibm5026,ibm5035 can be used to convert JIS X 0201, JIS X 0208, JIS X 0212,
IBM extension characters, and user defined character. Input data which does
not have corresponding character in the target code set is replaced with the
substitute character. Since ibm931 does not support Kana characters in its
single byte code set (SBCS), JIS X 0201 Kana characters are replaced with
substitute characters in conversion to ibm931.
ibm930, ibm931, ibm939, ibm5026, or ibm5035 to eucJP,
PCK, or UTF-8
-
Conversion from ibm930, ibm931, ibm939, ibm5026, or ibm5035
to eucJP, PCK, or UTF-8 can be used to convert SBCS/DBCS characters
defined in input code set. Input data which does not have corresponding character
in the target code set is replaced with the substitute character.
FujitsuJEF-ascii-code or FujitsuJEF-kana-code to eucJP,
PCK, or UTF-8: eucJP, PCK, or UTF-8 to FujitsuJEF-ascii-code or FujitsuJEF-kana-code
-
Conversion from FujitsuJEF-ascii-code or FujitsuJEF-kana-code
to eucJP, PCK, or UTF-8 and eucJP, PCK, or UTF-8 to FujitsuJEF-ascii-code
or FujitsuJEF-kana-code can be used to convert JIS X 0201, JIS X 0208, and
JIS X 0212 characters defined in input code set. Input data which does not
have corresponding character in the target code set is replaced with the substitude
character. Characters handled differently between JIS C 6226:1978 and JIS
X 0208:1983 are converted based on code value. In conversion to FujitsuJEF-kana-code,
JIS X 0201 Katakana characters are replaced with the substitute character.
FujitsuJEF-ascii-face or FujitsuJEF-kana-face to eucJP,
PCK, or UTF-8: eucJP, PCK, UTF-8 to FujitsuJEF-ascii-face or FujitsuJEF-kana-face
-
Characters handled differently between JIS C 6226:1978 and
JIS X 0208:1983 are converted based on character face. Other behavior is same
as that of FujitsuJEF-ascii-code or FujitsuJEF-kana-code.
HitachiKEIS83 to eucJP, PCK, or UTF-8: eucJP, PCK, UTF-8
to HitachiKEIS83
-
Conversion from HitachiKEIS83 to eucJP, PCK, or UTF-8 and
from eucJP, PCK, UTF-8 to HitachiKEIS83 can be used to convert JIS X 0201,
JIS X 0208, and JIS X 0212 characters. Input data which does not have corresponding
character in the target code set is replaced with the substitude character.
HitachiKEIS90 to eucJP, PCK, or UTF-8: eucJP, PCK, or
UTF-8 to HitachiKEIS90
-
Behavior of conversion from HitachiKEIS90 to eucJP, PCK, or
UTF-8 and from eucJP, PCK, or UTF-8 to HitachiKEIS90 is same as that of KEIS83
except that this conversion uses KEIS90 instead of KEIS83.
NECJIPS to eucJP, PCK, or UTF-8: eucJP, PCK, or UTF-8
to NECJIPS
-
Conversion from NECJIPS to eucJP, PCK, or UTF-8 and eucJP,
PCK, or UTF-8 to NECJIPS can be used to convert JIS X 0201, JIS X 0208, and
JIS X 0212 characters. Input data which does not have corresponding character
in the target code set is replaced with the substitude character. EBCDIC Katakana
(EK) code is used for single byte character set.
ms932 to "Unicode"s and "Unicode"s
to ms932
-
Conversion between ms932 and UTF-8 is done using same way
of mapping characters between the two codesets as Windows NT 3.51 does.
eucJP-ms to "Unicode"s and "Unicode"s
to eucJP-ms
-
Conversion between eucJP and Unicodes is done using same way
of mapping characters between the two codesets as Windows NT 3.51 does.
UTF-8 to UTF-8-ms932 and UTF-8-ms932
to UTF-8
-
This converts between "UTF-8" and "UTF-8-ms932",
which are UTF-8 encoded Unicode converted from PCK, and that converted from
ms932.
SEE ALSO
iconv(1), jistoeuc(1), jistosj(1), iconv(3C), iconv(5), iconv_unicode(5)
Index
- NAME
-
- DESCRIPTION
-
- SEE ALSO
-
This document was created by
man2html,
using the manual pages.
Time: 02:39:51 GMT, October 02, 2010