Man page of jiskanji
jiskanji
Section: File Formats (5)
NAME
jiskanji, jiskanji7, JIS7 - A character encoding system (codeset) for Japanese
DESCRIPTION
JIS Kanji is a codeset that uses the JIS X0202 symbol extension method
for encoding the JIS X0208 and JIS X0201 character sets. There are two types
of JIS Kanji encoding: 7-bit JIS Kanji code and 8-bit JIS Kanji code.
7-bit JIS Kanji Code
In 7-bit JIS Kanji encoding, all character values are 7-bit bytes. Each code
value is interpreted according to the most recent in or out sequence that
precedes it, as follows:
Kanji in sequence (ESC $ B)
-
The code values following the Kanji in sequence (ESC $ B)
are treated as characters in the JIS X0208 Kanji character set.
Kanji out sequence (ESC ( B)
-
The code values following the Kanji out sequence (ESC ( B)
are treated as ASCII characters.
Supplementary Kanji in sequence (ESC $ ( D)
-
The code values following the supplementary Kanji in sequence (ESC $ ( D) are treated as characters in the JIS X0212 supplementary
Kanji character set.
User-Defined Character (UDC) in sequence (ESC $ ( 0)
-
The code values following the UDC in sequence (ESC $ ( 0)
are treated as characters in the vendor-defined or user-defined character
set.
Kana in (SO) and Kana out (SI) sequences
-
The code values following SO (0x0e) and preceding SI (0x0f) are treated as
characters in the JIS X0201 Katakana character set.
Katakana in sequence (ESC ( I)
-
Code values following the Katakana in sequence (ESC ( I)
are treated as characters in the JIS X0201 Katakana character set. In this
case, the Kanji out sequence is used to switch back to ASCII code.
-
The Katakana in and Kanji out sequences are an alternative to using
the Kana in and out sequences (SO/SI).
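The mode switching described above can be sketched as a small state machine. This is an illustrative sketch only, not the converter shipped with the operating system: the function name split_jis7, the mode labels, the initial ASCII state, and the rule that SI returns to the mode that was active before SO are all assumptions made for the example.

```python
ESC, SO, SI = 0x1B, 0x0E, 0x0F

# Escape sequences from the 7-bit list above, mapped to illustrative labels.
DESIGNATIONS = {
    b"\x1b$B": "JIS X0208",            # Kanji in
    b"\x1b(B": "ASCII",                # Kanji out
    b"\x1b$(D": "JIS X0212",           # Supplementary Kanji in
    b"\x1b$(0": "UDC",                 # User-Defined Character in
    b"\x1b(I": "JIS X0201 Katakana",   # Katakana in
}


def split_jis7(data: bytes):
    """Split a 7-bit JIS Kanji byte string into (mode, payload) runs."""
    runs = []
    mode = "ASCII"   # assumed initial state
    saved = None     # mode to restore when SI is seen (assumption)
    buf = bytearray()
    i = 0

    def flush():
        # Emit the bytes collected so far under the current mode.
        if buf:
            runs.append((mode, bytes(buf)))
            buf.clear()

    while i < len(data):
        b = data[i]
        if b == ESC:
            for seq, name in DESIGNATIONS.items():
                if data.startswith(seq, i):
                    flush()
                    mode = name
                    i += len(seq)
                    break
            else:
                raise ValueError(f"unrecognized escape sequence at offset {i}")
        elif b == SO:
            flush()
            saved, mode = mode, "JIS X0201 Katakana"
            i += 1
        elif b == SI:
            flush()
            mode = saved if saved is not None else "ASCII"
            saved = None
            i += 1
        else:
            buf.append(b)
            i += 1
    flush()
    return runs


if __name__ == "__main__":
    sample = b"A\x1b$BF|\x1b(BB\x0e1\x0fC"
    for run in split_jis7(sample):
        print(run)
```

The sketch only labels runs of bytes; mapping each run to actual characters would need the corresponding character set tables.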
8-bit JIS Kanji Code
In 8-bit JIS Kanji encoding, the JIS X0201 Katakana characters are represented
as 8-bit bytes. Using this form of encoding, in and out sequences have the
following effect:
Kanji in sequence (ESC $ B)
-
Code values following the Kanji in sequence (ESC $ B)
are treated as characters in the JIS X0208 Kanji character set.
Supplementary Kanji in sequence (ESC $ ( D)
-
Code values following the supplementary Kanji in sequence (ESC $ ( D) are treated as characters in the JIS X0212 supplementary
Kanji character set.
User-Defined Character (UDC) in sequence (ESC $ ( 0)
-
Code values following the UDC in sequence (ESC $ ( 0)
are treated as vendor-defined or user-defined characters.
Kanji out sequence (ESC ( B)
-
Code values following the Kanji out sequence (ESC ( B)
are treated as ASCII characters.
Kana in and out sequences (SO/SI)
-
These sequences are ignored.
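The difference between the two forms can be illustrated by rewriting SO/SI-delimited Katakana as 8-bit bytes. The following is a minimal sketch under stated assumptions: the function name kana_7_to_8 is hypothetical, and the sketch relies on the JIS X0201 layout in which the 7-bit Katakana values 0x21 to 0x5F correspond to the 8-bit values 0xA1 to 0xDF. It handles only SO and SI and passes every other byte, including escape sequences, through unchanged.

```python
SO, SI = 0x0E, 0x0F


def kana_7_to_8(data: bytes) -> bytes:
    """Rewrite SO ... SI Katakana runs as 8-bit JIS X0201 Katakana bytes."""
    out = bytearray()
    in_kana = False
    for b in data:
        if b == SO:
            in_kana = True            # drop the Kana in control
        elif b == SI:
            in_kana = False           # drop the Kana out control
        elif in_kana and 0x21 <= b <= 0x5F:
            out.append(b | 0x80)      # set the high bit: 0x21-0x5F -> 0xA1-0xDF
        else:
            out.append(b)
    return bytes(out)


if __name__ == "__main__":
    print(kana_7_to_8(b"A\x0e1\x0fB").hex())
```

Because the 8-bit form carries the Katakana distinction in the high bit of each byte, no SO/SI controls are needed in the output.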
Codeset Conversion
The following codeset converter pairs are available for converting Japanese
characters between jiskanji7 or JIS7 and other encoding formats. The
RESTRICTIONS section discusses some conversion limitations that apply to
these converters. Refer to iconv_intro(5) for an introduction to codeset
conversion. For more information about the other codeset in each pair, see
the reference page named in the list item.
deckanji_jiskanji7
or
deckanji_JIS7,
jiskanji7_deckanji
or
JIS7_deckanji
-
Converting from and to the DEC Kanji codeset:
deckanji(5).
eucJP_jiskanji7
or
eucJP_JIS7,
jiskanji7_eucJP
or
JIS7_eucJP
-
Converting from and to Japanese Extended UNIX Code:
eucJP(5).
eucTW_jiskanji7
or
eucTW_JIS7,
jiskanji7_eucTW
or
JIS7_eucTW
-
Converting from and to Taiwanese Extended UNIX Code:
eucTW(5).
sdeckanji_jiskanji7
or
sdeckanji_JIS7,
jiskanji7_sdeckanji
or
JIS7_sdeckanji
-
Converting from and to the Super DEC Kanji codeset:
sdeckanji(5).
SJIS_jiskanji7
or
SJIS_JIS7,
jiskanji7_SJIS
or
JIS7_SJIS
-
Converting from and to Shift JIS format:
SJIS(5).
-
The Shift JIS encoding format is identical to the encoding used in the
Microsoft code pages on PC systems. Therefore, you can use these converters
to convert Japanese characters between JIS Kanji and PC code page format.
For general information on how the operating system supports PC code pages, see
code_page(5).
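As a rough analogue of these converter pairs, the same kind of conversion can be tried with Python's built-in codecs. This is a sketch for illustration, not the iconv converters listed above: the codec names iso2022_jp, euc_jp, and shift_jis belong to Python, the helper name convert is hypothetical, and Python's iso2022_jp codec is assumed here to stand in for the Kanji in (ESC $ B) and Kanji out (ESC ( B) subset of 7-bit JIS Kanji code.

```python
def convert(data: bytes, src: str, dst: str) -> bytes:
    """Decode bytes from the src codec and re-encode them in the dst codec."""
    return data.decode(src).encode(dst)


if __name__ == "__main__":
    jis = "日本語".encode("iso2022_jp")   # escape-sequence based 7-bit form
    print(jis)
    print(convert(jis, "iso2022_jp", "euc_jp").hex())
    print(convert(jis, "iso2022_jp", "shift_jis").hex())
```

Going through a decoded string in the middle keeps the sketch short; a dedicated converter pair can map directly between the two byte encodings.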
RESTRICTIONS
The JIS Kanji codeset is not supported directly by a locale. It is supported
only through code conversion, for example with the iconv utility or with
Japanese terminal (tty) code conversion.
In the codeset naming conventions used by the iconv utility, the string JIS7
indicates 7-bit JIS Kanji code in which JIS X0201 Katakana follows a Katakana
in sequence (ESC ( I), and the string jiskanji7 indicates 7-bit JIS Kanji code
in which JIS X0201 Katakana is entered between Kana in (SO) and Kana out (SI)
sequences.
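The two naming conventions differ only in how a Katakana run is delimited. A minimal sketch of that difference follows, assuming the reading given above; the function name wrap_katakana is hypothetical, and the input is assumed to be already in 7-bit JIS X0201 Katakana values (0x21 to 0x5F).

```python
KATAKANA_IN = b"\x1b(I"   # ESC ( I
KANJI_OUT = b"\x1b(B"     # ESC ( B, switches back to ASCII
SO, SI = b"\x0e", b"\x0f"


def wrap_katakana(kana7: bytes, codeset: str) -> bytes:
    """Delimit a 7-bit Katakana run in the style of the named codeset."""
    if any(not 0x21 <= b <= 0x5F for b in kana7):
        raise ValueError("expected 7-bit JIS X0201 Katakana values")
    if codeset == "JIS7":
        return KATAKANA_IN + kana7 + KANJI_OUT
    if codeset == "jiskanji7":
        return SO + kana7 + SI
    raise ValueError(f"unknown codeset name: {codeset}")


if __name__ == "__main__":
    print(wrap_katakana(b"1", "JIS7"))
    print(wrap_katakana(b"1", "jiskanji7"))
```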
The following sequences are valid for input to the
iconv
utility but are not generated when code is converted to
jiskanji7:
Kanji in (ESC $ @)
Kanji in (ESC & @ ESC $ B)
Kanji in (ESC $ ( B)
Kanji in (ESC $ ( @)
Supplementary Kanji in (ESC $ D)
Kana in (ESC ( J)
Kana in (ESC ( H)
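A reader of such input has to recognize sequences of different lengths, and one of them (ESC & @ ESC $ B) contains another escape within it. The sketch below shows one way to do that with a longest-match lookup. It is an assumption-laden illustration: the names INPUT_ONLY and match_input_only are hypothetical, and the labels are copied from the list above as written.

```python
# Input-only sequences from the list above, with the labels used there.
INPUT_ONLY = {
    b"\x1b$@": "Kanji in",
    b"\x1b&@\x1b$B": "Kanji in",
    b"\x1b$(B": "Kanji in",
    b"\x1b$(@": "Kanji in",
    b"\x1b$D": "Supplementary Kanji in",
    b"\x1b(J": "Kana in",
    b"\x1b(H": "Kana in",
}


def match_input_only(data: bytes, pos: int = 0):
    """Return (label, length) if an input-only sequence starts at pos, else None."""
    # Try longer sequences first so a compound sequence is matched whole.
    for seq in sorted(INPUT_ONLY, key=len, reverse=True):
        if data.startswith(seq, pos):
            return INPUT_ONLY[seq], len(seq)
    return None


if __name__ == "__main__":
    print(match_input_only(b"\x1b&@\x1b$BF|"))
    print(match_input_only(b"\x1b$BF|"))
```

The second call returns None because ESC $ B is the standard Kanji in sequence, which is not part of the input-only list.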
In the code naming conventions of the Japanese terminal, the string
jis7
indicates 7-bit JIS Kanji code and the string
jis8
indicates 8-bit JIS Kanji code. When the terminal code is set to
jis7, the Kana in and out sequences (SO/SI) are used to represent JIS X0201 Katakana characters.
SEE ALSO
Commands:
locale(1)
Others:
ascii(5),
code_page(5),
deckanji(5),
eucJP(5),
i18n_intro(5),
i18n_printing(5),
iconv_intro(5),
iso2022jp(5),
Japanese(5),
l10n_intro(5),
sdeckanji(5),
shiftjis(5)
This document was created by
man2html,
using the manual pages.
Time: 02:43:11 GMT, October 02, 2010