Content-type: text/html Man page of asort

asort

Section: User Commands (1)
Index Return to Main Contents
 

NAME

asort - Sorts or merges files and supports multiple collating weight sequences  

SYNOPSIS

asort [-m] [-o output_file] [-Abdfinruv] [-Ccollate_sequence] [-k keydef]... [-t character] [-T directory] [-y][kilobytes] [-z record_size]... file...

asort -c [-u] [-Abdfinruv] [-Ccollate_sequence] [-k keydef]... [-t character] [-T directory] [-y][kilobytes] [-z record_size]... file...

The following syntax is maintained for backward compatibility but may be withdrawn in a future release: asort [-Abcdfimnruv] [-Ccollate_sequence] [-o output_file] [-t character] [-T directory] [-y][kilobytes] [-z record_size] [+fskip][.cskip] [-fskip][.cskip] [-bdfinr]... file...


 

OPTIONS

The asort command includes the same options as the sort command (see sort(1)) in addition to the following options: Specifies the collating weight sequence to be used in sorting the data files. When this option is specified, the asort command does not use the collating table from the locale database. Instead, the command uses a set of special system and user collating tables to determine the collating weights of characters, including user-defined characters (UDCs).

The collate_sequence argument can be in long form (for example, "Pinyin Radical Stroke") or short form (for example, prs). The codeset of the locale determines which collation weight names can be specified for collate_sequence. The following list specifies the long and short collation weight names that are valid for supported codesets.
For DEC Hanzi:
Pinyon (or p) Radical (or r) Stroke (or s) For DEC Hanyu, Taiwanese EUC, and BIG-5:
Phonetic (or p) Radical (or r) Stroke (or s) Uses a breadth-first sorting mechanism instead of the default depth-first mechanism to sort the input data. To have any effect, the -v option must be used together with the -C option.
 

DESCRIPTION

The asort command sorts lines in its input files and writes the result to standard output. The asort command is similar to the sort command. See the sort(1) reference page for information about features the two commands have in common.

The asort command provides additional features for processing multiple collating weight sequences used with Asian languages, such as Chinese. For example, pinyon (p), stroke (s), and radical (r) are three dimensions along which characters can be ordered in Simplified Chinese. The -C option allows users to specify the priority level that these dimensions have during sorting. For example, -C srp specifies that characters should be sorted first by stroke, then by radical, then by pinyon. The specified sequence is applied to user-defined characters (UDCs) as well as to standard characters.

When the -C option is specified, the default behavior of the asort command is to use a depth-first sorting mechanism to sort the input files. With the depth-first mechanism, pairs of multibyte characters in a sort field are compared by exhausting all the specified collating weights and/or internal codes one at a time until the collating order is resolved. Only when two characters are identical is the next pair of characters compared. The depth-first sorting mechanism is also called character sorting.

However, the asort command provides the -v option to use the Asian VMS-like breadth-first sorting mechanism. With the breadth-first mechanism, pairs of multibyte characters in a sort field are compared using the first collating weight for all the characters in the sort field first. Only when two sets of data in a sort field are computed to have the same collating order are succeeding collating weights used for resolving the collating order. The breadth-first sorting mechanism is sometimes called string sorting.
 

NOTES

Currently, the asort command is supported for use only with Chinese codesets.
 

EXIT STATUS

The asort command returns the following exit values: All input files were output successfully, or -c was specified and the input file was correctly sorted. If -c was specified, the file was not ordered as specified, or if the -c and -u options were both specified, two input lines were found with equal keys. An error occurred.
 

EXAMPLES

Unless stated otherwise, the following examples assume the locale setting is zh_TW.dechanyu: To perform character sorting first by stroke and then by radical, enter: asort -C"Stroke Radical" names

This command displays the lines in names sorted in ascending order according to the number of strokes in characters. If the number of strokes happen to be the same for two characters, the radicals of the characters determine how the characters are ordered.An alternative short form of the same command is as follows: asort -Csr names To perform string sorting first by stroke and then by radical in a way similar to the sort command available on an Asian VMS system, enter: asort -v -C"Stroke Radical" names

 

SEE ALSO

Commands: sort(1)

Functions: setlocale(3)

Files: locale(4)

Others: Chinese(5), i18n_intro(5)


 

Index

NAME
SYNOPSIS
OPTIONS
DESCRIPTION
NOTES
EXIT STATUS
EXAMPLES
SEE ALSO

This document was created by man2html, using the manual pages.
Time: 02:43:00 GMT, October 02, 2010