windows-nt/Source/XPSP1/NT/sdktools/mc/mc.txt

436 lines
20 KiB
Plaintext
Raw Normal View History

2020-09-26 03:20:57 -05:00
This document describes how messages will be input, stored and formatted
by a Win32 application.
1. Message Input
Messages are input as ASCII text in a text file. The format of this
file supports specifying multiple versions of the same message text,
one for each language supported. It also supports automatic assignment
of code numbers to each message, along with the generation of a C
language include file for use by the application for accessing the
messages using symbolic constants. The purpose of the message text
file is to define all of the messages needed by an application, in a
format that makes it easy to support multiple languages with the same
image file.
Message text files are converted into binary resource files by the
MC program. These binary resource files are then input to the RC
compiler which will put them in the resource table for an
application or DLL.
The format of the message text file (default extension is .mc).
Basic syntax is Keyword=Value, where spaces around the equal sign
are ignored, and the value is delimited by white space from the next
keyword=value pair. Case is ignored when comparing against keyword
names. The value portion can either be a numeric integer constant,
{NUMBER}, using C syntax; a symbol name, {NAME}, that follows the
rules for C identifier names; or a file name that follows the
rules for the FAT file system (8 characters or less, no periods).
Comment lines are allowed in the message text file. The comment
syntax is the same as for WIN.INI, namely a semicolon begins a
comment which is terminated by the end of the line. Comments that
exist by themselves on a line are copied as is to the output .h
file.
The overall structure of a message text file consists of a header
section which contains zero or more of the following keywords:
MessageIdTypedef={NAME}
SeverityNames=({NAME}={NUMBER}:{NAME})
FacilityNames=({NAME}={NUMBER}:{NAME})
LanguageNames=({NAME}={NUMBER}:{FILENAME}[:{CODEPAGE}])
OutputBase={NUMBER}
These keywords have the following meaning:
MessageIdTypedef - gives a symbolic name that is output as the
typedef name for each numeric MessageId value. The default
value for this is NULL, which means there will be no type
cast output when defining symbolic names for a MessageId.
SeverityNames - defines the set of names that are allowed as the
value of the Severity keyword in the message definition.
The set is delimited by left and right parenthesis.
Associated with each severity name is a number that, when
shifted left by 30, gives the bit pattern to logically OR
with the Facility value and MessageId to come up with the
full 32-bit message code. The default value of this keyword
is:
SeverityNames=(Success=0x0
Informational=0x1
Warning=0x2
Error=0x3
)
Severity values occupy the high two bits of a 32-bit message
code. Any severity value that does not fit in two bits is
an error. The severity codes can be given symbolic names
by following each value with :{NAME}
FacilityNames - defines the set of names that are allowed as the
value of the Facility keyword in the message definition.
The set is delimited by left and right parenthesis.
Associated with each facility name is a number that, when
shift it left by 16 bits, gives the bit pattern to logically
OR with the Severity value and MessageId to come up with the
full 32-bit message code. The default value of this keyword
is:
FacilityNames=(System=0x0FF
Application=0xFFF
)
Facility codes occupy the low order 12 bits of the high
order 16-bits of a 32-bit message code. Any facility code
that does not fit in 12 bits is an error. This allows for
4096 facility codes. The first 256 are reserved for
use by the system software.
The facility codes can be given symbolic names by following
each value with :{NAME}
LanguageNames - defines the set of names that are allowed as the
value of the Language keyword in the message definition.
The set is delimited by left and right parenthesis.
Associated with each language name is a number and a file
name that will be used to name the binary output file that
will contain all of the message text for that language. The
number corresponds to the Language Id tag to use in the
resource table. The number is separate from the file name
with a colon. The initial value of this keyword is:
LanguageNames=(English=1:MSG00001)
Any new names that an application defines in its .mc file
which don't override any of the builtin names will be added
to the list of valid languages. This allows an application
to support private languages with descriptive names.
If the message file contains messages for languages that
must be represented in separate codepages, the optional
fourth (4th) parameter may be used to specify the codepage
that the messages for that Language's messages are in.
LanguageNames=(Japanese=411:MSG00411:932)
The default codepage used, if the codepage is not explicitly
specified, is the OEM Codepage of the system.
OutputBase - sets the output radix for the constants output to C
header file for messages. (It does not set the radix for the
SEVERITY and FACILITY constants. These default to HEX and can be
output in decimal using the -d switch.) If present, Outputbase
overwrites the -d switch for message constants in the header file.
Legal values are 10 and 16.
The OutputBase keyword is legal both in the header section and in the
message definition section of the input file. The OutputBase can be
changed as often as desired.
Following the header section are zero or more message definitions.
Each message definition begins with one or more of the following
keywords.
MessageId={|{NUMBER}|+{NUMBER}}
Severity={SEVERITY_NAME}
Facility={FACILITY_NAME}
SymbolicName={NAME}
The MessageId keyword is required to mark the beginning of the
message definition, although its value is optional. If no value is
specified, then the value used will be the last value used for the
facility, plus one. If the value is specified as +{NUMBER} then
the value used will be the last value used for the facility, plus
the number after the plus sign. Otherwise if a numeric value is
given, that will be value used. Any MessageId value that does not
fit in 16 bits is an error.
Severity and Facility are optional keywords that can specify
additional bits to OR into the final 32-bit message code. If either
of these are not specified they default to the value last specified
for a message definition. The initial values of these prior to
processing the first message definition is:
Severity=Success
Facility=Application
The value associated with these keywords must match one of the names
given to the FacilityNames and SeverityNames keywords. The SymbolicName
keyword allows the ISV to associate a C symbolic constant name with the
final 32-bit message code that is a result of ORing together the
MessageId, Severity and Facility bits. The constant definition is
output to the generated .h file with the following format:
//
// {MESSAGETEXT}
//
#define CONSTANT_SYMBOL_NAME ((MessageIdTypedef) 0x12345678)
where the comment before the definition is a copy of the message
text for the first language specified in the message definition.
The CONSTANT_SYMBOL_NAME is the value of the SymbolicName keyword.
The MessageIdTypedef is not output if it is NULL, the default value.
After the message definition keywords, comes one or more message text
definitions. Each message text definition begins with the Language
keyword that identifies which binary output file this message text
is to be output to. Beginning on the very next line is the first
line of the message text. The message text is terminated by a line
containing a single period at the beginning of the line, immediately
followed by a new line. No spaces allowed around keyword. Within
the message text, blank lines and white space are preserved as part
of the message.
Language={LANGUAGE_NAME}
{MESSAGETEXT}
.
Within the message text, several escape sequences are supported for
dynamically formatting the message. The percent sign character (%)
begins all escape sequences.
%0 - This terminates a message text line without a trailing
newline. This can be used to build up long lines or to
terminate the message itself without a trailing newline,
which is useful for prompt messages.
%n!printf format string! - This identifies an insert. The
value of n can be between 1 and 99. The printf format
string must be bracketed by exclamation marks. It is
optional and defaults to !s! if not specified.
The printf format string can contain the * specifier for
either the precision or width components, and if so, they
will consume inserts %n+1 and %n+2 for their values at run
time. MC will print a warning message if an explicit
reference is made to these inserts elsewhere in the message
text.
Inserts must reference a parameter passed to the FormatMessage API
call. It will return an error if a message refers to an insert that
was not passed to the FormatMessage API call.
Any other character following a percent sign, other than a digit will
be formatted in the output message without the percent sign. Some
examples:
%% - will output a single percent sign in the formatted message text.
%n - will output a hard line break when it occurs at the end of a
a line. Useful when FormatMessage is supplying normal line
breaks so the message fits in a certain width.
%r - will output a hard carriage return, without a trailing newline.
%b - will output a space in the formatted message text. This
can be used to insure there are the appropriate number of
trailing spaces in a message text line.
%t - will output a tab in the formatted message text.
%. - will output a single period in the formatted message text.
This can be used to get a single period at the beginning of
a line without terminating the message text definition.
%! - will output a single exclamation point in the formatted
message text. This can be used to get an exclamation point
immediately after an insert without it being mistaken for
the beginning of a printf format string.
Unicode support is not understood yet. If the input file is ASCII
text, do we need an escape sequence to allow input of Unicode values?
Or do we just let them use DBCS in the text file, assuming they have
a text editor that can do this.
2. Message Compiler (MC)
This program converts .mc message text files into binary files
suitable for inclusion into a .RC file by the resource compiler.
Command line syntax:
MC [-v] [-w] [-s] [-d] [-n] [-h DirSpec] [-e extension] [-r DirSpec] filename[.mc] ...
where:
-v - generates verbose output to stderr.
-w - generates a warning message whenever an insert escape
sequence is seen that is a superset of the type supported
by OS/2 mkmsgf (i.e. anything other than %0 and %n).
Useful for converting old OS/2 message files to this
format.
-s - Add an extra line to the beginning of each message that is
the symbolic name associated with the message id.
-d - Output SEVERITY and FACILTY constants in decimal. Set the
initial output radix for messages to decimal.
-n - Terminates all strings with null's in the message tables.
-e - Specify the extension for the header file. From 1 - 3 chars.
-h DirSpec - specifies the target directory of the generated
.h file. The file name is the name of the .mc file with a
.h extension.
-r DirSpec - specifies the target directory of the generated
.rc file. The file name is the name of the .mc file with a
.rc extension.
filename.mc - specifes one or more input message files that
will be compiled into one or more binary resource
files, one for each language that the message
files contain message text for.
The message compiler reads the .mc file and generates a .h file
containing all the symbolic name definitions. For each LanguageId
that was used to specify message text, it outputs a binary file
containing a message table resource. It also outputs a single .rc
file that contains the appropriate RC syntax to include each binary
file output as a resource with the appropriate name and type ids.
3. Message Win32 API Calls
DWORD
APIENTRY
FormatMessage(
DWORD dwFlags,
LPVOID lpSource,
DWORD dwMessageId,
DWORD dwLanguageId,
LPSTR lpBuffer,
DWORD nSize,
va_list Arguments
)
Routine Description:
This function formats a message string. Input to this function is a
message definition. It can come from a buffer passed into this
function. It can come from a message table resource in a module
already loaded. Or the caller can ask this function to search the
system message table resource(s) for the message. This function
finds the message definition based on the Message Id and the
Language Id and copies the message text to the output buffer,
processing any imbedded insert sequences if requested.
Arguments:
dwFlags - Specifies options to the formatting process along with how
to interpret the lpSource parameter. The low order 16bits of
this parameter are the maximum width of a line, in characters.
Possible values are:
FORMAT_MESSAGE_ALLOCATE_BUFFER - the lpBuffer is a PVOID * and
nSize is the minimum size to allocate. This function will
then allocate a buffer large enought to hold the formatted
message and store the pointer to the buffer in the location
pointed to by lpBuffer. Caller should free the buffer
with LocalFree when they are done using it.
FORMAT_MESSAGE_IGNORE_INSERTS - insert sequences in the message
definition will be ignored and passed through to the output
buffer as is. Useful for fetching a message for later
formatting. If this flag is set, the lpArguments parameter
is ignored.
FORMAT_MESSAGE_FROM_STRING - lpSource is a pointer to a null
terminated message definition. It can contain insert
sequences just as the message text in the .mc file can.
FORMAT_MESSAGE_FROM_HMODULE - lpSource is a module handle that
contains the message table resource(s) to search. If this
handle is NULL, then the current process's application
image file will be searched.
FORMAT_MESSAGE_FROM_SYSTEM - If the requested message was not
found in lpSource or if lpSource was not examined (i.e. neither
of the preceeding two flags was specified), then this function
will search the system message table resource(s).
FORMAT_MESSAGE_ARGUMENT_ARRAY - If set, specifies that the passed
Arguments parameter is NOT a va_list structure but instead is
just a pointer to an array of 32-bit values that represent
the arguments.
FORMAT_MESSAGE_MAX_WIDTH_MASK - The low order 8 bits specify the
maximum width of each line formatted into the output buffer.
A maximum width of zero, means that no restrictions are
placed on the width, and only the line breaks in the message
definition will be placed in the output buffer. If a non-zero
value is specified, then line breaks in the message definition
text are ignored, and instead line breaks are calculated based
on the maximum width, with white space delimited strings never
being split across a line break. Hard coded line breaks in
the message definition text, that are identified by the %n
escape sequence, are always output to the output buffer.
If the width specified is FORMAT_MESSAGE_MAX_WIDTH_MASK, then
line breaks in the message file are ignored and only hard coded
line breaks are kept and none are generated.
lpSource - specifies where to retrieve the message definition from.
The type of this parameter depends upon the settings in the
dwFlags parameter.
FORMAT_MESSAGE_FROM_HMODULE - lpSource is an hModule of the
module that contains the message table to search.
FORMAT_MESSAGE_FROM_STRING - lpSource is an LPSTR that points
to unformatted message text. It will be scanned for
inserts and formatted accordingly.
If neither of these options is specified, then this parameter is
ignored.
dwMessageId - specifices the 32-bit message identifier that identifies
the message being requested. This parameter is ignored if the
FORMAT_MESSAGE_FROM_STRING flag is specified.
dwLanguageId - specifices the 32-bit language identifier that
identifies the language of the message being requested. This
parameter is ignored if the FORMAT_MESSAGE_FROM_STRING flag is
specified.
lpBuffer - specifies a pointer to a buffer where the formatted message
is to be written. A terminating null byte will also be written.
If the FORMAT_MESSAGE_ALLOCATE_BUFFER flag was specified, then
this parameter points to a 32-bit pointer value that is filled in
by this call with a pointer allocated via LocalAlloc to contain
the text of the message.
nSize - specifies the maximum number of bytes that can be stored in
the output buffer. This parameter is ignore if the
FORMAT_MESSAGE_ALLOCATE_BUFFER flag is set.
Arguments - specifies a pointer to variable number of arguments.
These arguments are used to satisfy insert requests in the
format string. Thus %1 in the format string specifies the
first argument in the variable number of arguments described
by the Arguments parameter; %3 would specify the third, etc.
The interpretation of each 32-bit arguments value depends upon
the formatting information associated with the insert in the
message definition. The default is to treat each pointer as a
pointer to a null terminated string.
By default the Arguments parameter is of type va_list, which is
a language and implementation specific data type for describing
a variable number of arguments. If you do not have a pointer of
type va_list, then specify the FORMAT_MESSAGE_ARGUMENT_ARRAY
flag and pass a pointer to an array of 32-bit values that are
are input to the message formatted as the insert values.
Return Value:
DwORD - Returns the number of bytes actually stored in the output
buffer, excluding the terminating null character. Returns 0 if
an error occurred. Extended error status is available via the
GetLastError API.