436 lines
20 KiB
Plaintext
436 lines
20 KiB
Plaintext
|
This document describes how messages will be input, stored and formatted
|
||
|
by a Win32 application.
|
||
|
|
||
|
1. Message Input
|
||
|
|
||
|
Messages are input as ASCII text in a text file. The format of this
|
||
|
file supports specifying multiple versions of the same message text,
|
||
|
one for each language supported. It also supports automatic assignment
|
||
|
of code numbers to each message, along with the generation of a C
|
||
|
language include file for use by the application for accessing the
|
||
|
messages using symbolic constants. The purpose of the message text
|
||
|
file is to define all of the messages needed by an application, in a
|
||
|
format that makes it easy to support multiple languages with the same
|
||
|
image file.
|
||
|
|
||
|
Message text files are converted into binary resource files by the
|
||
|
MC program. These binary resource files are then input to the RC
|
||
|
compiler which will put them in the resource table for an
|
||
|
application or DLL.
|
||
|
|
||
|
The format of the message text file (default extension is .mc).
|
||
|
Basic syntax is Keyword=Value, where spaces around the equal sign
|
||
|
are ignored, and the value is delimited by white space from the next
|
||
|
keyword=value pair. Case is ignored when comparing against keyword
|
||
|
names. The value portion can either be a numeric integer constant,
|
||
|
{NUMBER}, using C syntax; a symbol name, {NAME}, that follows the
|
||
|
rules for C identifier names; or a file name that follows the
|
||
|
rules for the FAT file system (8 characters or less, no periods).
|
||
|
|
||
|
Comment lines are allowed in the message text file. The comment
|
||
|
syntax is the same as for WIN.INI, namely a semicolon begins a
|
||
|
comment which is terminated by the end of the line. Comments that
|
||
|
exist by themselves on a line are copied as is to the output .h
|
||
|
file.
|
||
|
|
||
|
The overall structure of a message text file consists of a header
|
||
|
section which contains zero or more of the following keywords:
|
||
|
|
||
|
MessageIdTypedef={NAME}
|
||
|
SeverityNames=({NAME}={NUMBER}:{NAME})
|
||
|
FacilityNames=({NAME}={NUMBER}:{NAME})
|
||
|
LanguageNames=({NAME}={NUMBER}:{FILENAME}[:{CODEPAGE}])
|
||
|
OutputBase={NUMBER}
|
||
|
|
||
|
These keywords have the following meaning:
|
||
|
|
||
|
MessageIdTypedef - gives a symbolic name that is output as the
|
||
|
typedef name for each numeric MessageId value. The default
|
||
|
value for this is NULL, which means there will be no type
|
||
|
cast output when defining symbolic names for a MessageId.
|
||
|
|
||
|
SeverityNames - defines the set of names that are allowed as the
|
||
|
value of the Severity keyword in the message definition.
|
||
|
The set is delimited by left and right parenthesis.
|
||
|
Associated with each severity name is a number that, when
|
||
|
shifted left by 30, gives the bit pattern to logically OR
|
||
|
with the Facility value and MessageId to come up with the
|
||
|
full 32-bit message code. The default value of this keyword
|
||
|
is:
|
||
|
|
||
|
SeverityNames=(Success=0x0
|
||
|
Informational=0x1
|
||
|
Warning=0x2
|
||
|
Error=0x3
|
||
|
)
|
||
|
|
||
|
Severity values occupy the high two bits of a 32-bit message
|
||
|
code. Any severity value that does not fit in two bits is
|
||
|
an error. The severity codes can be given symbolic names
|
||
|
by following each value with :{NAME}
|
||
|
|
||
|
FacilityNames - defines the set of names that are allowed as the
|
||
|
value of the Facility keyword in the message definition.
|
||
|
The set is delimited by left and right parenthesis.
|
||
|
Associated with each facility name is a number that, when
|
||
|
shift it left by 16 bits, gives the bit pattern to logically
|
||
|
OR with the Severity value and MessageId to come up with the
|
||
|
full 32-bit message code. The default value of this keyword
|
||
|
is:
|
||
|
|
||
|
FacilityNames=(System=0x0FF
|
||
|
Application=0xFFF
|
||
|
)
|
||
|
|
||
|
|
||
|
Facility codes occupy the low order 12 bits of the high
|
||
|
order 16-bits of a 32-bit message code. Any facility code
|
||
|
that does not fit in 12 bits is an error. This allows for
|
||
|
4096 facility codes. The first 256 are reserved for
|
||
|
use by the system software.
|
||
|
|
||
|
The facility codes can be given symbolic names by following
|
||
|
each value with :{NAME}
|
||
|
|
||
|
LanguageNames - defines the set of names that are allowed as the
|
||
|
value of the Language keyword in the message definition.
|
||
|
The set is delimited by left and right parenthesis.
|
||
|
Associated with each language name is a number and a file
|
||
|
name that will be used to name the binary output file that
|
||
|
will contain all of the message text for that language. The
|
||
|
number corresponds to the Language Id tag to use in the
|
||
|
resource table. The number is separate from the file name
|
||
|
with a colon. The initial value of this keyword is:
|
||
|
|
||
|
LanguageNames=(English=1:MSG00001)
|
||
|
|
||
|
Any new names that an application defines in its .mc file
|
||
|
which don't override any of the builtin names will be added
|
||
|
to the list of valid languages. This allows an application
|
||
|
to support private languages with descriptive names.
|
||
|
|
||
|
If the message file contains messages for languages that
|
||
|
must be represented in separate codepages, the optional
|
||
|
fourth (4th) parameter may be used to specify the codepage
|
||
|
that the messages for that Language's messages are in.
|
||
|
|
||
|
LanguageNames=(Japanese=411:MSG00411:932)
|
||
|
|
||
|
The default codepage used, if the codepage is not explicitly
|
||
|
specified, is the OEM Codepage of the system.
|
||
|
|
||
|
OutputBase - sets the output radix for the constants output to C
|
||
|
header file for messages. (It does not set the radix for the
|
||
|
SEVERITY and FACILITY constants. These default to HEX and can be
|
||
|
output in decimal using the -d switch.) If present, Outputbase
|
||
|
overwrites the -d switch for message constants in the header file.
|
||
|
Legal values are 10 and 16.
|
||
|
|
||
|
The OutputBase keyword is legal both in the header section and in the
|
||
|
message definition section of the input file. The OutputBase can be
|
||
|
changed as often as desired.
|
||
|
|
||
|
Following the header section are zero or more message definitions.
|
||
|
Each message definition begins with one or more of the following
|
||
|
keywords.
|
||
|
|
||
|
MessageId={|{NUMBER}|+{NUMBER}}
|
||
|
Severity={SEVERITY_NAME}
|
||
|
Facility={FACILITY_NAME}
|
||
|
SymbolicName={NAME}
|
||
|
|
||
|
The MessageId keyword is required to mark the beginning of the
|
||
|
message definition, although its value is optional. If no value is
|
||
|
specified, then the value used will be the last value used for the
|
||
|
facility, plus one. If the value is specified as +{NUMBER} then
|
||
|
the value used will be the last value used for the facility, plus
|
||
|
the number after the plus sign. Otherwise if a numeric value is
|
||
|
given, that will be value used. Any MessageId value that does not
|
||
|
fit in 16 bits is an error.
|
||
|
|
||
|
Severity and Facility are optional keywords that can specify
|
||
|
additional bits to OR into the final 32-bit message code. If either
|
||
|
of these are not specified they default to the value last specified
|
||
|
for a message definition. The initial values of these prior to
|
||
|
processing the first message definition is:
|
||
|
|
||
|
Severity=Success
|
||
|
Facility=Application
|
||
|
|
||
|
The value associated with these keywords must match one of the names
|
||
|
given to the FacilityNames and SeverityNames keywords. The SymbolicName
|
||
|
keyword allows the ISV to associate a C symbolic constant name with the
|
||
|
final 32-bit message code that is a result of ORing together the
|
||
|
MessageId, Severity and Facility bits. The constant definition is
|
||
|
output to the generated .h file with the following format:
|
||
|
|
||
|
//
|
||
|
// {MESSAGETEXT}
|
||
|
//
|
||
|
|
||
|
#define CONSTANT_SYMBOL_NAME ((MessageIdTypedef) 0x12345678)
|
||
|
|
||
|
where the comment before the definition is a copy of the message
|
||
|
text for the first language specified in the message definition.
|
||
|
The CONSTANT_SYMBOL_NAME is the value of the SymbolicName keyword.
|
||
|
The MessageIdTypedef is not output if it is NULL, the default value.
|
||
|
|
||
|
After the message definition keywords, comes one or more message text
|
||
|
definitions. Each message text definition begins with the Language
|
||
|
keyword that identifies which binary output file this message text
|
||
|
is to be output to. Beginning on the very next line is the first
|
||
|
line of the message text. The message text is terminated by a line
|
||
|
containing a single period at the beginning of the line, immediately
|
||
|
followed by a new line. No spaces allowed around keyword. Within
|
||
|
the message text, blank lines and white space are preserved as part
|
||
|
of the message.
|
||
|
|
||
|
Language={LANGUAGE_NAME}
|
||
|
{MESSAGETEXT}
|
||
|
.
|
||
|
|
||
|
Within the message text, several escape sequences are supported for
|
||
|
dynamically formatting the message. The percent sign character (%)
|
||
|
begins all escape sequences.
|
||
|
|
||
|
|
||
|
%0 - This terminates a message text line without a trailing
|
||
|
newline. This can be used to build up long lines or to
|
||
|
terminate the message itself without a trailing newline,
|
||
|
which is useful for prompt messages.
|
||
|
|
||
|
%n!printf format string! - This identifies an insert. The
|
||
|
value of n can be between 1 and 99. The printf format
|
||
|
string must be bracketed by exclamation marks. It is
|
||
|
optional and defaults to !s! if not specified.
|
||
|
|
||
|
The printf format string can contain the * specifier for
|
||
|
either the precision or width components, and if so, they
|
||
|
will consume inserts %n+1 and %n+2 for their values at run
|
||
|
time. MC will print a warning message if an explicit
|
||
|
reference is made to these inserts elsewhere in the message
|
||
|
text.
|
||
|
|
||
|
Inserts must reference a parameter passed to the FormatMessage API
|
||
|
call. It will return an error if a message refers to an insert that
|
||
|
was not passed to the FormatMessage API call.
|
||
|
|
||
|
Any other character following a percent sign, other than a digit will
|
||
|
be formatted in the output message without the percent sign. Some
|
||
|
examples:
|
||
|
|
||
|
%% - will output a single percent sign in the formatted message text.
|
||
|
|
||
|
%n - will output a hard line break when it occurs at the end of a
|
||
|
a line. Useful when FormatMessage is supplying normal line
|
||
|
breaks so the message fits in a certain width.
|
||
|
|
||
|
%r - will output a hard carriage return, without a trailing newline.
|
||
|
|
||
|
%b - will output a space in the formatted message text. This
|
||
|
can be used to insure there are the appropriate number of
|
||
|
trailing spaces in a message text line.
|
||
|
|
||
|
%t - will output a tab in the formatted message text.
|
||
|
|
||
|
%. - will output a single period in the formatted message text.
|
||
|
This can be used to get a single period at the beginning of
|
||
|
a line without terminating the message text definition.
|
||
|
|
||
|
%! - will output a single exclamation point in the formatted
|
||
|
message text. This can be used to get an exclamation point
|
||
|
immediately after an insert without it being mistaken for
|
||
|
the beginning of a printf format string.
|
||
|
|
||
|
Unicode support is not understood yet. If the input file is ASCII
|
||
|
text, do we need an escape sequence to allow input of Unicode values?
|
||
|
Or do we just let them use DBCS in the text file, assuming they have
|
||
|
a text editor that can do this.
|
||
|
|
||
|
2. Message Compiler (MC)
|
||
|
|
||
|
This program converts .mc message text files into binary files
|
||
|
suitable for inclusion into a .RC file by the resource compiler.
|
||
|
|
||
|
Command line syntax:
|
||
|
|
||
|
MC [-v] [-w] [-s] [-d] [-n] [-h DirSpec] [-e extension] [-r DirSpec] filename[.mc] ...
|
||
|
|
||
|
where:
|
||
|
|
||
|
-v - generates verbose output to stderr.
|
||
|
|
||
|
-w - generates a warning message whenever an insert escape
|
||
|
sequence is seen that is a superset of the type supported
|
||
|
by OS/2 mkmsgf (i.e. anything other than %0 and %n).
|
||
|
Useful for converting old OS/2 message files to this
|
||
|
format.
|
||
|
|
||
|
-s - Add an extra line to the beginning of each message that is
|
||
|
the symbolic name associated with the message id.
|
||
|
|
||
|
-d - Output SEVERITY and FACILTY constants in decimal. Set the
|
||
|
initial output radix for messages to decimal.
|
||
|
|
||
|
-n - Terminates all strings with null's in the message tables.
|
||
|
|
||
|
-e - Specify the extension for the header file. From 1 - 3 chars.
|
||
|
|
||
|
-h DirSpec - specifies the target directory of the generated
|
||
|
.h file. The file name is the name of the .mc file with a
|
||
|
.h extension.
|
||
|
|
||
|
-r DirSpec - specifies the target directory of the generated
|
||
|
.rc file. The file name is the name of the .mc file with a
|
||
|
.rc extension.
|
||
|
|
||
|
filename.mc - specifes one or more input message files that
|
||
|
will be compiled into one or more binary resource
|
||
|
files, one for each language that the message
|
||
|
files contain message text for.
|
||
|
|
||
|
The message compiler reads the .mc file and generates a .h file
|
||
|
containing all the symbolic name definitions. For each LanguageId
|
||
|
that was used to specify message text, it outputs a binary file
|
||
|
containing a message table resource. It also outputs a single .rc
|
||
|
file that contains the appropriate RC syntax to include each binary
|
||
|
file output as a resource with the appropriate name and type ids.
|
||
|
|
||
|
3. Message Win32 API Calls
|
||
|
|
||
|
DWORD
|
||
|
APIENTRY
|
||
|
FormatMessage(
|
||
|
DWORD dwFlags,
|
||
|
LPVOID lpSource,
|
||
|
DWORD dwMessageId,
|
||
|
DWORD dwLanguageId,
|
||
|
LPSTR lpBuffer,
|
||
|
DWORD nSize,
|
||
|
va_list Arguments
|
||
|
)
|
||
|
|
||
|
Routine Description:
|
||
|
|
||
|
This function formats a message string. Input to this function is a
|
||
|
message definition. It can come from a buffer passed into this
|
||
|
function. It can come from a message table resource in a module
|
||
|
already loaded. Or the caller can ask this function to search the
|
||
|
system message table resource(s) for the message. This function
|
||
|
finds the message definition based on the Message Id and the
|
||
|
Language Id and copies the message text to the output buffer,
|
||
|
processing any imbedded insert sequences if requested.
|
||
|
|
||
|
Arguments:
|
||
|
|
||
|
dwFlags - Specifies options to the formatting process along with how
|
||
|
to interpret the lpSource parameter. The low order 16bits of
|
||
|
this parameter are the maximum width of a line, in characters.
|
||
|
Possible values are:
|
||
|
|
||
|
FORMAT_MESSAGE_ALLOCATE_BUFFER - the lpBuffer is a PVOID * and
|
||
|
nSize is the minimum size to allocate. This function will
|
||
|
then allocate a buffer large enought to hold the formatted
|
||
|
message and store the pointer to the buffer in the location
|
||
|
pointed to by lpBuffer. Caller should free the buffer
|
||
|
with LocalFree when they are done using it.
|
||
|
|
||
|
FORMAT_MESSAGE_IGNORE_INSERTS - insert sequences in the message
|
||
|
definition will be ignored and passed through to the output
|
||
|
buffer as is. Useful for fetching a message for later
|
||
|
formatting. If this flag is set, the lpArguments parameter
|
||
|
is ignored.
|
||
|
|
||
|
FORMAT_MESSAGE_FROM_STRING - lpSource is a pointer to a null
|
||
|
terminated message definition. It can contain insert
|
||
|
sequences just as the message text in the .mc file can.
|
||
|
|
||
|
FORMAT_MESSAGE_FROM_HMODULE - lpSource is a module handle that
|
||
|
contains the message table resource(s) to search. If this
|
||
|
handle is NULL, then the current process's application
|
||
|
image file will be searched.
|
||
|
|
||
|
FORMAT_MESSAGE_FROM_SYSTEM - If the requested message was not
|
||
|
found in lpSource or if lpSource was not examined (i.e. neither
|
||
|
of the preceeding two flags was specified), then this function
|
||
|
will search the system message table resource(s).
|
||
|
|
||
|
FORMAT_MESSAGE_ARGUMENT_ARRAY - If set, specifies that the passed
|
||
|
Arguments parameter is NOT a va_list structure but instead is
|
||
|
just a pointer to an array of 32-bit values that represent
|
||
|
the arguments.
|
||
|
|
||
|
FORMAT_MESSAGE_MAX_WIDTH_MASK - The low order 8 bits specify the
|
||
|
maximum width of each line formatted into the output buffer.
|
||
|
A maximum width of zero, means that no restrictions are
|
||
|
placed on the width, and only the line breaks in the message
|
||
|
definition will be placed in the output buffer. If a non-zero
|
||
|
value is specified, then line breaks in the message definition
|
||
|
text are ignored, and instead line breaks are calculated based
|
||
|
on the maximum width, with white space delimited strings never
|
||
|
being split across a line break. Hard coded line breaks in
|
||
|
the message definition text, that are identified by the %n
|
||
|
escape sequence, are always output to the output buffer.
|
||
|
|
||
|
If the width specified is FORMAT_MESSAGE_MAX_WIDTH_MASK, then
|
||
|
line breaks in the message file are ignored and only hard coded
|
||
|
line breaks are kept and none are generated.
|
||
|
|
||
|
lpSource - specifies where to retrieve the message definition from.
|
||
|
The type of this parameter depends upon the settings in the
|
||
|
dwFlags parameter.
|
||
|
|
||
|
FORMAT_MESSAGE_FROM_HMODULE - lpSource is an hModule of the
|
||
|
module that contains the message table to search.
|
||
|
|
||
|
FORMAT_MESSAGE_FROM_STRING - lpSource is an LPSTR that points
|
||
|
to unformatted message text. It will be scanned for
|
||
|
inserts and formatted accordingly.
|
||
|
|
||
|
If neither of these options is specified, then this parameter is
|
||
|
ignored.
|
||
|
|
||
|
dwMessageId - specifices the 32-bit message identifier that identifies
|
||
|
the message being requested. This parameter is ignored if the
|
||
|
FORMAT_MESSAGE_FROM_STRING flag is specified.
|
||
|
|
||
|
dwLanguageId - specifices the 32-bit language identifier that
|
||
|
identifies the language of the message being requested. This
|
||
|
parameter is ignored if the FORMAT_MESSAGE_FROM_STRING flag is
|
||
|
specified.
|
||
|
|
||
|
lpBuffer - specifies a pointer to a buffer where the formatted message
|
||
|
is to be written. A terminating null byte will also be written.
|
||
|
If the FORMAT_MESSAGE_ALLOCATE_BUFFER flag was specified, then
|
||
|
this parameter points to a 32-bit pointer value that is filled in
|
||
|
by this call with a pointer allocated via LocalAlloc to contain
|
||
|
the text of the message.
|
||
|
|
||
|
nSize - specifies the maximum number of bytes that can be stored in
|
||
|
the output buffer. This parameter is ignore if the
|
||
|
FORMAT_MESSAGE_ALLOCATE_BUFFER flag is set.
|
||
|
|
||
|
Arguments - specifies a pointer to variable number of arguments.
|
||
|
These arguments are used to satisfy insert requests in the
|
||
|
format string. Thus %1 in the format string specifies the
|
||
|
first argument in the variable number of arguments described
|
||
|
by the Arguments parameter; %3 would specify the third, etc.
|
||
|
|
||
|
The interpretation of each 32-bit arguments value depends upon
|
||
|
the formatting information associated with the insert in the
|
||
|
message definition. The default is to treat each pointer as a
|
||
|
pointer to a null terminated string.
|
||
|
|
||
|
By default the Arguments parameter is of type va_list, which is
|
||
|
a language and implementation specific data type for describing
|
||
|
a variable number of arguments. If you do not have a pointer of
|
||
|
type va_list, then specify the FORMAT_MESSAGE_ARGUMENT_ARRAY
|
||
|
flag and pass a pointer to an array of 32-bit values that are
|
||
|
are input to the message formatted as the insert values.
|
||
|
Return Value:
|
||
|
|
||
|
DwORD - Returns the number of bytes actually stored in the output
|
||
|
buffer, excluding the terminating null character. Returns 0 if
|
||
|
an error occurred. Extended error status is available via the
|
||
|
GetLastError API.
|