Quick Guide to Understanding the Documentation
For the GPO
Photocomposition Coding Scheme
Of the U.S.
Code
The
GPO photocomposition code scheme was originally (in the mid-1970s) designed to
work on a GOP-modified version of the ATEX text editing system from Digital
Equipment Corp. The software to handle typesetting was the MTP
(�Multi-Typography Program�) developed in-house by GPO.� In 1996, the ATEX system in the Office of
the Law Revision Counsel was replaced with XYWrite (a PC-based clone of
ATEX).� At roughly the same time, the
MTP software was replaced by MicroComp (also developed in-house by GPO).
Photocomposition coded U.S. Code data consists of a stream of data with typesetting instructions contained in character strings that follow one of several special characters.� The most widely used command character is the bell (HEX 07).
Bell codes
A bell (HEX 07)
signals that a command is to be given.�
The character that immediately follows a bell indicates the type of
command.� Some of these commands have
arguments � these arguments come immediately after the character following the
bell.� For instance,
������ Bell-I80
is a bell-I with an argument of 80.� Note that the bell commands are case sensitive.
A brief description of each type of bell-code is contained
on the attached copy of page 4 of GPO Coding for Selected
Publications:� Standardized locator
lists and marked up samples showing how to key selected publications, U.S. Government Printing
Office, February 1988:
�����������������������
Formats and
Locators (Bell-F, Bell-S, and Bell-I)
The following tables define the 99 locator codes associated with each U.S. Code format.� The headings of the table have the following meanings:
PTSZ�������������������������� point
size
LDG��������������������������� leading (points from the baseline of one character to the baseline of the character above it)
LGH��������������������������� line length in points
INDENT PRI�������������� primary indent (the indent (in points) of the first line of a paragraph)
INDENT SEC������������ secondary indent (the indent (in
points) of all lines in a paragraph after the first line)
GRID�������������������������� grid call � see �Grids and Typefaces,�
below
TF������������������������������� typeface call � see �Grids and
Typefaces,� below
LNTP������������������������� line type (C = center, J = justify, L =
flush left, and R = flush right)
FOTP������������������������� format type (used to format rudimentary
tabular data, e.g., tables of contents) � if the previous locator code had a
numeric FOTP value and the current locator code has a numeric FOTP value that
is greater than the FOTP value of the previous locator code, then do not start
a new line.
RULE������������������������� if the RULE value is �R�, then generate a
horizontal rule.
(Additional definitions may be found in Publishing From a Full Text Data Base,�� Government Printing Office, Graphic Systems
Development Division, GPO Publication 200.4 (2nd edition), February
1983).
�����������������������������������
�����������������������
and
for format 5804
�����������������������
With the format 5800 documentation, you should add
the following:
�
Locator-86
includes Presidential Proclamations, but does not include Reorganization Plans
�
Locator-87
includes Definition notes
The typeface and type style (bold, italic, etc.) are specified by a combination of a grid call and a typeface call.� Each locator code contains a default grid and typeface value.� These values can be overridden however.� Bell-T followed by a one-digit argument will change the typeface call to the value of the argument.� Bell-G followed by a one-digit argument (or bell-g followed by a three digit argument) will change the grid call to the value of the argument.� A bell-K will change the typeface and grid calls back to the defaults specified by the last locator code.
When
the current grid is the result of a bell-g with a three-digit argument, then
the typeface and type style of the text are as set out in pages i to xvii of MicroComp Grid List, U.S. Government Printing Office, May 1996
�����������������������
Of
the grids listed in MicroComp Grid List,
the U.S. Code currently uses grids 007, 025, 077,� 401, 710, 741, 742, and 940.
When
the value of the grid call is 1 to 4 and the current grid is the result of
either a bell-G or the default value from a locator code, then use the table in
the middle of page 4 of the format 5800 documentation, above.
When
the value of the current grid call is 5 to 8, use the following VideoComp table
�����������
A bell-R followed by two digits, indicates the start of a new grouping above the section level.� The two-digit argument indicates where the grouping sits within the hierarchy.� The lower the bell-R argument, the higher in the hierarchy the grouping is.� For example, a bell-R01 indicates the start of a new title � the highest hierarchical level within the U.S. Code.� By convention, bell-R05 is always used to indicate the start of a title appendix.
�����������������������
Complex
tables start with a bell-c.� The end of
such a table is marked with a bell-e.�
In between a bell-c and a bell-e most of the rules for how to process
photocomposition codes are replaced with the special rules set out in Subformat Generation:� MicroComp, U.S. Government Printing
Office, February 1, 1997:
�����������������������
An
addition to the 1997 documentation should be made on pages 2 and 5.� A bell-I97 has been added.� The bell-I97 does the same function as the
bell-I96, except that on a table that is continued on the next page (or column
of a page), a bell-I96 headnote will have ��continued� automatically appended
to the text of the headnote, whereas a bell-I97 headnote will not have
��continued� automatically appended.
In addition to the bell character, the following special characters are valid in Office of the Law Revision Counsel publications:
Name |
Hex Value |
Notes |
EOF |
00 |
End
of file |
Sectwst |
06 |
� |
Crlf |
0A |
Carriage
return � line feed |
Cents |
0B |
� |
Pgmark |
0C |
� |
Noprnst |
0E |
No
print � start |
Noprnend |
0F |
No
print � end |
Prime |
10 |
' |
Bldlbrak |
13 |
Bold
left bracket |
Bldrbrak |
14 |
Bold
right bracket |
Emspace |
18 |
M-space |
Enspace |
19 |
N-space |
Plusminus |
1B |
� |
Dscrhyph |
1C |
Discretionary
hyphen |
Dagger |
1E |
� |
Xquotes
(open) |
27 |
� |
Less
than |
3C |
< |
Greater
than |
3E |
> |
Footnote |
5C |
Convert
numbers between footnote characters (backslashes) into superscripts |
minus |
5E |
- |
Emdash |
5F |
� |
Xquotes
(close) |
60 |
� |
Degreemk |
AB |
� |
Dbldagger |
BD |
� |
Muchless |
BE |
n |
Balbox |
BF |
□ |
An
escape character (Hex FF) followed by the following two (or three) characters
represents the following characters
Name |
ESC sequence |
Notes |
Thin
space |
ESC
1A |
Thin
space |
N-dash |
ESC
09 |
N-dash |
Multiplication |
ESC
0A |
H |
All
Mark |
ESC
08 |
O |
Copyright |
ESC
AF |
� |
AC_BREVE |
ESC
AE0 |
˘ |
AC_ACUTE |
ESC
AE1 |
΄ |
AC_GRAVE |
ESC
AE2 |
` |
AC_CIRCUMFLEX |
ESC
AE3 |
^ |
AC_DIERESIS |
ESC
AE4 |
� |
AC-CARON |
ESC
AE5 |
ˇ |
AC_TILDE |
ESC
AE6 |
~ |
AC_ANGSTROM |
ESC
AE7 |
˚ |
AC_MACRON |
ESC
AE8 |
ˉ |
AC_CEDILLA |
ESC
AE9 |
� |
All of the above documentation provides explanations for each of the
photocomposition codes used in the U.S. Code.�
Below is a proposed specification for converting U.S. Code
photocomposition codes into XHTML
�����������������������
[1] The only exceptions to this are (1) the number of lines within the running heads and (2) whether or not the first page must start on an odd page number.