stripsgml

Remove SGML markup.


Description

stripsgml removes SGML markup tags from a file. stripsgml also attempts to translate standard special character entity references to ASCII (or the character set specified by the -charset option).

stripsgml cannot handle document type declarations. It only can handle document instances.


Usage

stripsgml is invoked from the Unix shell as follows:

% stripsgml [options] < sgmlfile > asciifile

The following are the list of options available:

-charset charset

Use charset as the character set to use while processing. By default, ASCII is assumed, so entity references for special characters and character references are mapped to ASCII text. set_charset allows the entity references and character references to interpreted under a different character set. Only the ISO-8859 character sets (1-10) are supported.

-help

Print out a terse description of all options available.

-html

Cause URLs in anchor elements in an HTML document to be preserved in the output.

-ignore name

Set parameter entity with the name, name, to "IGNORE". This option may be useful if the input data contains marked sections where parameter entity references are used to control the status keyword.

This option can be specified multiple times to set multiple parameter entities.

-include name

Set parameter entity with the name, name, to "INCLUDE". This option may be useful if the input data contains marked sections where parameter entity references are used to control the status keyword.

This option can be specified multiple times to set multiple parameter entities.


Availability

This software is part of the perlSGML package; see (http://www.oac.uci.edu/indiv/ehood/perlSGML.html)


Author

Earl Hood
ehood@medusa.acs.uci.edu
Copyright © 1997

97/09/12 20:13:17