1

Topic: '&' mark feature in new version 1.0.6

A new version, 1.0.6, of htmLawed has been released. The only change is the introduction of $config["and_mark"].

If 'and_mark" is turned on, the '&' characters in the original input are replaced with the '\x06' control character (& characters introduced by htmLawed, e.g., after converting < to &lt;, are not affected).

This allows one to distinguish, say, an '&gt;' introduced by htmLawed and an '&gt;' put in by the input writer, and can be helpful in further processing of the htmLawed-processed text (e.g., to identify the character sequence 'o(><)o' to generate an emoticon image).

When this feature is active, admins should ensure that the htmLawed output is not directly used in web pages or XML documents as the presence of the '\x06' control character can break documents. Before use in such documents, and preferably before any storage, any remaining '\x06' should be changed back to '&'.