[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

ANON: Textual analysis

> This reveals a minor and probably obvious weakness of pseudonyms--writing
> styles.

We probably need "rephrasing remailers" which do some rudimentary
grammar parsing on input text, and randomly substitute equivalent
constructs such as switching active/passive voice, synonyms, changing
the word order where it is insignificant, joining/splitting sentences,
etc.  Anyone here have any experience in NLP (natural language processing),
specifically parsing english?

A possible start would be to look at "grammar checker" programs that 
check for various grammatical mistakes/misusages and suggest improvements.

Another starting point is language translation software.  After your text
has been translated automatically to spanish -> french -> german -> english,
not much of the orignal style will remain.  Hopefully, enough meaning
will be preserved to allow understanding.

Are there any public domain programs that do one of the above?

One constraint on these is that the message must be present in clear text,
so that it must be the last remailer in the chain.

> examination of punctuation styles (e.g., some people use _this_ for
> emphasis while others use *this*)

This could be alleviated by using a standard markup format, such as
MIME RichText, or the simpler markup convention recently proposed
on the mime list.

Yanek Martinson
[email protected]