[Am-info] Embarrassing stuff left in Word docs on MSFT sites

Erick Andrews <eandrews@star.net>
Thu, 01 Apr 2004 21:54:13 -0400 (EDT)


On Fri, 02 Apr 2004 11:22:52 +0900 (JST), Stephen A. Carter wrote:

>"[...W]hy not run a gentle web spider against all Microsoft sites in
>English, specifically looking for other instances of tracking data
>not removed from documents? I coded a bunch of scripts and let them
>run through the night, fetching approximately 10,000 unique
>documents; over 10% was identified as containing change tracking
>records. I decided to collect only those with deleted text still
>present, yielding a crop of over 5% of all documents."
>
>http://lcamtuf.coredump.cx/strikeout/
>
>
>-- 
>Stephen A. Carter            High-Tech Information Center Nagoya, Ltd.
>mailto:scarter@hticn.com                                 Nagoya, Japan
>http://www.geekynetstuff.com                   PGP key ID:  0x59B4F7AD

The issue of 'embedded' stuff in MS documents has been understood
by many folks for many years.  That embedded data is also why MS Word
docs are often an order of magnitude bigger (in file size) than a
comparable text file.

I'm not sure where you want to take this.  Is there a public policy position
that you are proposing to safeguard individual authorship?

-- 
Erick Andrews