New Officeshots feature: ODF Anonymiser

by Sander Marechal

I have just released a new feature for Officeshots: the ODF Anonymiser. The ODF Anonymiser tries to make your document completely anonymous while maintaining its overall structure. All metadata is removed or cleaned. All text in the document is replaced with gibberish text that has approximately the same word length and word distribution. All images are replaced with placeholder images. All unknown content is removed.

The result of the anonymiser is a document that has the same general structure but with made-up contents. If your original document does not work in a certain application, the anonymised version of the document should fail in the same manner. By using the anonymiser you can test your private documents without exposing the contents to our rendering clients.
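To give a rough idea of what "gibberish with the same word lengths" means, here is a minimal Python sketch of the text-replacement step only. It is not the actual ODF Greek code: the function names `greek_char` and `anonymise_text` and the `seed` parameter are made up for this illustration, and it works on a plain string rather than on an ODF file. Metadata cleaning and image replacement are not covered.

```python
import random
import string


def greek_char(ch, rng):
    """Return a random stand-in for ch that keeps its character class."""
    if ch.isupper():
        return rng.choice(string.ascii_uppercase)
    if ch.islower():
        return rng.choice(string.ascii_lowercase)
    if ch.isdigit():
        return rng.choice(string.digits)
    # Whitespace and punctuation are kept, so word and sentence
    # boundaries stay exactly where they were.
    return ch


def anonymise_text(text, seed=None):
    """Replace letters and digits with random ones (hypothetical helper).

    Word lengths, capitalisation, spacing and punctuation are preserved,
    so the layout of the text stays roughly the same while the content
    becomes unreadable.
    """
    rng = random.Random(seed)
    return "".join(greek_char(ch, rng) for ch in text)


if __name__ == "__main__":
    print(anonymise_text("Dear Mr. Smith, your invoice 2041 is overdue."))
```

Because only letters and digits are swapped, a paragraph that wrapped over three lines before should still wrap over roughly three lines afterwards, which is the property that keeps rendering bugs reproducible.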

To use the Anonymiser, simply check the appropriate checkbox on the Officeshots front page.

The ODF Anonymiser is written and maintained by the people who created the iTools Python libraries. The Anonymiser is part of that library, where it is called ODF Greek. If you want to use the anonymiser yourself, just install iTools and use the iodf-greek.py script. Many thanks to them for their contribution.

Creative Commons Attribution-ShareAlike
