35 pts.
 My RSS Feed keeps breaking from pasted MS Word “smart” characters. How can I filter them out of a rich-text field?
I have an older application that publishes an RSS feed.  The content of the <description> element is an abstract from a RichText field.  It appears I have a recurring problem with users pasting in content from MS Word that contain characters XML parsers can't handle.

Is there a simple (or even not so simple) way to scrub the RichText field and replace/delete these characters?  Currently I am doing a substitution for higher ASCII values (ie: —) and CDATA tags.

I've also changed the encoding from UTF 8 to iso-8859-1 but this hasn't cured the underlying problem.

Be glad to kiss the virtual feet for a solution...  :-)



Software/Hardware used:
Lotus Notes/Domino 8.5.2
ASKED: February 8, 2012  8:21 PM
UPDATED: February 28, 2012  3:11 PM

Answer Wiki:
Last Wiki Answer Submitted:  Be the first to answer this question.
All Answer Wiki Contributors:  Be the first to answer this question. Michael Tidmarsh   14,000 pts. , Michael Tidmarsh   0 pts.
To see all answers submitted to the Answer Wiki: View Answer History.


Discuss This Question:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _