We aren't sure how, but in a few of our pages using CQ5.4, the jcr:description field (when viewed in CRXDE Lite) contains the ASCII ETX character. This character is not visible in OSX Lion, but it can be seen in Windows and maybe older versions of OSX.
An example of our jcr:description:
"The Nike USATF Collection is a modern mix of vintage classics infused with military aesthetic."
When we copy paste the description and put it into CRXDE or a different textbox like the CRXDE Lite path box or in Chrome address bar:
"The Nike USATF Collection is a modern mix ^C of vintage classics infused with military aesthetic."
Note the ^C above is where the ETX character is being placed.
Has anyone ever experienced this issue before? It is breaking our endecca indexing at the moment on certain pages with this issue. Thanks.
Edit: The code that is breaking is actually the JCR API which our search team is using to gather the data.