Some Eclipse Foundation services are deprecated, or will be soon. Please ensure you've read this important communication.

Bug 437545

Summary: [server] SolrException: An invalid XML character (Unicode: 0xffffffff) was found
Product: [ECD] Orion Reporter: Anthony Hunter <ahunter.eclipse>
Component: ServerAssignee: Project Inbox <orion.server-inbox>
Status: RESOLVED WONTFIX QA Contact:
Severity: normal    
Priority: P3    
Version: 6.0   
Target Milestone: ---   
Hardware: PC   
OS: Linux   
Whiteboard:
Attachments:
Description Flags
File with characters that causes the issue none

Description Anthony Hunter CLA 2014-06-16 12:05:19 EDT
Created attachment 244287 [details]
File with characters that causes the issue

Following from Bug 345782 :

> I am only going to check for the problem character that we know about
> (Unicode FFFF). There are other invalid Unicode characters in XML files, but
> no sense adding them all unless we know they might occur in a file.

I have attached a file with a different character(s). We have this error many times on our production server:

org.apache.solr.client.solrj.SolrServerException: org.apache.solr.client.solrj.SolrServerException: org.apache.solr.common.SolrException: An invalid XML character (Unicode: 0xffffffff) was found in the element content of the document.
        at org.apache.solr.client.solrj.embedded.EmbeddedSolrServer.request(EmbeddedSolrServer.java:153) ~[org.apache.solr.core_3.5.0.v20120530-0900.jar:na]
        at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105) ~[na:na]
        at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:121) ~[org.apache.solr.client.solrj_3.5.0.v20120530-0900.jar:na]
        at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:106) ~[org.apache.solr.client.solrj_3.5.0.v20120530-0900.jar:na]
        at org.eclipse.orion.internal.server.search.Indexer.indexProject(Indexer.java:272) [org.eclipse.orion.server.search_1.0.0.v20140425-2212.jar:na]
        at org.eclipse.orion.internal.server.search.Indexer.indexWorkspace(Indexer.java:183) [org.eclipse.orion.server.search_1.0.0.v20140425-2212.jar:na]
        at org.eclipse.orion.internal.server.search.Indexer.indexUser(Indexer.java:162) [org.eclipse.orion.server.search_1.0.0.v20140425-2212.jar:na]
        at org.eclipse.orion.internal.server.search.Indexer.run(Indexer.java:354) [org.eclipse.orion.server.search_1.0.0.v20140425-2212.jar:na]
        at org.eclipse.core.internal.jobs.Worker.run(Worker.java:53) [org.eclipse.core.jobs_3.6.0.v20140303-1513.jar:na]
Caused by: org.apache.solr.client.solrj.SolrServerException: org.apache.solr.common.SolrException: An invalid XML character (Unicode: 0xffffffff) was found in the element content of the document.
        at org.apache.solr.client.solrj.embedded.EmbeddedSolrServer.request(EmbeddedSolrServer.java:142) ~[org.apache.solr.core_3.5.0.v20120530-0900.jar:na]
        ... 8 common frames omitted
Caused by: org.apache.solr.common.SolrException: An invalid XML character (Unicode: 0xffffffff) was found in the element content of the document.
        at org.apache.solr.handler.XMLLoader.load(XMLLoader.java:81) ~[na:na]
        at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:58) ~[na:na]
        at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:129) ~[na:na]
        at org.apache.solr.core.SolrCore.execute(SolrCore.java:1372) ~[org.apache.solr.core_3.5.0.v20120530-0900.jar:na]
        at org.apache.solr.client.solrj.embedded.EmbeddedSolrServer.request(EmbeddedSolrServer.java:140) ~[org.apache.solr.core_3.5.0.v20120530-0900.jar:na]
        ... 8 common frames omitted
Caused by: javax.xml.stream.XMLStreamException: An invalid XML character (Unicode: 0xffffffff) was found in the element content of the document.
        at com.ibm.xml.xlxp.api.stax.msg.StAXMessageProvider.throwWrappedXMLStreamException(StAXMessageProvider.java:73) ~[na:na]
        at com.ibm.xml.xlxp.api.stax.XMLStreamReaderImpl.produceFatalErrorEvent(XMLStreamReaderImpl.java:2103) ~[na:na]
        at com.ibm.xml.xlxp.api.stax.XMLStreamReaderImpl.reportFatalError(XMLStreamReaderImpl.java:2109) ~[na:na]
        at com.ibm.xml.xlxp.scan.DocumentEntityScanner.reportFatalError(DocumentEntityScanner.java:479) ~[na:na]
        at com.ibm.xml.xlxp.scan.DocumentEntityScanner.scanContentUnbuffered2(DocumentEntityScanner.java:2968) ~[na:na]
        at com.ibm.xml.xlxp.scan.DocumentEntityScanner.scanContentUnbuffered(DocumentEntityScanner.java:2755) ~[na:na]
        at com.ibm.xml.xlxp.api.util.SimpleScannerHelper.scanContentUnbuffered(SimpleScannerHelper.java:1309) ~[na:na]
        at com.ibm.xml.xlxp.scan.DocumentEntityScanner.stateUnbufferedContent(DocumentEntityScanner.java:551) ~[na:na]
        at com.ibm.xml.xlxp.scan.DocumentEntityScanner.produceEvent(DocumentEntityScanner.java:644) ~[na:na]
        at com.ibm.xml.xlxp.api.stax.XMLStreamReaderImpl.getNextScannerEvent(XMLStreamReaderImpl.java:1645) ~[na:na]
        at com.ibm.xml.xlxp.api.stax.XMLStreamReaderImpl.next(XMLStreamReaderImpl.java:536) ~[na:na]
        at com.ibm.xml.xlxp.api.stax.XMLInputFactoryImpl$XMLStreamReaderProxy.next(XMLInputFactoryImpl.java:180) ~[na:na]
        at org.apache.solr.handler.XMLLoader.readDoc(XMLLoader.java:295) ~[na:na]
        at org.apache.solr.handler.XMLLoader.processUpdate(XMLLoader.java:157) ~[na:na]
        at org.apache.solr.handler.XMLLoader.load(XMLLoader.java:79) ~[na:na]
        ... 12 common frames omitted
Comment 1 Aidan Redpath CLA 2014-06-23 12:29:04 EDT
I cannot reproduce this on windows
Comment 2 Anthony Hunter CLA 2015-01-19 15:19:21 EST
We are no longer using Apache Solr on the Orion server for search.