cf26721

MicrosoftContentParser failure - java.lang.Exception

Discussion created by cf26721 on Mar 31, 2017
Latest reply on Jan 31, 2018 by cm29392

Yesterday morning, we "upgraded" from Q2 2016 CU3 to CU4.  Following the upgrade, we began to notice a significant spike in errors in the bb-services logs across multiple application servers, with the base error being:

 

MicrosoftContentParser failure - java.lang.Exception

 

We also noticed that, at the same time as those errors climbed, the CPU load on all of our application servers spiked as well, climbing from < 5% to > 50%, and it remained elevated until the MicrosoftContentParser failures errors subsided.  I searched BtBb for this error, but came up empty handed.  I submitted a ticket, and got a solution along with the associated kb article (Article #000041865 - High CPU and High Load Caused by Script MicrosoftDocumentParser.sh).

 

The solution entails modifying .../blackboard/config/internal/xythos-indexing-filter.txt to exclude parsing any MS Office files, and adding the following entries:

 

application/vnd.openxmlformats-officedocument.wordprocessingml.document
application/doc
application/x-doc
application/msword
application/vnd.openxmlformats-officedocument.presentationml.presentation
application/ppt
application/x-ppt
application/vnd.ms-powerpoint
application/vnd.openxmlformats-officedocument.presentationml.slideshow
application/pps
application/x-pps
application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
application/xls
application/x-xls

 

 

Additionally, the BBLEARN_CMS/CMS schema must be updated:

 

UPDATE XY_SERVER_GROUP_PARAMETERS
 SET PARAMETER_VALUE = 'exe,dll,zip,jpg,gif,tif,tiff,dmp,pdf,doc,docx,ppt,pps,pptx,ppsx,ppts,xlsx,xls'
 WHERE PARAMETER_NAME = 'Xythos.Search.ExtensionsNotToIndex';
 
 UPDATE XY_SERVER_GROUP_PARAMETERS
 SET PARAMETER_VALUE = 'application/pdf,application/x-pdf,application/vnd.ms-excel,
 application/msexcel,
 application/x-msexcel,application/x-ms-excel,
 application/x-excel,
 application/x-dos_ms_excel
 ,application/xls,
 application/x-xls
 ,application/doc,
 application/x-doc,
 application/msword
 ,application/ppt,
 application/x-ppt,
 application/vnd.ms-powerpoint,
 application/pps
 ,application/x-pps
  ,application/vnd.openxmlformats-officedocument.spreadsheetml.sheet,
  application/vnd.openxmlformats-officedocument.wordprocessingml.document
  ,application/vnd.openxmlformats-officedocument.presentationml.presentation,
  application/vnd.openxmlformats-officedocument.presentationml.slideshow'
 WHERE PARAMETER_NAME = 'Xythos.Search.MIMETypesNotToIndex';

 

Outcomes