I am currently trying to find file size limit for text extraction and document conversion.
I have following questions :
1. What is relation between document conversion and text extraction ?
2. Is there any way to bypass text-extraction and document conversion for file with size greater than predetermined configurable limit ?
a. I have come across below system properties for limiting file size for various purposes:
attachments.maxAttachmentSize : Max size for attachments
search.binaryContentByteLimit : Limit Search Index per file.
officeintegration.conversion.filelimit : document conversion limit
docbody.maxBodySize : Max Uploaded File Size Limit
b. Still if I am right that document conversion follows text extraction, and if in someway am able to limit text extraction for files above threshold, there should be no use of docbody.maxBodySize usage, right ?
c. Is officeintegration.conversion.filelimit system property limit also applicable for non MS Office file types like pdf ?
Similar Case: Performance Enhancement During Large File Uploads
I see that you have opened up a case in your customer group to further investigate this behavior. As such I would recommend following-up there if you have any further questions.