Skip to content

Missing log configuration defaults for verbose third-party packages #633

@AFaust

Description

@AFaust

In previous versions of Alfresco Content Services which included in-process transformers, the log configuration already included necessary configuration to avoid common and expectable warnings, e.g. from PDFBox, to spam the application logs. The TIKA and AIO transformers are missing similar log configuration defaults. With various documents in an AIO-based setup, we see the following messages extremely frequently:

WARN 1 --- [nio-8090-exec-8] org.apache.fontbox.ttf.CmapSubtable      : cmap format 4 subtable is empty
WARN 1 --- [nio-8090-exec-9] o.a.pdfbox.pdmodel.font.PDType0Font      : No Unicode mapping for CID+86 (86) in font TWRVUV+ArialUnicodeMS-Identity-H
WARN 1 --- [nio-8090-exec-7] o.a.pdfbox.pdmodel.font.PDType1Font      : Using fallback font LiberationSans for Helvetica
WARN 1 --- [nio-8090-exec-3] org.apache.pdfbox.pdfparser.COSParser    : The stream doesn't provide any stream length, using fallback readUntilEnd, at offset 164607
WARN 1 --- [nio-8090-exec-3] org.apache.pdfbox.pdfparser.COSParser    : Unexpected XRefTable Entry: 0000165164 00000
WARN 1 --- [nio-8090-exec-4] org.apache.pdfbox.pdfparser.COSParser    : The end of the stream doesn't point to the correct offset, using workaround to read the stream, stream start position: 1099, length: 0, expected end position: 1099
WARN 1 --- [nio-8090-exec-6] org.apache.fontbox.ttf.TTFParser         : Skip table 'kern' which goes past the file size; offset: 0, size: 102260, font size: 12436
WARN 1 --- [nio-8090-exec-1] org.apache.pdfbox.pdmodel.font.PDFont    : Invalid ToUnicode CMap in font MalgunGothicRegular
WARN 1 --- [nio-8090-exec-1] org.apache.pdfbox.pdmodel.font.PDFont    : Using predefined identity CMap instead

and so on...
It seems prudent to include the following log configuration defaults based on what was previously provided in ACS and what warnings we have observed:

logging.level.org.apache.fontbox.ttf=ERROR
logging.level.org.apache.pdfbox.cos.COSDocument=ERROR
logging.level.org.apache.pdfbox.pdfparser=ERROR
logging.level.org.apache.pdfbox.filter.FlateFilter=ERROR
logging.level.org.apache.pdfbox.pdmodel.font=ERROR
logging.level.org.apache.pdfbox.pdmodel.font.PDSimpleFont=FATAL
logging.level.org.apache.pdfbox.pdmodel.font.PDFont=FATAL
logging.level.org.apache.pdfbox.pdmodel.font.PDCIDFont=FATAL

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions