Extend the file processing pipeline to support attachment extraction, text extraction, metadata retrieval, and imaging capabilities for the following file extensions: .hwp, .hwpx, .alz, and .egg.
Background
Our current …...content (body text, headers, footers, comments, etc.) for indexing...