I have a usecase where I am getting an OCRed copy. All the header, footer and page number data has been converted into text. And it has broken the document formatting as well. Attaching a sample document for the referenc…... Now When I am trying to parse the document, I am getting the...footers, and page numbers being parsed as regular text, you can implement...