I am supplying the regex “\S+” to a TextFragmentAbsorber in order to split a PDF page into individual words. This has worked well on many, many documents, but I have hit a snag with a recent one (see attached PDF below). …...other line the words are separated cleanly into two pieces. In...