Hi,
Part of my papertiger OCR project involves converting a scanned image of a document into a clean TIFF image ready for OCR.
I used to use SANE's scanimage --swcrop=yes; however, that corrupted colour scanning for me and occasionally cropped too much. Apart from that, I'd like to be able to process images produced by a different method (e.g. another computer with another scanning package giving the "raw" output).
So now I'm looking for a software method to detect the boundary between the page and the scanner background in the image.
What approach do you guys suggest? I could try the Pascal ImageMagick bindings, or look into another option (read in the image as BGRA? process the pixels myself?).
I'm a graphics newbie, so suggestions about existing programs that already do this, algorithms, links to relevant sites, etc. are more than welcome as well.
Thanks.