This looks pretty fragile to me. Characters will end up in bitmaps with
interpolated gray pixels, so the result depends not only on pdftex but also
on every subtlety of the rendering engine. And if the md5sum doesn't
match, you learn nothing without the original file. Maybe some
cross-correlation between images with some given tolerance limit would be

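To make the tolerance idea concrete, here is a rough sketch of a zero-mean normalized cross-correlation between two rendered bitmaps. It is only an illustration, not anything pdftex or an existing test suite provides: the function names, the list-of-rows image representation, and the default tolerance of 0.02 are all assumptions of mine. It also assumes both images have the same size and are already aligned, with no search over shifts.

```python
import math


def normalized_cross_correlation(a, b):
    """Zero-mean normalized cross-correlation of two grayscale images.

    Each image is a list of rows of pixel intensities (hypothetical format).
    The result lies in [-1, 1]; values near 1 mean the images agree up to
    overall brightness and contrast.
    """
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("images must have the same dimensions")
    xs = [p for row in a for p in row]
    ys = [p for row in b for p in row]
    if not xs:
        raise ValueError("images must not be empty")

    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    dx = [x - mean_x for x in xs]
    dy = [y - mean_y for y in ys]

    norm = math.sqrt(sum(d * d for d in dx) * sum(d * d for d in dy))
    if norm == 0.0:
        # At least one image is completely flat, so correlation is undefined;
        # fall back to an exact pixel comparison.
        return 1.0 if xs == ys else 0.0
    return sum(p * q for p, q in zip(dx, dy)) / norm


def images_match(a, b, tolerance=0.02):
    """Accept the pair if the correlation is within `tolerance` of 1.0.

    The default tolerance is an arbitrary placeholder, not a tuned value.
    """
    return normalized_cross_correlation(a, b) >= 1.0 - tolerance


# A 4x4 "glyph", the same glyph with slightly different gray pixels
# (as another rendering engine might produce), and a shifted glyph.
reference = [[0, 0, 0, 0], [0, 255, 255, 0], [0, 255, 255, 0], [0, 0, 0, 0]]
antialiased = [[0, 0, 0, 0], [0, 250, 255, 8], [0, 255, 247, 0], [0, 0, 0, 0]]
shifted = [[255, 255, 0, 0], [255, 255, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0]]

print(images_match(reference, antialiased))  # small gray-level differences
print(images_match(reference, shifted))      # glyph in the wrong place
```

Unlike a checksum, the score degrades gradually, so a failing comparison at least tells you how far off the rendering is. The threshold would still have to be tuned against real output.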
