logo Sign In

OCR Scan program suggestions - dependent on source

Author
Time

Anyone familiar with OCR Scanning.  Looking for suggestions on programs which would work better with different types of scans.  For instance books are relatively easy since they don't have columns, but magazine and comic would require a different process.  Also foreign languages is probably another issue to consider.

Would like to have .txt files of scanned articles which have been accumulated, so that they are searchable.  Then for the foreign stuff online translation sites could help out.

If the OCR program played well with .cbr and .pdf formats that would be teh best.

and ideas, programs or suggestions of other issues to consider is welcomed.

none

Author
Time

Not used it myself, but I asked a similar question a while ago on another forum and was recommended to try Presto OCR.

Guidelines for post content and general behaviour: read announcement here

Max. allowable image sizes in signatures: reminder here

Author
Time

Thanks for the suggestion.  With all the millions of scans out there, and the depth of some of the archival sites, i'm surprised more .txt archives are not out there to search through.

Would like to start some kind of system, post a scan with raw OCR see if a community would do the proofing, then have all the .txts in a bundle so people could take and host, and it would make article quoting much easier for the community, since at this point much of what's talked about is recycled....