none
This user is offline.
Anyone familiar with OCR Scanning. Looking for suggestions on programs which would work better with different types of scans. For instance books are relatively easy since they don't have columns, but magazine and comic would require a different process. Also foreign languages is probably another issue to consider.
Would like to have .txt files of scanned articles which have been accumulated, so that they are searchable. Then for the foreign stuff online translation sites could help out.
If the OCR program played well with .cbr and .pdf formats that would be teh best.
and ideas, programs or suggestions of other issues to consider is welcomed.
none
Moth3r
This user is offline.
Better Grumpy than DopeyNot used it myself, but I asked a similar question a while ago on another forum and was recommended to try Presto OCR.
none
This user is offline.
Thanks for the suggestion. With all the millions of scans out there, and the depth of some of the archival sites, i'm surprised more .txt archives are not out there to search through.
Would like to start some kind of system, post a scan with raw OCR see if a community would do the proofing, then have all the .txts in a bundle so people could take and host, and it would make article quoting much easier for the community, since at this point much of what's talked about is recycled....
none
This user is offline.
For those into old .txt archives, The Archive Team saved much of starwars.yahoo.com before it went down:
http://www.archiveteam.org/index.php?title=Starwars.yahoo.com