logo Sign In

Post #196685

Author
tweaker
Parent topic
The Thief and the Cobbler: Recobbled Director's Cut (Released)
Link to post in topic
https://originaltrilogy.com/post/id/196685/action/topic#196685
Date created
30-Mar-2006, 1:18 AM
Hey OCP, I'd like to offer my services, for anything that you need done. I have a ton of experience with OCRing scanned text. While I don't use Omnipage, I am extremely familiar with it's most well known competitor, a program called ABBYY Finereader. In a past life, I was into amateur pyrotechnics, and when I learned about OCR software, I came up with the idea of digitally archiving books and articles on professioal pyrotechnics. Over the course of a couple years, I worked on several projects, and probably OCRed around 2000 to 2500 pages of material. My biggest challenge was the restoration of a very rough PDF made up of 500 scanned images of an old textbook on military pyrotechnics. Not the most informative text, but I felt it was a rather necessary classical text on the subject, so I extracted the images from the PDF as jpegs, imported them into Finereader, and OCRed them. Not easy, considering the damn thing had tables, diagrams, chemical equations with subscripts, and so on. Another text I worked on used handwritten subscripts and superscripts for equations and footnotes. So I can deal with some really problematic shit.

I'll send you some samples of my work in a bit.