[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: OCR and Machine Readable Text
/**\\anonymous/**\\ allegedly said:
>
> Alan Olsen wrote:
> > I used to work for a company that would transfer entire archives of medical
> > journals. Much of it we would just OCR. Some of it we would send off
> > shore. The OCR software was about 95% reliable and this was over 5 years
> > ago. (And we were using 286 boxes for much of the OCR work. Not a heavy
> > technoligical investment.) I am sure that things have improved a great
> > deal since then. (My new scanner included OCR software. I will have to
> > run a test and report the findings.
>
> I'd like to know what OCR software you were using. All tests we
> completed at my place of employment were very poor quality wise. We
> showed
> a %65 accuracy rate. Not very good when you need to transfer a five
> year
> backlog of medical and technical journals. This was using a high
> resolution
> scanner with a package that was bundled along with it. About a year
> ago,
> my employer considered transfering data taken off of forms into a
> relational
> database using an OCR program. Again, we found the findings to be too
> innacurate for our needs. I may have just been using the wrong programs
> for
> the job, but the findings were depressing...
My understanding is that the most efficient way of inputting text is
"double typing" where two people type the same document, and a
mechanical comparison of the result is used to find errors.
--
Kent Crispin "No reason to get excited",
[email protected],[email protected] the thief he kindly spoke...
PGP fingerprint: 5A 16 DA 04 31 33 40 1E 87 DA 29 02 97 A3 46 2F