GreenLance Communications: News • Media • Reviews

Sunday, November 2, 2008

How I Read the Research

[Image: OCR Terminal]

While researching this year's NaNoWriMo novel, I've been doing a fair bit of reading. What I've discovered is that scanned-in text images are a bear to work with; I much prefer straight, simple text. I've been most grateful this month for the services of OCRTerminal, a groovy little tool that slurps up .jpg or .pdf files, reads them, and converts them to text. It can output page contents as pure text, if you like, or it can maintain formatting by saving them as Word, RTF, or XML files. What a find!
