| Abstract: |
Providing easy-to-use search and retrieval of image, text, and other
files is one of the biggest challenges in archiving activities. Across the DOE Complex,
documents are commonly made available via the World Wide Web in the Portable Document
Format (PDF). While this approach makes it easy to retrieve files across all
computer platforms, it does not provide a uniform method for searching the content of
these files, nor does it address the wealth of legacy documents, artwork, and photographs.
At LLNL alone, hundreds of thousands of pages of text and an equal volume of images need
to be made accessible.
While the mechanism for scanning, performing optical character recognition, and
conversion to PDF exists, one challenge is to make these documents conveniently searchable
and browse-able from the desktops of end users. We in the Technical Information Department
at LLNL worked with Intradoc software from Intranet Solutions, Inc. to solve this problem. |