Sender: |
|
Date: |
Thu, 3 May 2007 15:40:44 -0400 |
Reply-To: |
|
Subject: |
|
MIME-Version: |
1.0 |
Content-Transfer-Encoding: |
7bit |
In-Reply-To: |
|
Content-Type: |
text/plain; charset="us-ascii" |
From: |
|
Parts/Attachments: |
|
|
I have implemented Hummingbird DM and RM for numerous clients, several of
whom use the Full Text indexing on their scanned documents.
The biggest issue with the Indexes in general is the drive space that they
consume. Hummingbird recommends anywhere from 60% to 100 % of the disk space
that is used for document storage to be set aside for the indexes.
The next biggest issue is OCR and QA. If the OCR process is not very
accurate and/or the QA process is not very efficient, the old adage garbage
in garbage out applies. If the OCR doesn't properly recognize the
characters, the words can't be properly indexed.
The other issue is that the Indexes are prone to corruption, so in my
designs I always create two Indexes on each library, so if one gets
corrupted, I can switch the back up into production while the corrupted
index is rebuilt. In many implementations there are also two Indexers (or
more) for FOLB. This doubles (or more) the disk space needed for indexes.
This one may be obvious, but I mention it just in case: If you are not doing
OCR on the images, there won't be anything for the Indexer to index except
for the Meta Data hence no "Full Text" indexing.
Feel free to contact me directly if you have additional questions.
Brent
[log in to unmask]
-----Original Message-----
From: Records Management Program [mailto:[log in to unmask]] On Behalf
Of Records Management
Sent: Thursday, May 03, 2007 3:06 PM
To: [log in to unmask]
Subject: Full Text Indexing
Aloha All,
I was asked by a department manager whether anyone uses "full-text"
indexing for their document imaging system...
I would be interested to know if anyone does and what type of problems
they have encountered...
Mahalo for your feedback...
Brian
Brian A. Moriki
Assistant Vice President
Records Management Department
First Hawaiian Bank
808-844-3056
808-265-7449 (cell)
808-844-3494 (fax)
[log in to unmask]
***FHB RECORDS MANAGEMENT DEPARTMENT: WE SAY YES!!!***
----------------------------------------------------------
This email is intended only for the person or entity
to which it is addressed and may contain
confidential information. Any review,
retransmission, dissemination or other use of, or
taking of any action in reliance upon, this
information by persons or entities other than the
intended recipient is prohibited. If you receive this
e-mail in error, please contact the sender by
replying to this e-mail and delete this e-mail and
any attachments from all computers without
reading or saving the same in any matter
whatsoever.
List archives at http://lists.ufl.edu/archives/recmgmt-l.html
Contact [log in to unmask] for assistance
List archives at http://lists.ufl.edu/archives/recmgmt-l.html
Contact [log in to unmask] for assistance
|
|
|