Category Archives for "Document Conversion"

The Nexus Scoop

It’s been a while since I posted anything new to the ECM support blog.  If you want to know why, see Brandon’s post from a while back, “Support: A Day in the Life Of.”  I feel his pain!  But there have been some exciting things happening here and you should know about them.

First of all, I got the low-down on Nexus ’11 the other day!  It seems that the goal is to make every year bigger and better than the last.  This year there are some very well-known and engaging speakers (such as Dr. James Brown), break-out sessions held by ECM professionals, deep dives with ECM field-specific experts and, of course, the after-hours social events.  If you have not been to Nexus, you need to register for this event.  If you sign up early, you get a pricing discount!  Tell us if you want to attend!  (The best way to get hold of us is to put in a support ticket and mention in the description that you are interested in Nexus.)

On the software front, I have been busy working with development teams on ILINX® products!  Content Store specifically has really been taking off and new improvements are being added all the time!  The whole goal of ILINX is to provide a simple user experience, while providing a powerful administration interface, all through thin-client technology.  The ILINX Products Suite has everything covered – conversion, workflow, storage and integration.  Tell us if you want a demo – we have great people who know their stuff in the world of Content Management and would love to work with you to see if ILINX is right for you!

Last but not least, I want to thank all the readers outside of Washington State.  Thank you for finally sending us some warmer weather – it took ‘til August but summer is finally here as well!  As always, if you have any questions or comments, please call or email me and we will give you an answer!

Mike Peterson
Support Engineer
ImageSource, Inc

College Transcript Processing

College Transcript Processing refers to converting a paper-based transcript into an electronic transcript via software that OCRs the scanned paper version, locates specific data within the transcript and saves that data for later use.  The reason for processing a transcript via software is to move the data into another system for storage and retrieval faster than manual entry by a data entry specialist would allow.  This is a somewhat difficult task for the following reasons:

  1. Every college presents similar data in a very different format.
  2. Almost all colleges attempt to prevent the copying of the paper transcript through various copy protection methods.  Most of these methods render the data on the transcript almost unreadable.

The data that is similar on a transcript falls into several main areas:

  1. College Identifying Information
  2. Student Identifying Information
  3. Session/Course Information
  4. Previous Colleges Attended Information
  5. Degrees Awarded Information

The data is similar but not the same on each college transcript.  In addition, the layout of a transcript varies greatly between colleges.  Session/Course data could take up the entire width of the paper for one college, but be formatted as multiple columns of data for another college.  There are many, many variations that need to be taken into consideration when using OCR to find and extract the data.
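As a rough illustration of the five data areas listed above, here is a minimal Python sketch of a layout-independent record that extracted data could be normalized into, whatever the source transcript looks like.  Every class and field name here is hypothetical and chosen only for illustration; none of it comes from FlexiCapture or from any particular college's transcript.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Course:
    # One course line within a session (hypothetical field names).
    code: str
    title: str
    credits: float
    grade: str


@dataclass
class Session:
    # A term such as "Fall 2010" and the courses taken in it.
    term: str
    courses: List[Course] = field(default_factory=list)


@dataclass
class Transcript:
    # College and student identifying information.
    college_name: str
    student_name: str
    student_id: Optional[str] = None
    # Session/course, previous colleges, and degrees awarded.
    sessions: List[Session] = field(default_factory=list)
    previous_colleges: List[str] = field(default_factory=list)
    degrees_awarded: List[str] = field(default_factory=list)

    def total_credits(self) -> float:
        # Sum credits across every course in every session.
        return sum(c.credits for s in self.sessions for c in s.courses)
```

The point of a structure like this is that a full-width course table and a multi-column one both end up as the same list of sessions and courses once extraction is done.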

So far the Abbyy FlexiCapture 9.x software has been able to handle most of these issues out of the box.  One of its most powerful features, I am finding, is the scripting language used to write rule scripts, custom scripts and export scripts that can correct OCR issues and assist the Verification Operator, improving efficiency and throughput.

The scripts for rules, custom scripts or export can be written in VBScript or JScript.  There is some documentation on the Abbyy classes and objects, but not a whole lot.  Most of what I have done has been through trial and error or, in specific cases, from examples provided by Tech Support.  However, the scripts that have been developed work well for correcting OCR issues and providing automated checks of extracted field data.  Through custom scripts there is even the option to run a database lookup on extracted data and return other fields from the database to help provide a complete set of validated information.
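To give a feel for the kind of logic such scripts carry, here is a minimal sketch in plain Python.  It is not FlexiCapture script code and uses none of the Abbyy classes or objects; it only illustrates the three ideas above: correcting common OCR confusions in a numeric field, running an automated check on the result, and looking up related fields in a database.  The function names, the `colleges` table, its sample row, and the 4.0 GPA scale are all assumptions made up for the example.

```python
import re
import sqlite3

# Characters OCR often confuses with digits in a numeric field (assumed list).
_DIGIT_FIXES = str.maketrans(
    {"O": "0", "o": "0", "I": "1", "l": "1", "S": "5", "B": "8"}
)


def clean_numeric(raw):
    # Only safe for fields that are expected to hold digits.
    return raw.strip().translate(_DIGIT_FIXES)


def check_gpa(raw):
    # Returns (value, needs_review). Assumes a 4.0 scale.
    text = clean_numeric(raw)
    if not re.fullmatch(r"\d(\.\d{1,2})?", text):
        return None, True  # not a number: send to the verification operator
    value = float(text)
    return value, not (0.0 <= value <= 4.0)


def make_demo_db():
    # In-memory stand-in for a real lookup database (hypothetical data).
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE colleges (code TEXT PRIMARY KEY, name TEXT, state TEXT)")
    conn.execute("INSERT INTO colleges VALUES ('001234', 'Example State College', 'WA')")
    return conn


def lookup_college(conn, code):
    # Return (name, state) for an extracted college code, or None if unknown.
    return conn.execute(
        "SELECT name, state FROM colleges WHERE code = ?", (code,)
    ).fetchone()
```

In use, a value that OCR read as `3.S` would be cleaned to `3.5` and pass the check, while an out-of-range or non-numeric value would be flagged for the operator, and an extracted college code could be used to pull the college name and state from the lookup table.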

This has been a learning experience, but it is proving to be well worth the effort.  Getting the data off the paper and into the system used to evaluate a student for enrollment now takes far fewer man hours than the old manual data entry did.

LiquidOffice / TeleForm Tango

In the past I have blogged about exporting Autonomy Cardiff’s TeleForm forms into LiquidOffice using the File Exchange Format, and then populating those LiquidOffice forms with OCR’d metadata from TeleForm using LiquidOffice’s virtual submit feature. So in this dance, TeleForm is the lead.

And a lead dancer’s job is to make the other dancer look good, right?

Time for a swap  – let’s let LiquidOffice lead.

Here we’ll leverage the TeleForm LiquidOffice SOAP connect agent.  Price: FREE with TeleForm. Using this method offers a helpful twist: you can attach data and documents to an active LiquidOffice process if you wish.  That is not achievable with the virtual submission method.

Though described in the help files as a “complex subject”, there are some scenarios that comply with the KISS approach (my favorite). Time to jump in but, warning, danger, disclaimer: this blog assumes you’ve spent some quality time with LiquidOffice and TeleForm.

ILINX Capture: Scanning in a Production Environment

ILINX Capture uses a web-based platform that combines functionality with ease of use for scanning in a production environment.  This platform allows production workers to remotely tap into the system to perform any task in the production workflow.  Capture makes use of many different image enhancement techniques.  In ILINX Capture’s production environment, batches move through the processes in an efficient and organized workflow.


eForms Technology Short Cuts That Really Work!

Often we are up against tight deadlines and we need to use all our tricks and tools to help increase efficiency and provide a better client experience.  When we have clients with large form libraries that need converting to eForms, one of my favorite tools to use is FormBridge for Liquid Office.  FormBridge does direct conversions of PDF, Word, Excel and other common form files to Liquid Office xfm files like magic!  The converted forms are fully editable and are amazingly accurate copies, needing only minor formatting tweaks once translated.  FormBridge automatically creates fillable fields, just as a forms designer would, and this is a huge time saver.

Forms kick off workflows and drive business.  Moving paper-based and un-editable eForms to an intelligent digital format has many benefits, such as cost savings from greater efficiency and increased accessibility.  Even a small business may have hundreds of forms.  As a system integrator of ECM technologies, we know which tools and tips will help your eForms initiative become a huge success.

Leigh Woody
Program Manager
ImageSource, Inc.