Introduction
Every paper-based capture process faces the same problem: how to reduce the cost of scanning and data entry.

With Square 9 Advanced Capture, users can stack many types of paper documents without separator sheets, place them in the scanner, and let the software both classify each document and intelligently extract non-templated OCR data from it. Better yet, while it classifies and extracts data, it also integrates with the SmartSearch document management system for storage and, at the same time, feeds another line-of-business application for adjudication purposes.


Reference 1: Classification is performed by keyword search on the OCR output. Spatial relationships between keywords, together with hierarchical classification, allow thousands of document types to be captured and interpreted.
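
The keyword-driven, hierarchical approach can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the rule tree, the family and subtype names, and the `classify` function are hypothetical and do not reflect the product's actual rule syntax or API.

```python
import re

# Hypothetical two-level rule tree: a rough sort picks the document family,
# then a finer sort picks the specific type within that family.
RULE_TREE = {
    "financial": {
        "keywords": [r"\b(invoice|remittance|amount due)\b"],
        "subtypes": {
            "invoice": [r"\binvoice\s*(number|#|no\.?)"],
            "remittance": [r"\bremittance\b"],
        },
    },
    "healthcare": {
        "keywords": [r"\b(claim|patient|explanation of benefits)\b"],
        "subtypes": {
            "eob": [r"\bexplanation of benefits\b"],
            "admitting_form": [r"\badmitting\b"],
        },
    },
}


def matches(patterns, text):
    """True when every pattern is found somewhere in the OCR text."""
    return all(re.search(p, text, re.IGNORECASE) for p in patterns)


def classify(ocr_text):
    """Return (family, subtype); either may be None when no rule matches."""
    for family, node in RULE_TREE.items():
        if matches(node["keywords"], ocr_text):
            for subtype, patterns in node["subtypes"].items():
                if matches(patterns, ocr_text):
                    return family, subtype
            return family, None
    return None, None
```

For example, `classify("INVOICE # 1234 Amount Due $50")` lands in the hypothetical "financial" family and then in its "invoice" subtype. A page that matches a family but no subtype is returned with a subtype of `None`, so it could be sent for manual review.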


Reference 2: The classification process behaves like a mail room clerk: a rough sort first, then a finer sort. “Following Page” detection removes the need for separator sheets between documents, and the rules for detecting following pages are configurable.
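
As a rough illustration of how following-page detection can replace separator sheets, the Python sketch below groups per-page OCR text into documents. The two patterns and the `split_documents` function are hypothetical examples of configurable rules, not the product's real rule set.

```python
import re

# Hypothetical rules: a page starts a new document when it looks like a
# first page; a "page N of M" footer (N > 1) or the word "continued"
# marks it as a following page.
FIRST_PAGE = re.compile(
    r"\b(invoice|explanation of benefits|sales order)\b", re.IGNORECASE
)
FOLLOWING_PAGE = re.compile(
    r"\bpage\s+([2-9]|\d{2,})\s+of\s+\d+\b|\bcontinued\b", re.IGNORECASE
)


def split_documents(pages):
    """Group a list of per-page OCR strings into documents (lists of pages)."""
    documents = []
    for text in pages:
        is_following = bool(FOLLOWING_PAGE.search(text)) or not FIRST_PAGE.search(text)
        if documents and is_following:
            documents[-1].append(text)  # attach to the current document
        else:
            documents.append([text])    # start a new document
    return documents
```

Given a scanned stack such as `["Invoice 100 Page 1 of 2", "Page 2 of 2 Total 50", "Sales Order 9"]`, the sketch yields two documents: a two-page invoice and a one-page sales order.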

What Is It?
Square 9 Advanced Capture is a platform for developing document classification and data extraction applications, leveraging best-of-breed technology solutions from Paradatec's Prosar-Aida, Captaris' DokuStar, or Kofax Transformations.

■ Advanced Capture is an unstructured data capture product that can intelligently classify documents and extract their data using revolutionary artificial intelligence (AI) capabilities. This solution allows between 70% and 96% capture automation, meaning the percentage of documents that flow through Advanced Capture without needing manual correction.

■ Square 9 Advanced Capture addresses the most common capture problem companies face when implementing document management: how to control the cost of capture and data entry.

■ It can be used to tackle all kinds of input management tasks, regardless of whether the volume of documents is high or low, the workflow is straightforward or complex, the processing is central or distributed, or the documents to be processed are structured or unstructured.

Classification and Data Extraction
■ Advanced Capture is text recognition and analysis software based on artificial intelligence that can ascertain document types and classify images for appropriate routing and archival.

■ The process begins with a high-speed, efficient full-page OCR scan of each image. This first step allows Advanced Capture to search every word of every document to discover the document content.

■ To classify the current document, Square 9 Advanced Capture searches it for tell-tale characteristics specified by the administrator. These tend to be certain keywords or phrases, found using pattern matching, spatial relationships, and synonyms such as invoice number, invoice #, inv_num, etc.

■ This overall process of discovery is unique and makes Advanced Capture very different from typical template-based solutions, which expect and assume that document content always appears in the same areas of the page.

■ A significant advantage of Advanced Capture's document content discovery method is its ability to process a virtually unlimited number of versions and formats of a particular document type.

■ Advanced Capture is confidence based: if the Advanced Capture engine does not have a high degree of confidence in a data value, the value is flagged and sent for correction within the data validation module.

■ Since the Advanced Capture engine is transparent to the user, the only interaction the user has with the system is within the validation module, simply correcting values and routing them back into the process workflow.
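
The confidence-based routing described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `ExtractedField` type, the 0.0 to 1.0 confidence scale, and the 0.90 threshold are hypothetical and are not taken from the product.

```python
from dataclasses import dataclass

# Assumed cut-off; the product's real threshold and scale are not stated here.
CONFIDENCE_THRESHOLD = 0.90


@dataclass
class ExtractedField:
    name: str
    value: str
    confidence: float  # 0.0 to 1.0, as reported by a hypothetical engine


def route(fields, threshold=CONFIDENCE_THRESHOLD):
    """Split fields into (accepted, needs_validation) by confidence."""
    accepted = [f for f in fields if f.confidence >= threshold]
    flagged = [f for f in fields if f.confidence < threshold]
    return accepted, flagged
```

Fields in the second list would be shown to an operator in the validation module; fields in the first list would pass straight through to the workflow.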


Reference 3: During the extraction process, label-value pair logic is used to find data. Using the extracted OCR text, the solution finds all synonyms of a label type, such as “invoice date”, “inv date”, or “date”. The artificial intelligence engine then finds the date that is spatially in the correct RELATIVE location to the label. Note: no fixed zones are used anywhere!
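
A minimal Python sketch of label-value pairing by relative position follows. It assumes the OCR step yields tokens (words or short phrases) with page coordinates; the `Token` type, the synonym pattern, the date pattern, and the distance measure are all hypothetical choices made for illustration, not the engine's actual logic.

```python
import re
from dataclasses import dataclass


@dataclass
class Token:
    text: str  # a word or short phrase from the OCR output
    x: float   # left edge on the page
    y: float   # top edge on the page


# Hypothetical synonym list and value pattern for an "invoice date" field.
LABEL = re.compile(r"^(invoice date|inv date|date):?$", re.IGNORECASE)
DATE = re.compile(r"^\d{1,2}/\d{1,2}/\d{2,4}$")


def find_value(tokens, label=LABEL, value=DATE, line_tolerance=5.0):
    """Return the value closest to a label: right of it on the same line, or below it."""
    best, best_dist = None, float("inf")
    for lab in (t for t in tokens if label.match(t.text)):
        for cand in (t for t in tokens if value.match(t.text)):
            same_line = abs(cand.y - lab.y) <= line_tolerance and cand.x > lab.x
            below = cand.y > lab.y + line_tolerance
            if not (same_line or below):
                continue
            dist = abs(cand.x - lab.x) + abs(cand.y - lab.y)
            if dist < best_dist:
                best, best_dist = cand.text, dist
    return best
```

Because the search is relative to the label rather than tied to a fixed zone, the same function finds the date whether the label sits at the top right of one layout or at the bottom left of another.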


Reference 4: Advanced Capture supports tabular data extraction for any type of document: EOB, invoice, traffic instructions, or sales orders. Each type of document is processed the same way, with no limit to the variety of layouts.
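
One simple way to picture layout-independent table extraction is sketched below in Python. It assumes OCR tokens with page coordinates and one token per cell; the `Token` type and the `extract_table` function are hypothetical, and the nearest-header column assignment is a stand-in technique rather than the product's method.

```python
from dataclasses import dataclass


@dataclass
class Token:
    text: str
    x: float  # left edge on the page
    y: float  # top edge on the page


def extract_table(tokens, headers, line_tolerance=5.0):
    """Build row dicts from tokens below the header row, keyed by nearest header."""
    header_tokens = [t for t in tokens if t.text in headers]
    if not header_tokens:
        return []
    header_y = max(t.y for t in header_tokens)
    body = sorted(
        (t for t in tokens if t.y > header_y + line_tolerance),
        key=lambda t: (t.y, t.x),
    )
    rows, current, current_y = [], {}, None
    for tok in body:
        # A vertical jump larger than the tolerance starts a new row.
        if current_y is not None and abs(tok.y - current_y) > line_tolerance:
            rows.append(current)
            current = {}
        column = min(header_tokens, key=lambda h: abs(h.x - tok.x)).text
        current[column] = tok.text
        current_y = tok.y
    if current:
        rows.append(current)
    return rows
```

The column positions are taken from wherever the header words happen to be on the page, so the same code handles tables whose columns appear at different positions or in a different order.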

Application Use Scenarios
Square 9 Advanced Capture can be used in many business applications, but it is typically used in a few specific verticals: accounts payable (AP), healthcare, mortgage, media broadcasting, and manufacturing.

■ Automate AP invoice, EOB and sales order entry – capture sales orders from fax, mail, email or print and automate classification, data entry, integration with internal line-of-business systems, and document management archiving.

■ Automate traffic instructions – media outlets such as cable, radio and satellite receive large volumes of traffic instructions, which tell the media provider which commercial to run, for how long, on what station and at what frequency.

■ Document classification and distribution in a mail system.

■ Extraction of business process relevant data from documents.

■ Capturing unstructured forms, surveys, assessments and exams.

Use Case
• By improving the speed and efficiency with which a company captures and processes documents such as patient Explanation of Benefits (EOB) information, Square 9 Advanced Capture can produce measurable benefits at every level of the organization. The benefits of unstructured data capture include additional revenue as well as savings from reduced costs. The savings are real, hard-cost dollars that come from reducing a large data entry staff on both the payables side and the document management side.

Hard-dollar benefits include:
Labor savings
• Reduced need for data entry operators, document prep workers, sorters, etc.
• Increased accuracy of data extracted from EOBs, invoices, and loan documents reduces the time spent on error correction

Improved cash management
• Faster payment postings and access to more accurate payment data mean increased control over billing and accounting processes, which leads to better overall cash flow management
• Faster, more accurate generation of critical payment data expedites Secondary Claim submissions

Rapid return on investment
• In mid- to high-volume organizations, the payback period on unstructured data capture is less than a year.

Business Case
• One of the largest healthcare providers on the West Coast, spread across nine states, incurred an enormous cost processing and indexing EOBs into its existing MedSeries4 patient billing adjudication software and document management software.

• Given the sheer number of EOBs the hospital processed daily, its 20 FTEs were being augmented with 15 full-time temps.

• The bottleneck in the process was indexing each EOB separately into the host document management system and into MedSeries4. Additionally, because EOB layouts were wide ranging and difficult to read, there was no good way to index EOBs accurately into the document management solution, so the healthcare provider settled on an inefficient full-text index schema. As a result, thousands of EOBs were lost each day, and FTE productivity was further reduced by the full-text engine, which required 1.5-2.0 minutes of processing time per EOB.

• Even with 20 FTEs and 15 full-time temps, the healthcare provider still had a significant backlog of EOBs, which required overtime night and weekend processing and escalated staff costs. A strategic assessment estimated that 50 FTEs, at a cost of $160,000 per month, were needed to stabilize processing of the current EOB backlog, and this estimate did not include future growth.

• The patient financial services (PFS) department decided to implement an unstructured data capture solution to fully address the patient billing backlog.

• The solution paid for itself in 1.5 months and required a PFS FTE count of only 6, down from 20, with no additional staff augmentation.
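
The staffing figures in this business case can be put side by side with a short calculation. The sketch below uses only the numbers quoted above and assumes, purely for illustration, that staffing cost scales linearly with headcount; the cost of the 15 temps and the price of the solution are not stated, so they are left out.

```python
# Figures taken from the business case above.
projected_ftes = 50
projected_monthly_cost = 160_000  # USD per month for the 50-FTE estimate
ftes_after = 6

# Assumption: cost scales linearly with headcount.
cost_per_fte = projected_monthly_cost / projected_ftes
monthly_cost_after = cost_per_fte * ftes_after
monthly_saving = projected_monthly_cost - monthly_cost_after

print(f"Cost per FTE:               ${cost_per_fte:,.0f} per month")
print(f"Cost with {ftes_after} FTEs:           ${monthly_cost_after:,.0f} per month")
print(f"Saving vs. 50-FTE estimate: ${monthly_saving:,.0f} per month")
```

Under that linear assumption, each FTE costs $3,200 per month, six FTEs cost $19,200 per month, and the difference from the 50-FTE estimate is $140,800 per month.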


Key Points
• Reduce hard costs – FTEs, benefits, temp or part-time workers, office space
• Reduce soft costs – productivity, efficiency related to processing paper
• Non-template based
• Solution can learn from mistakes
• Supports table line-item data extraction
• Supports page detection and separation
• Seamless data output to most any line-of-business system
• Supports ODBC data updates and logic
• Reads all ODBC data into memory for fast processing
• Solution can poll a directory for files to process
• Front-end capture through Kofax Capture or Captiva Input Accel
• Pre-packaged with AP Invoice and EOB rule trees

Target Market
• Healthcare
• Accounts payable (AP)
• Banks and mortgage lenders
• Media outlets (TV, radio)
• Manufacturing

Target Business Processes
• EOB (explanation of benefits)
• Insurance claims
• HCFA
• Invoices
• Mortgage documents
• Traffic instructions
• Applications
• Hospital admitting forms
• Remittance
• Sales order processing

Questions to Ask
• How many documents do you process per month?
• How much time is spent per document to file into a cabinet or DMS?
• How many FTE data entry personnel do you employ to handle data entry and capture?
• Is the number of FTEs sufficient to keep up with demand?
• At what rate does your business plan to grow over the next 1, 3 and 5 years?
• What is the current error rate?
• Is there adjudication software that requires entry?
• If so, is double data entry performed?
• Is there an approval associated with the business process?
• If healthcare, what percentage of EOBs are submitted in paper form?
• How many insurance carriers do you deal with?
• What percentage of documents is received via each channel: fax, mail, EDI, phone, web?

