events contact us
Search the complete PARC site
 

Intelligent Image Recognition
Enabling machines to accurately understand and classify scanned or digital document content

Meaning in documents is conveyed not only through text content, but also through visual structure reflected in layout, fonts, graphics, tables, diagrams, logos, and annotations. Though rules-based technologies assist machines in understanding document content, these approaches are often brittle when there are variations in the document collection.

PARC Solution & Approach

In contrast to the above approach, PARC researchers apply theories of perceptual document analysis – with computer vision techniques – to provide highly flexible, accurate document recognition and classification.

These techniques apply both to scanned paper documents and digital documents that may be difficult or impossible to parse (such as slide presentations and graphical web pages). Strengths of PARC's approach include:

  • automatic document classification;
  • text and graphics tagging;
  • data extraction

...all of which reduce  the need for manual operators in document processing.

Example: ScanScribe

One example of PARC’s work in intelligent image recognition is "ScanScribe" – a perceptual-based document image editor that offers:
-   intelligent grouping;
-   selection; and
-   editing of graphical objects in sketches and whiteboard diagram images.

Download the ScanScribe™ Image Editor


 

 

BUSINESS CONTACT
Lawrence Lee
Director of Business Development, Intelligent Systems Laboratory
650-812-4756
KEYWORDS

computer vision ∙ document classification ∙ image recognition ∙ perceptual image analysis

DOWNLOADS

ScanScribe™ Image Editor

RELATED WEBPAGES

Perceptual Document Analysis [researcher website]

PUBLICATIONS

Perceptual organization in semantic role labeling

ScanScribe: perceptually supported diagram creation and editing

Stylus input and editing without prior selection of mode

   

  (Logo/Homepage) PARC - Palo Alto Research Center

Copyright © 2002-2007 Palo Alto Research Center Incorporated. All Rights Reserved.
PARC, the PARC Logo, AspectJ, DataGlyph, Obje, Silx, StressedMetal, and ClawConnect
are trademarks or registered trademarks of Palo Alto Research Center Incorporated.