home › event - document content analysis for digital archives
EVENT:
Document content analysis for digital archives
Conferences & Talks
description
The critical ingredient to an effective digital archive is metadata. There are at least two major problems with metadata. First, it has to be gathered from the data items, which can be expensive and error-prone. Second, the metadata always depends on some model for what should be recorded about items, but such a model is never complete enough to satisfy the scope of uses for the archive. For these reasons, the key to releasing the potential of digital archives is automatic content analysis.
This talk will touch on the state of the art of automatic content analysis for scanned and electronic documents. Both academic research and commercial applications are driving technology developments in this field. At the current stage, however work on digital archives is not heavily resourced, so most of us with projects in this area fall under the category of "hobbiest". We can still dream up scenarios and designs for systems that will enable our archiving projects, and we can scour the landscape for camera-based document scanners, OCR, doctype classification, and other technical elements we can assemble in our garages.
upcoming events
view all

ENC Spring Summit 2013
20 May 2013 - 21 May 2013
PARC, a Xerox company
Special Event
Learning from Demonstration to be a Good Team Member in a Role-Playing Game
Jonathan Rubin, Author, Michael Youngblood, Author, Ashwin Ram, Author
22 May 2013
Conferences & Talks
Bob Metcalfe Leads a Celebration of 40 Years of Ethernet Innovation
22 May 2013 | Mountain View, CA
Conferences & Talks
Interest flooding attack and countermeasures in Named Data Networking
Priya Mahadevan
22 May 2013
Conferences & Talks
Random Acts of Kindness: The Intelligent and Context-Aware Future of Reciprocal Altruism and Community Collaboration
Victoria Bellotti, Keynote
23 May 2013 | San Diego, CA
Conferences & Talks
