home › event - document content analysis for digital archives

EVENT:

Document content analysis for digital archives
Conferences & Talks

Personal Archiving 2010

16 February 2010
San Francisco, California, USA

 

description

The critical ingredient to an effective digital archive is metadata. There are at least two major problems with metadata.  First, it has to be gathered from the data items, which can be expensive and error-prone. Second, the metadata always depends on some model for what should be recorded about items, but such a model is never complete enough to satisfy the scope of uses for the archive.  For these reasons, the key to releasing the potential of digital archives is automatic content analysis.

This talk will touch on the state of the art of automatic content analysis for scanned and electronic documents.  Both academic research and commercial applications are driving technology developments in this field.  At the current stage, however work on digital archives is not heavily resourced, so most of us with projects in this area fall under the category of "hobbiest".  We can still dream up scenarios and designs for systems that will enable our archiving projects, and we can scour the landscape for camera-based document scanners, OCR, doctype classification, and other technical elements we can assemble in our garages.

 

 

upcoming events   view all 

The Future of Making Things and the Business of Breakthroughs
Stephen Hoover, Keynote Speaker
21 April 2015 - 22 April 2015 | Montreal, Canada
Conferences & Talks  

Disruptive Technologies in Manufacturing (Opening Keynote)
Stephen Hoover
28 April 2015 | Seattle, WA
Conferences & Talks  

The First Five Kilobytes are the Hardest
George Dyson
29 April 2015 | George E. Pake Auditorium, PARC
PARC Forum  

Printed Hybrid Logic Circuits
Janos Veres
29 April 2015 | Berlin, Germany
Conferences & Talks  

IoT User Experience Design
Mike Kuniavsky
12 May 2015 | San Francisco, CA
Conferences & Talks