High-Quality Capture of Documents on a Cluttered Tabletop with a 4K Video Camera

Chelhwon Kim, Patrick Chiu and Henry Tang

Abstract

We present a novel system for detecting and capturing paper documents on a tabletop using a 4K video camera mounted overhead on pan-tilt servos. Our automated system first finds paper documents on a cluttered tabletop based on a text probability map, and then takes a sequence of high-resolution frames of the located document to reconstruct a high quality and fronto -parallel document page image. The quality of the resulting images enables OCR processing on the whole page. We performed a preliminary evaluation on a small set of 10 document pages and our proposed system achieved 98% accuracy with the open source Tesseract OCR engine.

See more details and examples in the following paper.

· C. Kim, P. Chiu and H. Tang, "High-Quality Capture of Documents on a Cluttered Tabletop with a 4K Video Camera," Proceedings of ACM DocEng 2015.
· T. Dunnigan, J. Doherty, D. Avrahami, J. Biehl, P. Chiu, C. Kim, Q. Liu, H. Tang and L. Wilcox, ”Evolution of a Tabletop Telepresence System through Art and Technology”, ACM Multimedia 2015.
DocEng15 Poster