Two Papers Accepted for ICDAR 2019

Two papers have been accepted for this year’s International Conference on Document Image Analysis (ICDAR) and the accompanying GREC workshop on Graphics Image Recognition in Sydney. We will present recent work in which we trained deep fully convolutional neural networks for image segmentation on the Graphic Narrative corpus. David Dubray and Jochen Laubrock present a paper on speech balloon detection and segmentation in comics, which Jochen Laubrock and David Dubray extend to multi-class semantic segmentation.

New preprint on CNN-based speech balloon detection available

David Dubray and Jochen Laubrock just published a preprint on automatic speech balloon detection and segmentation using deep CNNs, https://arxiv.org/abs/1902.08137. The fully convolutional model, trained on our GNC annotations, achieves state-of-the-art performance on the GNC as well as the eBDtheque data sets. Such semantic segmentation of images is an interesting problem in computer vision and document analysis. Speech balloon (and caption) segmentation can also be considered an important step in building an OCR pipeline for analyzing text in graphic novels.