Towards Visual Explanations for Document Table Detection using Coarse Localization Maps

Autor(en): Chowdhury, Arnab Ghosh
Atzmueller, Martin 
Herausgeber: Reuss, P.
Eisenstadt, V.
Schonborn, J.
Schafer, J.
Stichwörter: Ablation-CAM; Barlow twin; Barlow Twins; Convolutional neural networks; Deep neural networks; Document images; Domain specific; Grad-CAM; Grad-CAM++; Image enhancement; Information retrieval; Information retrieval systems; Localisation; Table detection; Tabular Information Extraction; Vision-based methods
Erscheinungsdatum: 2022
Herausgeber: CEUR-WS
Journal: CEUR Workshop Proceedings
Volumen: 3341
Startseite: 161 – 172
Zusammenfassung: 
Computer-vision-based methods using deep neural networks offer considerable opportunities to extract tabular information from richly-structured documents. However, it is extremely challenging to build a unified framework for tabular information extraction, for example, due to a variety of document templates, as well as, diverse document table templates. Earlier, we proposed a transfer learning based table detection approach[1] and a supervised table detection framework initialized with pre-trained self-supervised image classification model weight for table detection [2] on domain specific document images. In this paper, we investigate different document table detection techniques with respect to explainability issues. These enable, e. g., diagnostics and method refinement, towards a complete tabular data extraction pipeline and tool. In particular, we present visual explanation approaches of earlier proposed table detection models on domain specific document images in order to enhance the explainability of the applied Convolutional Neural Network (CNN) based models. We discuss first experimental results for visual explanations of those models and outline several challenges in this context. © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
Beschreibung: 
Cited by: 0; Conference name: 2022 Learning, Knowledge, Data, Analysis, LWDA 2022 - Workshops: Special Interest Group on Knowledge Management (FGWM), Knowledge Discovery, Data Mining, and Machine Learning (FGKD) and Special Interest Group on Database Systems (FGDB); Conference date: 5 October 2022 through 7 October 2022; Conference code: 186661
ISSN: 1613-0073
Externe URL: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85148661130&partnerID=40&md5=b784412e109a4a8be91ba3f66454d660

Show full item record

Page view(s)

6
Last Week
0
Last month
0
checked on May 18, 2024

Google ScholarTM

Check