Logical Structure Extraction of Form Document Images

Liu Bing, Jiang Zao, Hu Jun'an, He Yaodong, Zhao Hong (Software Center, Northeastern University, Shenyang 110006)

Abstract
In recent years, many methods for form document image analysis have been proposed in China and abroad, but few of them address the extraction of a form's logical structure. This paper therefore proposes a method for extracting the logical structure of form documents. The method consists of three steps: a global division of the whole form, a local logical structure analysis, and a second global division of the whole form. It emphasizes the combined analysis of the global and local layout structure of a document. Compared with earlier methods that determine the logical structure of a form from local analysis alone, it achieves higher recognition accuracy and can handle form documents with more complex structures.
