Current Issue Cover
版面分割中文本区域最佳结构表示树的生成算法

张利1, 朱颖1, 吴国威1(清华大学电子工程系,北京 100084)

摘 要
版面分割在将印刷品转换成电子版的过程中是必不可少的,而对于分割后的各区域进行理解,达到有效分类的目的显得更为重要。本文给出了一种用最佳树对Manhatan文本结构进行描述的算法,利用该算法可以满足那些单靠图象分析而解决不了的高层次版面理解要求。
关键词
An Algorithm to Establish Optimal Trees for the Description of Document Structures in Document Segmentation

()

Abstract
In order to save space of newspapers and magazines in an electric way, it is important to classify their contents automatically after they are scanned into computer. An algorithm which can establish optimal trees for the description of document structures in document segmentation is given in this paper. We can get better understanding of the document structures by using this method.
Keywords

订阅号|日报