Multiscale Page Segmentation using Wavelet Packet Analysis

##article.authors##

  • Przemysław Górecki
  • Laura Caponetti
  • Ciro Castiello

##semicolon##

https://doi.org/10.1685/

##article.abstract##

In this paper, a novel method for document page segmentation using Wavelet Packet analysis is proposed. To discriminate between text and non-text regions, the image is represented by means of a wavelet packet analysis tree. Successively a feature image is introduced to synthetize the information related to some nodes selected from the quadtree. The most discriminant nodes are derived using an optimality criterion and a genetic algorithm. Finally the selected feature image is segmented by means of a Fuzzy C-Means clustering. The approach provides good segmentation results and shows to be invariant to page skew and font variations. [DOI: 10.1685 / CSC06090] About DOI

##submission.downloads##

##submissions.published##

2007-10-01

##issue.issue##

##section.section##

Articles