Vision based page segmentation algorithm: Extended and perceived success


Akpinar M. E., YILMAZ Y.

13th International Conference on Web Engineering, ICWE 2013, Aalborg, Denmark, 8 - 12 July 2013, vol.8295 LNCS, pp.238-252

  • Publication Type: Conference Paper / Full Text
  • Volume: 8295 LNCS
  • DOI Number: 10.1007/978-3-319-04244-2_22
  • City: Aalborg
  • Country: Denmark
  • Page Numbers: pp.238-252
  • Keywords: Reverse engineering, User study, Web accessibility, Web page segmentation
  • Middle East Technical University Northern Cyprus Campus Affiliated: Yes

Abstract

Web pages consist of different visual segments that serve different purposes. Typical structural segments are the header, right or left columns, and the main content. Segments can also be nested, meaning that some segments contain other segments. Understanding these segments is important for properly displaying web pages on small-screen devices and in alternative forms, such as audio for screen reader users. Different techniques exist for identifying the visual segments of a web page. One successful approach is the Vision Based Page Segmentation Algorithm (VIPS), which uses both the underlying source code and the visual rendering of a web page. However, this approach has some limitations, and this paper explains how we have extended and improved VIPS and implemented it in Java. We have also conducted online user evaluations to investigate how people perceive the success of the segmentation approach and at which granularity they prefer to see a web page segmented. This paper presents the preliminary results, which show that people perceive segmentation with higher granularity as better segmentation, regardless of web page complexity. © Springer International Publishing 2013.
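
The nested structure and the notion of granularity mentioned in the abstract can be pictured with a small Java sketch. This is only an illustration under stated assumptions, not the authors' implementation and not the VIPS algorithm itself: the `Segment` class, the `segmentsAt` method, and the sample page layout are all hypothetical names and data introduced here. The sketch treats a page as a tree of segments and treats granularity as the depth to which that tree is expanded.

```java
import java.util.ArrayList;
import java.util.List;

public class SegmentTreeSketch {

    // Hypothetical segment node: a label plus any nested child segments.
    static final class Segment {
        final String label;
        final List<Segment> children = new ArrayList<>();

        Segment(String label) {
            this.label = label;
        }

        Segment add(Segment child) {
            children.add(child);
            return this;
        }
    }

    // Returns the labels of the segments visible when the tree is expanded
    // to the given level. Level 0 yields the whole page as one segment;
    // each higher level splits every segment that still has children.
    static List<String> segmentsAt(Segment root, int level) {
        List<String> out = new ArrayList<>();
        collect(root, level, out);
        return out;
    }

    private static void collect(Segment s, int remaining, List<String> out) {
        // Stop expanding at the requested depth, or when a segment has
        // no nested segments left to split into.
        if (remaining == 0 || s.children.isEmpty()) {
            out.add(s.label);
            return;
        }
        for (Segment child : s.children) {
            collect(child, remaining - 1, out);
        }
    }

    // Assumed example layout, loosely following the structural segments
    // named in the abstract (header, column, main content).
    static Segment samplePage() {
        Segment header = new Segment("header")
                .add(new Segment("logo"))
                .add(new Segment("navigation"));
        Segment main = new Segment("main content")
                .add(new Segment("article"))
                .add(new Segment("comments"));
        Segment left = new Segment("left column");
        return new Segment("page").add(header).add(main).add(left);
    }

    public static void main(String[] args) {
        Segment page = samplePage();
        System.out.println(segmentsAt(page, 1)); // coarse segmentation
        System.out.println(segmentsAt(page, 2)); // finer segmentation
    }
}
```

In this sketch, a higher level corresponds to a finer segmentation: level 1 lists the three top-level segments, while level 2 splits the header and main content into their nested parts and leaves the undivided left column as it is.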