Implementation of Vision Based Page Segmentation algorithm in Java.
The implementation utilizes CSSBox (X)HTML/CSS rendering engine written by Radek Burget.
Description of VIPS and my implementation in my master's thesis (in Czech)
http://www.fit.vutbr.cz/study/DP/DP.php?id=14163&file=t
Original work by Microsoft
http://www.cad.zju.edu.cn/home/dengcai/VIPS/VIPS_July-2004.pdf
CSSBox
This project uses Apache Maven. Compile it by running mvn package
and run java -cp target/vips-java-*jar-with-dependencies.jar org.fit.vips.VipsTester
to start the VipsTester.
Just pass the URL of web page you want to analyze as argument to VipsTester class.
Preferences of implementation can be changed also there.