Webpage Segmentation repository
Segmentation details
GOSH:blog:20190617155939:5:SEG-2538:BOM:chrome
URL
Original:
http://twitterthecomic.tumblr.com
In cache:
http://twitterthecomic.tumblr.com
Dataset code
20190617155939
Algorithm
BOM
Browser
chrome
Geometry
685x12593
Category
Google Search - blog
Granularity
5
Word count
584
Taken
2014-09-17 19:56:33
From
132.227.204.64
Screenshot
BId
Block geometry
Gran.
Label
Elem.
Words
Imp.
Text
Density
Images
x
y
w
h
L4
0.00
8.00
685.00
218.00
0
16
16
1
0.00
L19
0.00
218.00
685.00
12,552.00
0
501
501
25
0.00
L356
185.00
12,572.00
685.00
12,593.00
0
3
3
0
0.00