Webpage Segmentation repository
Segmentation details
GOSH:blog:20190617155939:5:SEG-2441:GT:chrome
URL
Original:
http://twitterthecomic.tumblr.com
In cache:
http://twitterthecomic.tumblr.com
Dataset code
20190617155939
Algorithm
GT
Browser
chrome
Geometry
685x12658
Category
Google Search - blog
Granularity
5
Word count
508
Taken
2014-09-15 20:06:38
From
132.227.207.239
Screenshot
BId
Block geometry
Gran.
Label
Elem.
Words
Imp.
Text
Density
Images
x
y
w
h
G1
0.00
8.00
685.00
217.00
0
16
27
1
0.00
G2
0.00
217.00
685.00
12,566.00
0
516
272
25
0.00
G3
185.00
12,586.00
685.00
12,658.00
0
4
7
0
0.00