Login / Signup
Don't read, just look: Main content extraction from web pages using visually apparent features.
Geunseong Jung
Sungjae Han
Hansung Kim
Jaehyuk Cha
Published in:
CoRR (2021)
Keyphrases
</>
feature vectors
image features
automatically extracted
high level
prior knowledge
low level
data mining
machine learning
website
case study
feature space
feature set
false positives
salient features