Login / Signup

Finding Viable Seed URLs for Web Corpora: A Scouting Approach and Comparative Study of Available Sources.

Adrien Barbaresi
Published in: WaC@EACL (2014)
Keyphrases
  • comparative study
  • web corpora
  • information retrieval
  • data sources
  • query logs
  • query translation
  • web pages
  • information retrieval systems
  • information sources
  • machine translation