Producing a Large-scale Encyclopedic Corpus over the Web.
Atsushi FujiiKatunobu ItouTetsuya IshikawaPublished in: LREC (2002)
Keyphrases
- web scale
- website
- web resources
- web pages
- chinese web
- web mining
- real world
- web data
- link analysis
- textual features
- real life
- semantic web
- small scale
- newspaper articles
- linked data
- end users
- web content
- web technologies
- specific domains
- information sources
- co occurrence
- million images
- plain text
- text corpora
- wikipedia articles
- high quality
- web search
- web users