EuroGOV: Engineering a Multilingual Web Corpus.
Börkur SigurbjörnssonJaap KampsMaarten de RijkePublished in: CLEF (Working Notes) (2005)
Keyphrases
- multi lingual
- web applications
- multilingual documents
- website
- information access
- log analysis
- link analysis
- web documents
- web data
- software engineering
- specific domains
- information sources
- artificial intelligence
- newspaper articles
- web scale
- engineering design
- web content
- web mining
- digital libraries
- computer science
- web pages
- linked data
- web users
- text categorization
- web resources
- semantic web
- information extraction
- manually annotated
- end users
- plain text
- textual features
- information retrieval