A Web Corpus and Word Sketches for Japanese.
Irena Srdanovic ErjavecTomaz ErjavecAdam KilgarriffPublished in: Inf. Media Technol. (2008)
Keyphrases
- web applications
- word frequencies
- word pairs
- semantic web
- website
- multiword
- newspaper articles
- web technologies
- training corpus
- statistical machine translation
- noun phrases
- web resources
- english words
- lexical features
- specific domains
- parallel corpus
- linked data
- web content
- web mining
- text corpus
- web scale
- spontaneous speech
- unknown words
- word segmentation
- word sense
- natural language text
- co occurrence
- natural language processing
- end users
- web services