Selection of Japanese-English Equivalents by Integrating High-quality Corpora and Huge Amounts of Web Data.
Qing Ma, Koichi Nakao, Masaki Murata, Hitoshi Isahara
Published in: LREC (2008)
Keyphrases
- web data
- huge amounts
- high quality
- web mining
- web content
- massive amounts
- semi-structured
- web pages
- web information
- statistical machine translation
- web usage mining
- web sources
- natural language
- web documents
- incremental mining
- native speakers
- parallel corpus
- natural language processing
- query logs
- target language
- machine learning
- web information extraction
- Chinese-English
- cross language information retrieval
- sequential patterns