Automatic Acquisition of Parallel Corpora from Websites with Dynamic Content.
Yulia TsvetkovShuly WintnerPublished in: LREC (2010)
Keyphrases
- dynamic content
- parallel corpora
- labor intensive
- website
- web applications
- data intensive
- web server
- php and mysql
- machine translation
- cross language information retrieval
- databases
- web pages
- language independent
- data management
- cross lingual
- back end
- statistical machine translation
- machine translation system
- word pairs
- fully automatic
- semi automatic
- database systems
- search engine