Automatic Acquisition of Parallel Corpora from Websites with Dynamic Content.

Yulia Tsvetkov, Shuly Wintner

Published in: LREC (2010)

Keyphrases

dynamic content
parallel corpora
labor intensive
website
web applications
data intensive
web server
php and mysql
machine translation
cross language information retrieval
databases
web pages
language independent
data management
cross lingual
back end
statistical machine translation
machine translation system
word pairs
fully automatic
semi automatic
database systems
search engine