Unsupervised Extraction of False Friends from Parallel Bi-Texts Using the Web as a Corpus.
Svetlin NakovPreslav NakovElena PaskalevaPublished in: RANLP (2009)
Keyphrases
- web information extraction
- newspaper articles
- website
- data extraction
- information extraction
- web pages
- web applications
- natural language text
- machine learning
- web mining
- linked data
- information extraction systems
- textual features
- business intelligence
- genia corpus
- automatic extraction
- natural language
- semantic web
- keywords
- parallel implementation
- semi supervised
- web documents
- unsupervised learning
- text mining
- information sources
- training corpus
- linguistic information
- user generated content
- pos tagging
- shared memory
- web resources
- web content
- web people search