Discovering Parallel Text from the World Wide Web.
Jisong ChenRowena ChauChung-Hsing YehPublished in: ACSW (2004)
Keyphrases
- text retrieval
- automatically discovering
- machine learning
- database
- html pages
- automatically extracting
- parallel implementation
- researching on the internet
- parallel computation
- information retrieval
- text mining
- web pages
- semantic information
- feature selection
- parallel programming
- document analysis
- automatically extracted
- parallel computing
- keywords
- text data
- web documents
- text analysis
- text processing
- website
- shared memory
- free text
- learning algorithm