A Machine Learning Based Language Specific Web Site Crawler.
Punnawat Tadapak, Thanaphon Suebchua, Arnon Rungsawang
Published in: NBiS (2010)
Keyphrases
- website
- machine learning
- language-specific
- natural language
- web pages
- language-independent
- information extraction
- cross-lingual
- machine learning algorithms
- n-gram
- machine translation
- specific features
- learning algorithm
- text classification
- feature selection
- web users
- semi-supervised learning
- artificial intelligence
- knowledge representation
- out-of-vocabulary
- data mining
- natural language processing
- labor-intensive