Using UTF-8 to Extract Main Content of Right to Left Language Web Pages.
Hadi MohammadzadehFranz SchweiggertGholamreza NakhaeizadehPublished in: ICSOFT (1) (2011)
Keyphrases
- web pages
- web documents
- web page classification
- website
- natural language
- search engine
- programming language
- language learning
- specification language
- keywords
- web content mining
- web information extraction
- web content
- web search
- web browser
- link analysis
- web logs
- language processing
- data records
- web search engines
- google search engine
- natural language processing