Data-Driven Approach to Identification of Latin Phrases in Russian Web-Crawled Corpora.
Vladimír BenkoKatarina RausovaPublished in: IMS (CLCO) (2020)
Keyphrases
- web pages
- web applications
- website
- data driven
- semantic web
- information sources
- web crawling
- web scale
- web mining
- text classification
- web documents
- information access
- web resources
- optical character recognition
- web communities
- multi lingual
- end users
- text mining
- web content
- web data
- web users
- link analysis
- natural language processing