Automated Mining Of Names Using Parallel Hindi-English Corpus.
R. Mahesh K. SinhaPublished in: ALR7@IJCNLP (2009)
Keyphrases
- person names
- proper names
- statistical machine translation
- contextual features
- machine translation
- noun phrases
- named entities
- named entity recognition
- link grammar
- comparable corpora
- language identification
- text mining
- natural language
- parallel processing
- indian languages
- broad coverage
- open domain
- data mining
- annotated corpus
- parallel corpus
- bilingual dictionaries
- data mining techniques
- wide coverage
- machine translation system
- training corpus
- parallel corpora
- contextual information
- text corpora
- spoken language
- mono lingual
- english words
- english text
- entity extraction
- sequential patterns
- conditional random fields
- semantic roles
- penn treebank
- query translation
- cross lingual
- pattern mining
- itemsets
- co occurrence
- natural language processing
- knowledge discovery
- information retrieval
- machine learning