Information Extraction from Hindi Texts.
Kamlesh DuttaSaroj KaushikNupur PrakashPublished in: LREC (2004)
Keyphrases
- information extraction
- named entity recognition
- information extraction systems
- natural language text
- text documents
- machine translation
- natural language processing
- named entities
- natural language
- linguistic patterns
- free text
- text mining
- information retrieval
- relation extraction
- text summarization
- natural language generation
- machine learning
- precision and recall
- web mining
- conditional random fields
- structured data
- web documents
- text processing
- optical character recognition
- textual data
- proper names
- contextual features
- domain dependent
- semantic tagging
- keywords
- open domain
- semi structured
- markov random field
- maximum entropy
- text corpus
- language identification
- comparable corpora
- domain specific
- legal texts
- data sets
- relational learning
- extracting meaningful